Yes, but it still makes sense to me. RL and ‘LLM thinking’ are pretty basic heuristics at this point, and I don’t expect them to circumvent the exponential running-time requirements of many of the problems they implicitly solve at the same time.