I think this inability of “learning while thinking” might be the key missing thing of LLMs and I am not sure “thought assessment” or “sequential reasoning” are not red herrings compared to this. What good is assessment of thoughts if you are fundamentally limited in changing them? Also, reasoning models seem to do sequential reasoning just fine as long as they already have learned all the necessary concepts.
I think this inability of “learning while thinking” might be the key missing thing of LLMs and I am not sure “thought assessment” or “sequential reasoning” are not red herrings compared to this. What good is assessment of thoughts if you are fundamentally limited in changing them? Also, reasoning models seem to do sequential reasoning just fine as long as they already have learned all the necessary concepts.