We study to what extent LLMs can verbalize their internal reasoning.
Or, to be more accurate, you study to what extent two quite small, non-reasoning-trained, fairly recent dense models can do this. Whether your results would generalize to other LLMs is quite unclear — I strongly suspect reasoning models might do a lot better at this, perhaps even if they were initially run with CoT off.
I agree that training on more LLMs would be good. However, I would like to note that gpt-oss-20b, one of the models we train on, is a sparse MoE reasoning model, released half a year ago.
My apologies, I stand corrected. Then your model choice makes a lot more sense than I thought, and I am now more surprised by your results.