Douglas Summers-Stay comments on Measuring no CoT math time horizon (single forward pass)

Douglas Summers-Stay 1 Jan 2026 12:22 UTC
2 points
0
I’m not sure this constraint (single forward pass as opposed to reasoning-enabled) is very meaningful. If it is still doing the reasoning as it answers the question, does it make a big difference whether that is enclosed by <thinking></thinking> tags? I would be interested to see a version of this where the answer has to be the first token output. That would show how much thinking it can do in one step.
- ryan_greenblatt 1 Jan 2026 13:32 UTC
  5 points
  2
  Parent
  The model can not output any tokens before answering, it has to respond immediately. This is what I mean by “no CoT”.