ryan_greenblatt comments on Recent LLMs can use filler tokens or problem repeats to improve (no-CoT) math performance

ryan_greenblatt 22 Dec 2025 22:14 UTC
LW: 45 AF: 25
0
AF
Recall that without filler, Opus 4.5 performance is 45.2%. I tried the following experiments on Opus 4.5 with filler counting to 300:
- Default (what I do by default in the blog post above): 51.1%
- Remove the text explaining filler tokens (as in, cut “After the problem …”): 50.4%
- Use “After the problem, there will be distractor tokens (counting from 1 to {filler_tokens})”: 51.1%
- Don’t actually use filler tokens, but include in the prompt “After the problem, there will be filler tokens (counting from 1 to 300) to give you extra space to process the problem before answering” (as in, this is just a lie, we don’t give filler): 45.8%
So, it seems like the framing doesn’t matter ~at all and actually having the filler tokens is the key thing (at least for Opus 4.5, though I strongly expect this would reproduce for Opus 4, Sonnet 4).
- Gurkenglas 23 Dec 2025 22:35 UTC
  9 points
  2
  Parent
  Please try not to lie to the models. You can truthfully say “After the problem, there will be a [p]% chance of filler tokens (counting from 1 to 300) to give you extra space to process the problem before answering.” and do observational statistics.
  What links here?
  - Gurkenglas's comment on Do Models Continue Misaligned Actions? [eval] by Jordan Taylor (10 Feb 2026 0:21 UTC; 2 points)