Zack_M_Davis comments on faul_sname’s Shortform

Zack_M_Davis 23 Oct 2025 6:00 UTC
2 points
0
The Sonnet 4.5 system card reiterates the “most thought processes are short enough to display in full” claim that you quote:

As with Claude Sonnet 4 and Claude Opus 4, thought processes from Claude Sonnet 4.5 are summarized by an additional, smaller model if they extend beyond a certain point (that is, after this point the “raw” thought process is no longer shown to the user). However, this happens in only a very small minority of cases: the vast majority of thought processes are shown in full.

But it is intriguing that the displayed Claude CoTs are so legible and “non-weird” compared to what we see from DeepSeek and ChatGPT. Is Anthropic using a significantly different (perhaps less RL-heavy) post-training setup?