Buck comments on Buck’s Shortform

Buck 27 Mar 2026 23:21 UTC
LW: 18 AF: 8
3
AF
- OAI models rely more on CoT for their capabilities. E.g. their benchmark scores with and without CoT are more different.
- Anthropic models treat their CoT less differently from their output than OAI models do. This means that RL probably pressures their CoT more. See here.