"This is powerful evidence that even though models are trained to output one word at a time, they may think on much longer horizons to do so."

That line, from Anthropic's most recent release, was the main thought on my mind.
I was trying to reconcile that claim with how the behaviour actually shows up.
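For concreteness, here's a minimal sketch (my own, assuming a standard PyTorch setup, not anything from the release itself) of what "trained to output one word at a time" means: the loss at each position only ever scores the single next token, so any longer-horizon planning has to live in the model's internal state rather than in the training objective.

```python
# Minimal sketch of the standard next-token objective. Nothing here looks
# more than one step ahead -- that's the whole point of the quote.
import torch
import torch.nn.functional as F

def next_token_loss(logits: torch.Tensor, tokens: torch.Tensor) -> torch.Tensor:
    """Autoregressive cross-entropy.

    logits: (batch, seq_len, vocab) model outputs
    tokens: (batch, seq_len) input token ids
    The target at position t is just the token at position t+1.
    """
    pred = logits[:, :-1, :].reshape(-1, logits.size(-1))  # predictions for steps 0..T-2
    target = tokens[:, 1:].reshape(-1)                     # the very next token at each step
    return F.cross_entropy(pred, target)

# Toy usage with random data, just to show the shapes.
batch, seq_len, vocab = 2, 8, 100
logits = torch.randn(batch, seq_len, vocab)
tokens = torch.randint(0, vocab, (batch, seq_len))
print(next_token_loss(logits, tokens))
```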