I found this hard to read. Can you give a concrete example of what you mean? Preferably with a specific prompt + what you think the model should be doing
This is powerful evidence that even though models are trained to output one word at a time, they may think on much longer horizons to do so.
That line is from Anthropic's most recent release, and it was mainly what prompted the thought.
I was trying to work out how that behaviour actually shows up in practice.