Burny comments on Jesse Hoogland’s Shortform
Burny
22 Jan 2025 3:36 UTC
9
points
2
No MCTS, no PRM...
scaling up CoT with simple RL and scalar rewards...
emergent behaviour