Caleb Biddulph comments on "Why not train reasoning models with RLHF?"
Caleb Biddulph
30 Jan 2025 21:12 UTC
Yeah, but there are probably other interesting takeaways.