Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Aaryan Chandna
Karma:
28
All
Posts
Comments
New
Top
Old
Aaryan Chandna
30 Nov 2025 18:43 UTC
1
point
0
on:
Natural emergent misalignment from reward hacking in production RL
Interesting, would be cool to know which base model was used!
Back to top
Interesting, would be cool to know which base model was used!