Oliver Daniels comments on
Training a Reward Hacker Despite Perfect Labels
Oliver Daniels
17 Aug 2025 22:46 UTC
Yup, the latter (the post-recontextualized-training model).