Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Evan R. Murphy comments on
Training on Documents About Reward Hacking Induces Reward Hacking
Evan R. Murphy
22 Jan 2025 19:45 UTC
LW: 8 AF: 5
10
AF
Y’all are on fire recently with this and the alignment faking paper.
Back to top
Y’all are on fire recently with this and the alignment faking paper.