Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Chris Lakin answers
Examples of self-fulfilling prophecies in AI alignment?
Chris Lakin
3 Mar 2025 2:46 UTC
8
points
0
Training on Documents About Reward Hacking Induces Reward Hacking
Back to top
Training on Documents About Reward Hacking Induces Reward Hacking