evhub comments on Training on Documents About Reward Hacking Induces Reward Hacking
evhub
22 Jan 2025 1:46 UTC
LW: 8 AF: 6
I’m definitely very interested in trying to test that sort of conjecture!