Jiaxin Wen comments on
Auditing language models for hidden objectives
Jiaxin Wen
4 Apr 2025 21:20 UTC
1 point
Interesting! Do you mean the experiments in Sec 3.9.2?