Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Joseph Bejjani
Karma:
0
CS student at Harvard. AI alignment at the Kempner Institute.
All
Posts
Comments
New
Top
Old
[CS2881r][Week 8] When Agents Prefer Hacking To Failure: Evaluating Misalignment Under Pressure
Joseph Bejjani
,
Itamar Rocha Filho
,
Haichuan Wang
and
Zidi Xiong
7 Nov 2025 5:45 UTC
2
points
0
comments
23
min read
LW
link
Back to top