Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Benjamin Arnav
Karma:
106
https://benjaminarnav.com
All
Posts
Comments
New
Top
Old
Ensemble monitoring for AI control: diverse signals outweigh more compute
Yejun Y.
,
Sam Tetef
,
eugenekoran
,
Benjamin Arnav
and
Pablo Bernabeu-Pérez
31 May 2026 1:21 UTC
12
points
0
comments
7
min read
LW
link
Toward a Better Evaluations Ecosystem
Benjamin Arnav
5 May 2026 22:29 UTC
24
points
0
comments
5
min read
LW
link
Unfaithful Reasoning Can Fool Chain-of-Thought Monitoring
Benjamin Arnav
,
Pablo Bernabeu-Pérez
,
Tim Kostolansky
,
HanneWhitt
,
Nathan Helm-Burger
and
Mary Phuong
2 Jun 2025 19:08 UTC
78
points
17
comments
3
min read
LW
link
Back to top