Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Elle Najt
Karma:
121
All
Posts
Comments
New
Top
Old
Prompted CoT Early Exit Undermines the Monitoring Benefits of CoT Uncontrollability
Elle Najt
,
Asa Cooper Stickland
and
Xander Davies
17 Apr 2026 19:30 UTC
72
points
6
comments
15
min read
LW
link
How Unmonitored External Agents can Sabotage AI labs
Elle Najt
and
Fabien Roger
9 Apr 2026 18:07 UTC
23
points
0
comments
9
min read
LW
link
Opus’s Schelling Steganography Has Amplifiable Secrecy Against Weaker Eavesdroppers
Elle Najt
7 Apr 2026 6:01 UTC
33
points
2
comments
36
min read
LW
link
The Goldborg Variations: Algorave Attractor States of LLMs
Elle Najt
1 Mar 2026 4:36 UTC
7
points
0
comments
7
min read
LW
link
Back to top