Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Archie Chaudhury
Karma:
53
All
Posts
Comments
New
Top
Old
The slow death of the accelerationist.
Archie Chaudhury
6 Apr 2026 3:40 UTC
3
points
1
comment
5
min read
LW
link
Teaching Models to Dream of Better Monitors through Evaluator Conditioned Training
Alec Harris
,
Kasey C
,
Archie Chaudhury
and
yix
19 Mar 2026 21:01 UTC
41
points
2
comments
10
min read
LW
link
A Rational Proposal
Archie Chaudhury
26 Jan 2026 20:22 UTC
−2
points
0
comments
14
min read
LW
link
Alignment may be localized: a short (and albeitly limited) experiment
Archie Chaudhury
24 Nov 2025 17:48 UTC
18
points
0
comments
5
min read
LW
link
Interpretability is the best path to alignment
Archie Chaudhury
5 Sep 2025 4:37 UTC
2
points
6
comments
5
min read
LW
link
Steering Vectors Can Help LLM Judges Detect Subtle Dishonesty
Leon Eshuijs
,
mcbeth
,
Etha
and
Archie Chaudhury
3 Jun 2025 20:33 UTC
12
points
1
comment
5
min read
LW
link
Arch223′s Shortform
Archie Chaudhury
18 Nov 2024 1:54 UTC
1
point
1
comment
1
min read
LW
link
Back to top