Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Alan Cooney
Karma:
119
All
Posts
Comments
New
Top
Old
“Did you lie?” Evaluating Lie Detectors across Model Scale and Belief-Verified Model Organisms
Alan Cooney
,
David Africa
and
Geoffrey Irving
17 Jun 2026 18:43 UTC
32
points
0
comments
6
min read
LW
link
(arxiv.org)
vLLM-Lens: Fast Interpretability Tooling That Scales to Trillion-Parameter Models
Alan Cooney
and
Sid Black
23 Apr 2026 19:13 UTC
76
points
0
comments
5
min read
LW
link
Research Areas in AI Control (The Alignment Project by UK AISI)
Julian Stastny
,
Tomek Korbak
,
Mojmir
,
Buck
and
Alan Cooney
1 Aug 2025 10:27 UTC
25
points
0
comments
18
min read
LW
link
(alignmentproject.aisi.gov.uk)
Back to top