Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Bartosz Cywiński
Karma:
54
MATS 8.0 scholar with Arthur Conmy and Sam Marks
All
Posts
Comments
New
Top
Old
Eliciting secret knowledge from language models
Bartosz Cywiński
,
Arthur Conmy
and
Sam Marks
2 Oct 2025 20:57 UTC
68
points
3
comments
2
min read
LW
link
(arxiv.org)
Back to top