Aditya Shrivastava
Karma: 30
3 Challenges and 2 Hopes for the Safety of Unsupervised Elicitation
Callum Canavan, Aditya Shrivastava, Allison Qi, Jonathan Michala and Fabien Roger
27 Feb 2026 17:25 UTC, 21 points, 0 comments, 10 min read
Eliciting base models with simple unsupervised techniques
Callum Canavan, Aditya Shrivastava, Allison Qi, Tianyi (Alex) Qiu, Jonathan Michala and Fabien Roger
23 Jan 2026 18:06 UTC, 34 points, 2 comments, 8 min read