Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
AdeOlu
Karma:
0
Interested in mechanistic interpretability, linguistic philosophy, agents, graphs and boxing.
All
Posts
Comments
New
Top
Old
Abstention Geometry: Knowledge and Behaviour Are Dissociable in Llama 3.1 8B
AdeOlu
31 May 2026 10:41 UTC
1
point
0
comments
9
min read
LW
link
Back to top