Charlie George
Karma: 5
Using mechanistic interpretability to find in-distribution failure in toy transformers
Charlie George
28 Nov 2022 19:39 UTC
6 points, 0 comments, 4 min read