fakeanalyst
Karma: 2
fakeanalyst
31 Jul 2025 18:33 UTC
3 points
0
in reply to: Buck’s comment on: Buck’s Shortform
The usefulness of interpretability research
fakeanalyst
16 May 2025 20:40 UTC
1 point
0
on: Generating the Funniest Joke with RL (according to GPT-4.1)
Goodhart’s law!