mivanitskiy
Karma: 145
https://mivanit.github.io
[Linkpost] Interpreting Language Model Parameters
Lucius Bushnaq, Dan Braun, Oliver Clive-Griffin, Bart Bussmann, Nathan Hu, mivanitskiy, Linda Linsefors and Lee Sharkey
5 May 2026 17:37 UTC
127 points, 2 comments, 2 min read
LW link (www.goodfire.ai)
Understanding mesa-optimization using toy models
tilmanr, rusheb, Guillaume Corlouer, Dan Valentine, afspies, mivanitskiy and Can
7 May 2023 17:00 UTC
46 points, 6 comments, 10 min read
LW link