Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Oliver Clive-Griffin
Karma:
158
All
Posts
Comments
New
Top
Old
[Linkpost] Interpreting Language Model Parameters
Lucius Bushnaq
,
Dan Braun
,
Oliver Clive-Griffin
,
Bart Bussmann
,
Nathan Hu
,
mivanitskiy
,
Linda Linsefors
and
Lee Sharkey
5 May 2026 17:37 UTC
162
points
2
comments
2
min read
LW
link
(www.goodfire.ai)
[Replication] Crosscoder-based Stage-Wise Model Diffing
Anna Soligo
,
Thomas Read
,
Oliver Clive-Griffin
,
dmanningcoe
,
Chun Hei Yip
,
rajashree
and
Jason Gross
22 Mar 2025 18:35 UTC
25
points
0
comments
7
min read
LW
link
Back to top