Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
hrdkbhatnagar
Karma:
145
All
Posts
Comments
New
Top
Old
(Not) Explaining GPT-2-Small Forward Passes with Edge-Level Autoencoder Circuits
David Udell
,
hrdkbhatnagar
and
JacksonKaunismaa
22 Jul 2025 20:36 UTC
23
points
0
comments
6
min read
LW
link
Compositionality and Ambiguity: Latent Co-occurrence and Interpretable Subspaces
Matthew A. Clarke
,
hrdkbhatnagar
and
Joseph Bloom
20 Dec 2024 15:16 UTC
34
points
0
comments
37
min read
LW
link
Toy Models of Feature Absorption in SAEs
chanind
,
hrdkbhatnagar
,
TomasD
and
Joseph Bloom
7 Oct 2024 9:56 UTC
49
points
8
comments
10
min read
LW
link
[Paper] A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders
chanind
,
TomasD
,
hrdkbhatnagar
and
Joseph Bloom
25 Sep 2024 9:31 UTC
73
points
16
comments
3
min read
LW
link
(arxiv.org)
Back to top