RSS

hrdkbhatnagar

Karma: 145

(Not) Ex­plain­ing GPT-2-Small For­ward Passes with Edge-Level Au­toen­coder Circuits

22 Jul 2025 20:36 UTC
23 points
0 comments6 min readLW link

Com­po­si­tion­al­ity and Am­bi­guity: La­tent Co-oc­cur­rence and In­ter­pretable Subspaces

20 Dec 2024 15:16 UTC
34 points
0 comments37 min readLW link

Toy Models of Fea­ture Ab­sorp­tion in SAEs

7 Oct 2024 9:56 UTC
49 points
8 comments10 min readLW link

[Paper] A is for Ab­sorp­tion: Study­ing Fea­ture Split­ting and Ab­sorp­tion in Sparse Autoencoders

25 Sep 2024 9:31 UTC
73 points
16 comments3 min readLW link
(arxiv.org)