RSS

RGRGRG

Karma: 138

Cross-Layer Transcoders are in­cen­tivized to learn Un­faith­ful Circuits

2 Feb 2026 21:32 UTC
40 points
6 comments18 min readLW link

Alter­na­tive Models of Superposition

11 Aug 2025 15:52 UTC
20 points
6 comments5 min readLW link

Seek­ing Feed­back on My Mechanis­tic In­ter­pretabil­ity Re­search Agenda

RGRGRG12 Sep 2023 18:45 UTC
5 points
1 comment3 min readLW link