Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
bilalchughtai comments on
Activation space interpretability may be doomed
bilalchughtai
12 Jan 2025 19:24 UTC
5
points
4
Also related to the idea that the best linear SAE encoder is not the transpose of the decoder.
Back to top
Also related to the idea that the best linear SAE encoder is not the transpose of the decoder.