neverix comments on neverix’s Shortform

neverix 24 Jun 2026 15:55 UTC
9 points
0
Unstable Features, Reproducible Subspaces: Understanding Seed Dependence in Sparse Autoencoders
Some support for the hypothesis that SAE feature instability is caused by the autoencoder tiling a manifold in unique ways. Doesn’t attempt to actually find and describe the manifold, but suggests doing so would be worthwhile.