beren comments on The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable