Adam Jermyn comments on The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable

Adam Jermyn 30 Nov 2022 22:26 UTC
LW: 6 AF: 4
0
AF
This is really interesting! One extension that comes to mind: SVD will never recover a Johnson-Lindenstrauss packing, because SVD can only return as many vectors as the rank of the relevant matrix. But you can do sparse coding to e.g. construct an overcomplete basis of vectors such that typical samples are sparse combinations of those vectors. Have you tried/considered trying something like that?
- beren 2 Dec 2022 17:19 UTC
  2 points
  0
  Parent
  Yes, this is correct. SVD necessarily won’t recover the full JL packing. Given that we don’t know the extent to which the network uses the full JL capacity, then SVD might still get a reasonable fraction of the relevant directions. Also, if the network packs semantically similar vectors close to one another, then the SVD direction might also represent some kind of useful average of them.
  Indeed, we are looking at sparse coding to try to construct an over complete basis, as a parallel project. Stay tuned for this.