Lee Sharkey comments on Efficient Dictionary Learning with Switch Sparse Autoencoders

Lee Sharkey 26 Jul 2024 11:06 UTC
3 points
0
Both of these seem like interesting directions (I had parameters in mind, but params and activations are too closely linked to ignore one or the other). And I don’t have a super clear idea but something like representational similarity analysis between SwitchSAEs and regular SAEs could be interesting. This is just one possibility of many though. I haven’t thought about it for long enough to be able to list many more, but it feels like a direction with low hanging fruit for sure. For papers, here’s a good place to start for RSA: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3730178/
- phenomanon 26 Jul 2024 22:19 UTC
  1 point
  0
  Parent
  Thank you very much for your reply—I appreciate the commentary and direction