[Question] Favorite / most obscure research on understanding DNNs?

Looking for:

  1. Any results / lines of research I haven’t heard of in areas like interpretability, inductive biases, and ML theory

  2. Obscure papers or posts on things like double descent, simplicity bias, induction heads, and grokking
