Rohin Shah comments on [AN #148]: Analyzing generalization across more axes than just accuracy or loss