Rohin Shah comments on How can Interpretability help Alignment?