Charlie Steiner comments on Laying the Foundations for Vision and Multimodal Mechanistic Interpretability & Open Problems