That’s fair—the current work focuses on vision models. But I’d argue it provides a concrete mechanism that could generalize to other domains. The idea that a hidden linear model lives within the activation structure isn’t specific to images; what’s needed next is to recover and validate it in text or multimodal settings.
In other words: yes, it’s scoped, but it opens a door.
Idk, this seem fairly limited. I’d be excited for work which extends this beyond image models.
That’s fair—the current work focuses on vision models. But I’d argue it provides a concrete mechanism that could generalize to other domains. The idea that a hidden linear model lives within the activation structure isn’t specific to images; what’s needed next is to recover and validate it in text or multimodal settings.
In other words: yes, it’s scoped, but it opens a door.