anaguma comments on It turns out that DNNs are remarkably interpretable.

anaguma 4 Aug 2025 22:30 UTC
1 point
2
Idk, this seem fairly limited. I’d be excited for work which extends this beyond image models.
- Maciej Satkiewicz 4 Aug 2025 22:50 UTC
  2 points
  0
  Parent
  That’s fair—the current work focuses on vision models. But I’d argue it provides a concrete mechanism that could generalize to other domains. The idea that a hidden linear model lives within the activation structure isn’t specific to images; what’s needed next is to recover and validate it in text or multimodal settings.
  
  In other words: yes, it’s scoped, but it opens a door.