Lucius Bushnaq comments on Taking the parameters which seem to matter and rotating them until they don’t

Lucius Bushnaq 1 Sep 2022 12:03 UTC
2 points
0
Sure, but that’s not a question I’m primarily interested in. I don’t want the most interpretable basis, I want the basis that network itself uses for thinking. My goal is to find the elementary unit of neural networks, to build theorems and eventually a whole predictive theory of neural network computation and selection on top of.
That this may possibly make current networks more human-interpretable even in the short run is just a neat side benefit to me.
- Tom Lieberum 1 Sep 2022 13:09 UTC
  1 point
  0
  Parent
  Ah, I might have misunderstood your original point then, sorry!
  I’m not sure what you mean by “basis” then. How strictly are you using this term?
  I imagine you are basically going down the “features as elementary unit” route proposed in Circuits (although you might not be pre-disposed to assume features are the elementary unit).Finding the set of features used by the network and figuring out how its using them in its computations does not 1-to-1 translate to “find the basis the network is thinking in” in my mind.
  - Lucius Bushnaq 1 Sep 2022 14:27 UTC
    5 points
    1
    Parent
    I imagine you are basically going down the “features as elementary unit” route proposed in Circuits (although you might not be pre-disposed to assume features are the elementary unit).Finding the set of features used by the network and figuring out how its using them in its computations does not 1-to-1 translate to “find the basis the network is thinking in” in my mind.
    Fair enough, imprecise use of language. For some definitions of “thinking” I’d guess a small vision CNN isn’t thinking anything.