I think this post makes a true and important point, a point that I also bring up from time to time.
I do have a complaint though: I think the title (“Deep Learning Systems Are Not Less Interpretable Than Logic/Probability/Etc”) is too strong. (This came up multiple times in the comments.)
In particular, suppose it takes N unlabeled parameters to solve a problem with deep learning, and it takes M unlabeled parameters to solve the same problem with probabilistic programming. And suppose that M<N, or even M<<N, which I think is generally plausible.
If Person X notices that M<<N, and then declares “deep learning is less interpretable than probabilistic programming”, well, that’s not a crazy thing for them to say. And if M=5 and N=5000, then I think Person X is obviously correct, and the OP title is wrong. On the other hand, if M is a trillion and N is a quadrillion, then presumably neither system is interpretable in practice; Person X’s statement “deep learning is less interpretable than probabilistic programming” may still be literally true on some level, but it gives the wrong impression, and the OP title is perhaps more appropriate.
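To make the M=5 vs. N=5000 contrast concrete, here’s a minimal sketch (the model, parameter names, and numbers are my own illustration, not from the post): a toy probabilistic program where each of the five parameters has a human-readable meaning, next to a deep-learning-style parameter vector where no individual number means anything on its own.

```python
import numpy as np

# Hypothetical probabilistic program: M = 5 parameters, each with a
# human-interpretable label (illustrative names, invented for this example).
prob_model_params = {
    "infection_rate": 0.03,       # P(infected) in the population
    "test_sensitivity": 0.95,     # P(test positive | infected)
    "test_specificity": 0.98,     # P(test negative | not infected)
    "retest_prob": 0.50,          # P(retest | first test positive)
    "reporting_delay_days": 2.0,  # mean delay before a case is logged
}

# Deep-learning solution to the same task: N = 5000 unlabeled weights.
# No single entry has a standalone meaning; any interpretability has to
# be recovered (if at all) by analyzing the trained network as a whole.
rng = np.random.default_rng(0)
deep_model_params = rng.normal(size=5000)

print(len(prob_model_params), "labeled parameters vs.",
      deep_model_params.size, "unlabeled ones")
```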
Anyway, I think a more defensible title would have been “Logic / Probability / Etc. Systems can be giant inscrutable messes too”, or something like that.
Better yet, the text could have explicitly drawn a distinction between what probabilistic programming systems typically look like today (i.e., a handful of human-interpretable parameters), and what they would look like if they were scaled to AGI (i.e., billions of unlabeled nodes and connections inferred from data, or so I would argue).