Neural networks are intrinsically biased towards simpler solutions.
Am I correct in thinking that being “intrinsically biased towards simpler solutions” isn’t a property of neural networks, but a property of the Bayesian learning procedure? The math in the post doesn’t use much that is specific to NNs, and it seems like the same conclusions can be drawn for any model class whose loss landscape has many minima of varying complexity?
To be precise, it is a property of singular models (a class that includes neural networks) in the Bayesian setting. There are good empirical reasons to expect the same to hold for neural networks trained with SGD: across a wide range of models, we observe the local learning coefficient (LLC) progressively increase from ~0 over the course of training.
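To make the LLC concrete, here is a minimal sketch of the kind of SGLD-based estimator used in the singular learning theory literature: sample from a localized, tempered posterior around a trained parameter `w_star` and compute the rescaled expected excess loss. The function name `llc_estimate` and all hyperparameters (`steps`, `eps`, `gamma`) are illustrative choices, not anything from the post; the toy target is a regular 2-D quadratic, whose true LLC is d/2 = 1.

```python
import numpy as np

rng = np.random.default_rng(0)

def llc_estimate(grad_L, L, w_star, n, steps=5000, eps=1e-4, gamma=100.0):
    """Toy SGLD estimate of the local learning coefficient (LLC) at w_star.

    Samples from the localized tempered posterior
        p(w) ∝ exp(-n * beta * L(w) - (gamma / 2) * |w - w_star|^2)
    with inverse temperature beta = 1 / log(n), then returns
        lambda_hat = n * beta * (E[L(w)] - L(w_star)).
    This is an illustrative sketch, not a production implementation.
    """
    beta = 1.0 / np.log(n)
    w = w_star.copy()
    losses = []
    for _ in range(steps):
        # SGLD step: drift toward low loss (tempered) plus localization pull,
        # with injected Gaussian noise of variance eps.
        drift = -0.5 * eps * (n * beta * grad_L(w) + gamma * (w - w_star))
        w = w + drift + np.sqrt(eps) * rng.normal(size=w.shape)
        losses.append(L(w))
    return n * beta * (np.mean(losses) - L(w_star))

# Regular (non-singular) 2-D quadratic: L(w) = |w|^2 / 2, true LLC = d/2 = 1.
L = lambda w: 0.5 * np.dot(w, w)
grad_L = lambda w: w
lam = llc_estimate(grad_L, L, np.zeros(2), n=10_000)
```

For a regular minimum like this one the estimate lands near d/2 (slightly biased low by the localization term `gamma`); for a singular minimum, it would come out strictly smaller than d/2, which is the "simpler solutions" effect being discussed.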