paulfchristiano comments on [link] New essay summarizing some of my latest thoughts on AI safety

paulfchristiano 15 Nov 2015 1:15 UTC
0 points

delicate symmetry-breaking which can only come from either the training procedure or noise in the data, rather than the model itself

I’m still not convinced. The pointwise nonlinearities introduce a preferred basis, and cause the individual hidden units to be much more meaningful than linear combinations thereof.
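
A minimal sketch of the preferred-basis point, assuming a toy network with two hidden units. The weights, the input, the 45-degree rotation, and every helper name here are hypothetical, chosen only for illustration. Rotating the hidden basis (and compensating in the output weights) leaves a purely linear network unchanged, but changes the output once a pointwise ReLU sits between the two layers:

```python
import math

def relu(v):
    # Pointwise nonlinearity: acts on each hidden unit separately.
    return [max(0.0, x) for x in v]

def transpose(m):
    return [list(col) for col in zip(*m)]

def matvec(m, v):
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def matmul(a, b):
    bt = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in bt] for row in a]

def rotation(theta):
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]

def network(w_in, w_out, x, nonlinearity=None):
    # One hidden layer; the nonlinearity is optional so we can compare.
    h = matvec(w_in, x)
    if nonlinearity is not None:
        h = nonlinearity(h)
    return matvec(w_out, h)

# Hypothetical toy weights and input (not taken from the discussion).
W_IN = [[1.0, -2.0], [0.5, 1.5]]
W_OUT = [[2.0, -1.0]]
X = [1.0, 1.0]

# Re-express the hidden layer in a rotated basis: h' = R h.
# The output weights absorb the inverse rotation (R^T, since R is orthogonal).
R = rotation(math.pi / 4)
W_IN_ROT = matmul(R, W_IN)
W_OUT_ROT = matmul(W_OUT, transpose(R))

# Linear network: the rotation cancels, so the function is unchanged.
linear_gap = abs(network(W_IN, W_OUT, X)[0]
                 - network(W_IN_ROT, W_OUT_ROT, X)[0])

# ReLU network: the rotation no longer cancels, so the function changes.
relu_gap = abs(network(W_IN, W_OUT, X, relu)[0]
               - network(W_IN_ROT, W_OUT_ROT, X, relu)[0])

print("linear gap:", linear_gap)
print("relu gap:  ", relu_gap)
```

The linear gap is zero up to floating-point error, while the ReLU gap is not. This only shows that the model itself singles out the unit basis; it says nothing about why the units in that basis end up meaningful.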
- jsteinhardt 15 Nov 2015 7:48 UTC
  0 points
  Yeah; I discussed this with some others and came to the same conclusion. I do still think that one should explain why the preferred basis ends up being as meaningful as it does, but I agree that this is a much more minor objection.