Timothy Underwood comments on ML Systems Will Have Weird Failure Modes

Timothy Underwood 16 Feb 2022 14:14 UTC
1 point
0
Yeah, but don’t you expect successful human equivalent neural networks to have some sort of loop involved? It seems pretty likely to me that the ML researchers will successfully figure out how to put self analysis loops into neural nets.
- delton137 18 Feb 2022 16:25 UTC
  3 points
  1
  Parent
  Networks with loops are much harder to train.. that was one of the motivations for going to transformers instead of RNNs. But yeah, sure, I agree. My objection is more that posts like this are so high level I have trouble following the argument, if that makes sense. The argument seems roughly plausible but not making contact with any real object level stuff makes it a lot weaker, at least to me. The argument seems to rely on “emergence of self-awareness / discovery of malevolence/deception during SGD” being likely which is unjustified in my view. I’m not saying the argument is wrong, more that I personally don’t find it very convincing.