This seems to be a consequence of having a large but not-actually-that-deep-in-serial-steps net trained on next token prediction of a big pile of human data. AI doesn’t have to be like that—I expect something that can competently choose which cognitive strategies to execute will be much better at multiplication than a human, but it’s hard to get to that kind of AI by predictive training on a big pile of human data.
I think this is the point. Existing training creates something like System 1, which now happens to match what humans find “natural”. Something else is probably needed to make math “natural” for ML models.
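The multiplication point can be made concrete with a toy sketch (my own illustration, not anything from the comments above): a system that explicitly selects an algorithm for exact arithmetic gets unbounded-digit multiplication right with O(digits) serial steps, whereas a fixed-depth forward pass has to approximate it in one shot. The function names here are hypothetical.

```python
def multiply_longhand(a: int, b: int) -> int:
    """Schoolbook multiplication: one partial product per digit of b,
    i.e. a serial loop a shallow fixed-depth net cannot unroll."""
    result = 0
    for shift, digit_char in enumerate(reversed(str(b))):
        result += a * int(digit_char) * (10 ** shift)
    return result

def solve(task: str, x: int, y: int) -> int:
    # "Strategy choice": route exact arithmetic to an exact procedure
    # instead of relying on System 1-style pattern recall.
    if task == "multiply":
        return multiply_longhand(x, y)
    raise NotImplementedError(task)

print(solve("multiply", 123456789, 987654321))  # exact for any digit count
```

The point of the dispatch layer is only that the hard part is the routing decision, not the arithmetic itself once the right procedure is chosen.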