Roman Leventov comments on chinchilla’s wild implications

Roman Leventov 1 Aug 2022 10:14 UTC
24 points
10
My two cents contra updates towards longer or more uncertain AGI timelines given the information in this post:
- The training of language models is many orders of magnitude less efficient than the training of the human brain, which acquires comparable language comprehension and generation ability on a tiny fraction of the text corpora discussed in this post. So we can expect more innovations that improve the training efficiency. Even one such innovation, improving the training efficiency (in terms of data) by a single order of magnitude, would probably ensure that the total size of publically available text data is not a roadblock on the path to AGI, even if it is, currently. I think the probability that we will see at least one such innovation in the next 5 years is quite high, more than 10%.
- Perhaps DeepMind’s Gato is already a response to the realisation that “there is not enough text”, explained in this post. So they train Gato on game replays, themselves generated programmatically, using RL agents. They can generate practically unlimited amounts of training data in this way. Then there is probably a speculation that at some scale, Gato will generalise the knowledge acquired in games to text, or will indeed enable much more efficient training on text, (a-la few-shot learning in current LMs) if the model is pre-trained on games and other tasks.
- Daniel Kokotajlo 24 Mar 2026 20:11 UTC
  5 points
  0
  Parent
  Blast from the past! I had forgotten how a bunch of people updated towards longer timelines because of Chinchilla!