I haven’t seen careful analysis of LLMs (probably because they’re newer, so harder to fit a trend), but eyeballing it… Chinchilla by itself must have been a factor-of-4 compute-equivalent improvement at least.
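As a rough sanity check on that eyeball estimate, here's a small sketch of how one might back out a compute-equivalent multiplier. It plugs the published parametric loss fit from the Chinchilla paper (Hoffmann et al. 2022) into the standard C ≈ 6ND FLOP approximation, takes a GPT-3-style run (175B parameters on 300B tokens) as the pre-Chinchilla baseline, and asks how much less compute a compute-optimal allocation would need to match that run's predicted loss. The baseline choice, the fitted constants, and the direction of comparison are all assumptions, so treat whatever multiplier it prints as a ballpark, not a measurement.

```python
# Sketch: rough compute-equivalent gain from Chinchilla-optimal allocation.
# Uses the parametric loss fit from Hoffmann et al. (2022),
#   L(N, D) = E + A / N**alpha + B / D**beta,
# with their published constants, plus the standard C ~= 6*N*D FLOP estimate.
# The GPT-3-style baseline (175B params, 300B tokens) is an illustrative
# assumption; the multiplier is sensitive to both the fit and the baseline.

E, A, B, ALPHA, BETA = 1.69, 406.4, 410.7, 0.34, 0.28

def loss(n_params: float, n_tokens: float) -> float:
    """Chinchilla parametric estimate of pretraining loss."""
    return E + A / n_params**ALPHA + B / n_tokens**BETA

def optimal_loss(compute: float, grid: int = 2000) -> float:
    """Best loss achievable at a fixed FLOP budget, sweeping the N/D split."""
    best = float("inf")
    for i in range(1, grid):
        n = 10 ** (7 + 7 * i / grid)   # sweep params over 1e7 .. 1e14
        d = compute / (6 * n)          # tokens implied by C ~= 6*N*D
        best = min(best, loss(n, d))
    return best

# Baseline: a GPT-3-style run (175B params on 300B tokens).
n_base, d_base = 175e9, 300e9
c_base = 6 * n_base * d_base
l_base = loss(n_base, d_base)

# Bisect for the smallest budget at which a compute-optimal run matches it.
lo, hi = c_base / 100, c_base
for _ in range(60):
    mid = (lo * hi) ** 0.5             # geometric midpoint of the bracket
    if optimal_loss(mid) <= l_base:
        hi = mid
    else:
        lo = mid

print(f"baseline predicted loss : {l_base:.3f}")
print(f"compute-equivalent gain : {c_base / hi:.1f}x")
```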
Ok, but discovering the Chinchilla scaling laws is a one-time boost to training efficiency. You shouldn't expect to repeatedly get 4x improvements just because you observed that one.
Every algorithmic improvement is a one-time boost.