jacob_cannell comments on What’s the evidence that LLMs will scale up efficiently beyond GPT4? i.e. couldn’t GPT5, etc., be very inefficient?

jacob_cannell 24 Nov 2023 17:30 UTC
2 points
0
There are two key subquestions here: the scaling function of better at X with respect to net training compute, and what exactly X entails.

The X here is ‘predict internet text’, not “generate new highly valuable research etc”, and success at the latter likely requires combining LLMs with at least planning/search.