Maybe_a comments on larger language models may disappoint you [or, an eternally unfinished draft]

Maybe_a 10 Dec 2022 12:59 UTC
3 points
0
It’s a fine overview of modern language models. Idea of scaling all the skills at the same time is highlighted, different from human developmental psychology. Since publishing 500B-PaLM models seemed to have jumps at around 25% of the tasks of BIG-bench.
Inadequacy of measuring average performance on LLM is discussed, where a proportion is good, and rest is outright failure from human PoV. Scale seems to help with rate of success.