There are also some suspicious aspects to this study: for example, it uses scaling experiments between 10^13 FLOP and 10^18 FLOP (or under 10^17 FLOP for LSTMs) to infer efficiency improvements as far as 10^23 FLOP — so it involves heroically extrapolating out five orders of magnitude. This is a problem because the result about scale-dependence might itself depend on which compute scale we’re looking at, which is very meta.
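To get an intuition for why five orders of magnitude is a lot, here's a minimal sketch (with made-up numbers, not the study's actual data) of fitting a power law on compute in the 10^13–10^18 FLOP range and extrapolating to 10^23 FLOP. Even a small error in the fitted log-log slope gets multiplied by the five extra orders of magnitude:

```python
import numpy as np

# Hypothetical illustration: a true power law with a bit of measurement
# noise, observed only at 10^13..10^18 FLOP (all values are made up).
rng = np.random.default_rng(0)
log_c = np.linspace(13, 18, 6)                 # log10 of compute (FLOP)
true_slope, true_intercept = -0.05, 2.0
log_loss = true_slope * log_c + true_intercept + rng.normal(0, 0.01, log_c.size)

# Fit a line in log-log space (i.e., a power law).
slope, intercept = np.polyfit(log_c, log_loss, 1)

# The slope error is tiny in-sample, but at 10^23 FLOP it has been
# amplified by the distance from the data (up to 10 log-units away).
pred_at_23 = slope * 23 + intercept
true_at_23 = true_slope * 23 + true_intercept
print(f"fitted slope error:  {slope - true_slope:+.4f}")
print(f"log10 error at 10^23: {pred_at_23 - true_at_23:+.4f}")
```

Note that an error in log10 units is an error in a *multiplicative factor*, so even a modest miss at 10^23 FLOP can translate into a large over- or under-estimate of the inferred efficiency gain.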
I think minor inaccuracies in the Gundlach 2025b accounting are quite likely and indeed mentioned that at one point in my OP.
Anson Ho’s blog post picked up on that too.