I heard rumors that GPT-4.5 got good pretraining loss but bad downstream performance. If that's true, the loss scaling laws may have worked correctly: they predict loss, not downstream capability. If not, then a lot of things can go wrong and something did, whether that's hardware issues, software bugs, machine learning problems, or problems with their earlier experiments.
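For what it's worth, the standard loss scaling laws (I'm assuming a Chinchilla-style form here as an illustration, not that this is what OpenAI actually fit) predict pretraining loss purely from parameter count N and training tokens D, roughly

    L(N, D) ≈ E + A / N^α + B / D^β

There's no term in there for downstream benchmarks, so a model can land right on the predicted loss curve and still underperform on tasks.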