For what it’s worth, the model showcased in December (then called o3) seems to be completely different from the model that METR benchmarked (now called o3).
bhalstead
Karma: 8
- bhalstead, 4 May 2025 0:19 UTC, 3 points, in reply to: Thane Ruthenis’s comment on: Thane Ruthenis’s Shortform
Registering that I don’t expect GPT-5 to be “the biggest model release of the year,” for various reasons. I would guess (based on the cost and speed) that the model is GPT-4.1-sized. Conditional on this, the total training compute is likely to be below the state of the art.