I have seen people say this many times, but I don’t understand. What makes it so clear?
GPT-4.5 is roughly a 10x scale-up of GPT-4, right? And full number jumps in GPT have always been ~100x? So GPT-4.5 seems like the natural name for OpenAI to go with.
I do think it’s clear that OpenAI viewed GPT-4.5 as something of a disappointment, I just haven’t seen anything indicating that they at some point planned to break the naming convention in this way.
GPT-4.5 is roughly a 10x scale-up of GPT-4, right? And full number jumps in GPT have always been ~100x? So GPT-4.5 seems like the natural name for OpenAI to go with.
10x is what it was, but it wasn’t what it was supposed to be. That’s just the scale they finally killed it at, after the innumerable bugs and other issues they alluded to during the livestream and elsewhere. This is expected given the ‘rocket waiting equation’ for large DL runs: after a certain point, no matter how much you have invested, it’s a sunk cost and you’re better off starting afresh, such as, say, with distilled data from some sort of breakthrough model… (Reading between the lines, I suspect that what would become ‘GPT-4.5’ was one of the several still-unknown projects, besides Superalignment, which suffered from Sam Altman overpromising compute quotas and gaslighting people about it. That led to an endless deathmarch where they kept thinking ‘we’ll get the compute next month’; the 10x compute-equivalent comes from a mix of what compute they scraped together from failed runs/iterations and what improvements they could wodge in partway, even though that is not as good as doing it from scratch. See OA Rerun.)
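A toy version of that restart logic can be sketched in a few lines. This is an illustrative assumption, not anything from OpenAI: treat the run as needing `total_days` on the current stack, and suppose fixes plus distilled data would make a fresh run `speedup`× faster.

```python
# Toy 'rocket waiting equation' for a long training run: is it faster to let a
# buggy, partly-finished run limp to completion, or to kill it and restart on an
# improved stack? All numbers below are illustrative, not real figures.

def better_to_restart(total_days: float, elapsed_days: float, speedup: float) -> bool:
    """True if a fresh run at `speedup`x efficiency finishes sooner than
    continuing the current run to completion."""
    remaining_if_continue = total_days - elapsed_days
    time_if_restart = total_days / speedup
    return time_if_restart < remaining_if_continue

# Hypothetical: a 180-day run, 60 days in, where a restart would run 2x faster.
# Restarting takes 90 days vs. 120 days remaining, so you kill the run.
print(better_to_restart(180, 60, 2.0))   # → True
# Late in the run the sunk cost flips: with only 30 days left, you finish it.
print(better_to_restart(180, 150, 2.0))  # → False
```

The crossover is when the elapsed fraction exceeds 1 − 1/speedup, which is why even large sunk investments get abandoned early on when the available speedup is big enough.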
If GPT-4.5 was supposed to be GPT-5, why would Sam Altman underdeliver on compute for it? Surely GPT-5 would have been a top priority?
Maybe Sam Altman just hoped to get way more compute in total, and then this failed, and OpenAI simply didn’t have enough compute to meet GPT-5’s demands no matter how high a priority they made it? If so, I would have thought that’s a pretty different story from the situation with Superalignment (where my impression was that the complaint was “OpenAI prioritized this too little” rather than “OpenAI overestimated the total compute it would have available, and this was one of many projects that suffered”).
I have seen people say this many times, but I don’t understand. What makes it so clear?
If it’s not obvious at this point why, I would prefer to not go into it here in a shallow superficial way, and refer you to the OA coup discussions.