Very informative. You ignore inference in your cost breakdown, saying:
“Other possible costs, such as providing ChatGPT for free, would have been much smaller.”
But Semianalysis says: “More importantly, inference costs far exceed training costs when deploying a model at any reasonable scale. In fact, the costs to inference ChatGPT exceed the training costs on a weekly basis.” Why the discrepancy?
“In fact, the costs to inference ChatGPT exceed the training costs on a weekly basis.”
That seems quite wild: if the training cost was $50M, then the inference cost for a year would be about $2.5B.
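A quick sanity check of that arithmetic, taking the “weekly” claim literally (the $50M training figure is the assumption from the comment above, not a confirmed number):

```python
training_cost = 50e6   # assumed one-off training cost, $50M
weeks_per_year = 52

# If weekly inference spend matches the full training cost,
# the annualized inference bill is:
annual_inference = training_cost * weeks_per_year
print(annual_inference)  # 2.6e9, i.e. roughly $2.5-2.6B per year
```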
Whether inference dominates the total cost seems to depend on how you attribute the cost of building the supercomputer (buying the GPUs). If you fold the full cost of building the supercomputer into the training cost, then the inference cost (excluding the cost of building the computer) looks cheap. If instead you split the building cost between training and inference in proportion to “use time”, then the inference cost would dominate.
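A minimal sketch of the two accounting schemes. All numbers are made-up assumptions for illustration (not actual OpenAI or Microsoft figures); the point is only that the same underlying spend flips which side “dominates” depending on where the hardware cost is booked:

```python
# Hypothetical figures (assumptions, not real data):
hardware_cost = 500e6        # cost of building the cluster (buying GPUs)
training_opex = 10e6         # non-hardware training cost (power, staff)
inference_opex = 100e6       # non-hardware inference cost over the year
training_share_of_use = 0.1  # assume training occupies 10% of machine-hours

# Accounting 1: bundle the entire hardware cost into "training".
training_1 = hardware_cost + training_opex
inference_1 = inference_opex
print(training_1 > inference_1)   # True: training looks dominant

# Accounting 2: amortize hardware in proportion to use time.
training_2 = hardware_cost * training_share_of_use + training_opex
inference_2 = hardware_cost * (1 - training_share_of_use) + inference_opex
print(inference_2 > training_2)   # True: now inference dominates
```

Under scheme 1, training is $510M vs. $100M for inference; under scheme 2, the same total spend reads as $60M training vs. $550M inference.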
I ignored inference because I’m specifically talking about 2022: ChatGPT was only released at the very end of 2022, and GPT-4 wasn’t released until 2023.
Since OpenAI is renting Microsoft compute for both training and inference, it seems reasonable to think that inference >> training. Am I right?