GPT = Generative Pre-Training?
Everyone thinks GPT stands for “Generative Pre-trained Transformer”. (For example, Wikipedia.) Does it really? The earliest mention of “GPT” by OpenAI themselves is in the GPT-2 paper, which refers to “the OpenAI GPT model” and cites the GPT-1 paper. That paper does not contain the phrase “generative pre-trained transformer”. But it does contain the phrase “generative pre-training”, in the title and in the body, italicized.
The earliest mention of “OpenAI GPT” I found[1] is in the BERT paper by Google (2018-10-11).

It’s a common pattern that early usages of “GPT” reference BERT. I have not seen any counterexamples yet. E.g. Tweet (2018-10-12), Tweet (2018-10-12), Tweet (2018-10-13), Chinese blog post (2018-10-14), GitHub issue (2018-10-24), paper (2018-11-02), paper (2018-11-02), paper (2018-11-25), GitHub issue (2018-11-26). The GPT-2 paper (2019-02-14) later also cites BERT.

[1] My search strategies included: searching Google, Twitter, the first 30 PDFs from Cited By via Google Scholar, Semantic Scholar citations, alphaXiv Assistant, GPT-5.2-Thinking, GitHub, Grok 4.1, Claude Opus 4.5 Researcher, Manus 1.6 Lite, …
If this was basically an oversight that ultimately went viral to billions of people, that’s hilarious.
This is the core of the dispute between the USPTO and OpenAI over their (failed) attempt to trademark the term in the US, so citing their papers doesn’t help resolve this.
Really? My read was that the USPTO claimed GPT stands for “generative pre-trained transformer”, and OpenAI has neither confirmed nor disputed that, merely arguing that most consumers don’t know that.
These are the circumstances in which one engages in kettle logic, so I wouldn’t read too much into any of their arguments.