From the Subliminal Learning paper: “A noteworthy exception is that GPT-4o and GPT-4.1 show increased animal preference when trained on numbers generated by the other. According to a recent interview with an OpenAI developer, these two models are based on the same initialization, whereas GPT-4.1 mini and nano are not (Pokrass, 2025).”
The claim is at 7:19 in the podcast: the standard-sized GPT-4.1 was obtained by changing mid-training and post-training on top of an older pretrained model. That older model is likely GPT-4o, though it isn’t named explicitly.