cubefox comments on “Recursive Self-Improvement” Is Three Different Things

cubefox 11 Feb 2026 2:06 UTC
3 points
1
I would add another type: self play during training time. As the article discusses, forms of self play were recently published for reasoning RL. Possibly earlier than that in frontier AI companies.
- Viliam 12 Feb 2026 15:40 UTC
  2 points
  −2
  Parent
  Just nitpicking, but I would classify self-play as a subset of R&D. The AI is exploring some… space… by experimenting and learning, it’s just a very narrow space.
- Ihor Kendiukhov 11 Feb 2026 15:19 UTC
  1 point
  0
  Parent
  Agreed.