sanyer comments on Consider chilling out in 2028

sanyer 26 Jun 2025 5:50 UTC
3 points
0
How many people are working on test-time learning? How feasible do you think it is?
- Vladimir_Nesov 26 Jun 2025 18:03 UTC
  4 points
  0
  Parent
  From a new comment elsewhere:
  
  On a recent podcast, Dwarkesh Patel says that Sutskever’s SSI is rumored to be working on “test time training” (at 39:25). Another reason to think this “unhobbling” is plausible soon is that it might turn out to be possible to use agentic (tool-using) RLVR to train AIs to prepare datasets for finetuning variants of themselves (not necessarily with RLVR) that will then do better at particular tasks.