Vladimir_Nesov comments on Consider chilling out in 2028

Vladimir_Nesov 26 Jun 2025 18:03 UTC
4 points
From a new comment elsewhere:

On a recent podcast (at 39:25), Dwarkesh Patel says that Sutskever’s SSI is rumored to be working on “test time training”. Another reason to think this “unhobbling” is plausible soon: it might turn out to be possible to use agentic (tool-using) RLVR to train AIs to prepare datasets for finetuning variants of themselves (not necessarily with RLVR), and those variants would then do better at particular tasks.