RL-on-CoTs is only computationally tractable if the correct trajectories are already close to the "modal" trajectory: the base policy has to sample them often enough that rollouts actually hit a reward.
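To make the tractability point concrete, here is a minimal toy sketch, under assumptions not in the original: a GRPO-style setup where a batch whose rewards are all zero produces all-zero advantages and hence no gradient, and an illustrative batch size of 256. If the current policy samples a correct CoT with probability p, the fraction of "dead" batches looks like this:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy model: each rollout independently solves the problem with
# probability p under the current policy. In a GRPO-style update,
# a batch where every reward is zero yields zero advantages and
# therefore no learning signal: the batch is "dead".
batch_size = 256  # illustrative choice, not from the source
for p in (1e-1, 1e-2, 1e-4, 1e-6):
    dead = np.mean(rng.binomial(batch_size, p, size=10_000) == 0)
    print(f"p = {p:g}: {dead:.1%} of batches carry no reward signal")
```

Once p falls much below 1/batch_size, nearly every batch is dead, which is the cliff the claim above points at.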
Conclusions that should be out of reach for a model at a given level of capability are still not far from the surface, as the Large Language Monkeys paper shows (Figure 3: note how well even Pythia-70M, with an 'M', starts doing on MATH at pass@10K). So a collection of progressively more difficult verifiable questions can probably stretch whatever wisdom a model implicitly holds from pretraining implausibly far.
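For reference, pass@k is conventionally computed with the unbiased estimator from Chen et al. (2021): draw n samples, count c correct, and estimate the probability that at least one of k draws is correct. A sketch, with illustrative numbers that are not from the paper:

```python
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator (Chen et al., 2021): chance that at
    least one of k samples drawn from n (c of them correct) is correct."""
    if n - c < k:
        return 1.0  # every size-k subset must contain a correct sample
    # 1 - C(n-c, k) / C(n, k), evaluated as a numerically stable product
    return 1.0 - float(np.prod(1.0 - k / np.arange(n - c + 1, n + 1)))

# Illustrative numbers: a 0.05% per-sample solve rate (5 of 10,000)
# rounds to zero at pass@1 but looks respectable at large k.
for k in (1, 100, 1_000, 10_000):
    print(f"pass@{k} = {pass_at_k(10_000, 5, k):.4f}")
```

This is why a weak model's latent competence can surface at pass@10K even when its pass@1 is indistinguishable from noise.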