Cole Wyeth comments on Cole Wyeth’s Shortform

Cole Wyeth 24 May 2025 14:36 UTC
3 points
1
We can still check if it lies on the projected slower exponential curve before reasoning models were introduced.
- Vladimir_Nesov 24 May 2025 14:49 UTC
  11 points
  0
  Parent
  Sure, but trends like this only say anything meaningful across multiple years, any one datapoint adds almost no signal, in either direction. This is what makes scaling laws much more predictive, even as they are predicting the wrong things. So far there are no published scaling laws for RLVR, the literature is still developing a non-terrible stable recipe for the first few thousand training steps.