Oh right, sorry, I missed the derivation: among $n_M + n_T$ samples, the maximum is equally likely to be any of them, so the probability that the largest sample comes from the model is
$$\frac{n_M}{n_M + n_T} = \frac{1}{1 + n_T/n_M} = \frac{1}{1 + \exp\big(-(\log n_M - \log n_T)\big)}$$
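As a quick sanity check on the exchangeability argument, here's a minimal Monte Carlo sketch (the function name, uniform distribution, and sample counts are my own illustration, not from the original):

```python
import numpy as np

rng = np.random.default_rng(0)

def p_model_has_max(n_M, n_T, trials=100_000):
    """Estimate P(the overall max of n_M + n_T i.i.d. samples
    is one of the model's n_M samples)."""
    model = rng.random((trials, n_M))   # model's n_M samples per trial
    target = rng.random((trials, n_T))  # opponent's n_T samples per trial
    return (model.max(axis=1) > target.max(axis=1)).mean()

# One compute doubling: n_M = 2 * n_T, so the predicted
# win probability is n_M / (n_M + n_T) = 2/3.
print(p_model_has_max(200, 100))  # ~ 0.667
```

Any continuous distribution gives the same answer, since only the ranks of the samples matter.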
This model then predicts that models' "Elo ratings" ($\log n_M$) would grow linearly over time, which (based on this chart GPT5 gave me) I think corresponds roughly to the progress in chess from 2007 onwards.
It also makes the quantitative prediction that a doubling in compute (or compute efficiency) leads to a 2/3 win probability, or around 120 Elo points. (Credit to the Hex paper for this observation.) Under 18-month doublings (per one version of Moore's law), this would be around 800 Elo points per decade, which looks like a bit of an overestimate but similar to the fastest observed rate of progress.
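Spelling out that arithmetic with the standard Elo expected-score formula:

$$P(\text{win}) = \frac{1}{1 + 10^{-\Delta/400}} = \frac{2}{3} \quad\Rightarrow\quad 10^{-\Delta/400} = \frac{1}{2} \quad\Rightarrow\quad \Delta = 400\log_{10} 2 \approx 120.4,$$

and at one doubling per 18 months, a decade holds $10/1.5 \approx 6.7$ doublings, i.e. roughly $6.7 \times 120 \approx 800$ Elo points.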