Jonas Moss comments on (Updated) METR’s data can’t distinguish between trajectories (and 80% horizons are an order of magnitude off)

Jonas Moss 19 Feb 2026 13:20 UTC
1 point
0

I think the important thing to realise is that while one needs to take additional steps for the ‘marginal’ approach when fitting a model that explicitly accounts for the deviation in task-length-for-humans vs task-difficulty-for-llms, models that don’t explicitly account for this (such as the original METR model) should have it naturally learned into the shape of their logistic curve.

I don’t immediately see this. The marginal idea is roughly about integrating over random effects, and that’s hard to capture without actually doing it. My statement that METR’s original approach is about the typical effect is wrong though.
- Alexander Barry 20 Feb 2026 0:56 UTC
  2 points
  1
  Parent
  I think we agree and I just stated this badly—I was just meaning to say that METR’s original approach is closer to marginal despite them not explicitly doing the integrating over the random effects (although I agree you need to do integrate over the random effects in models that include them to get the marginal time horizon).