Cole Wyeth comments on Reactions to METR task length paper are insane

Cole Wyeth 11 Apr 2025 13:53 UTC
7 points
2
I remember enjoying that post (perhaps I even linked it somewhere?) and I think it’s probably the case that the inefficiency in task length scaling has to do with LLMs having only a subset of cognitive abilities available. I’m not really committed to a view on that here though.
The links don’t seem to prove that the points are “inaccurate.”