Zach Stein-Perlman comments on METR: Measuring AI Ability to Complete Long Tasks

Zach Stein-Perlman 19 Mar 2025 19:06 UTC
9 points
7
I think doing 1-week or 1-month tasks reliably would suffice to mostly automate lots of work.