Do you plan on updating the graph every 6-12 months? It doesn’t have to be a new paper every time, obviously. Just having the graph on metr.org and regularly updating it would be very useful.
EDIT: https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/
Idk if this is new or if I just somehow missed this page.
Do you plan on updating the graph every 6-12 months? It doesn’t have to be a new paper every time, obviously. Just having the graph on metr.org and regularly updating it would be very useful.
EDIT: https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/
Idk if this is new or if I just somehow missed this page.