RSS

METR (org)

TagLast edit: 1 Jul 2024 18:47 UTC by Ruby

Formerly ARC Evals

Re­view of METR’s pub­lic eval­u­a­tion protocol

30 Jun 2024 22:03 UTC
10 points
0 comments5 min readLW link

METR’s Ob­ser­va­tions of Re­ward Hack­ing in Re­cent Fron­tier Models

Daniel Kokotajlo9 Jun 2025 18:03 UTC
99 points
9 comments11 min readLW link
(metr.org)

In­ter­pret­ing the METR Time Hori­zons Post

snewman30 Apr 2025 3:03 UTC
68 points
12 comments10 min readLW link
(amistrongeryet.substack.com)

Im­proved vi­su­al­iza­tions of METR Time Hori­zons pa­per.

LDJ19 Mar 2025 23:36 UTC
20 points
4 comments2 min readLW link

METR’s Eval­u­a­tion of GPT-5

GradientDissenter7 Aug 2025 22:17 UTC
139 points
15 comments20 min readLW link
(metr.github.io)

ARC Evals new re­port: Eval­u­at­ing Lan­guage-Model Agents on Real­is­tic Au­tonomous Tasks

Beth Barnes1 Aug 2023 18:30 UTC
153 points
12 comments5 min readLW link
(evals.alignment.org)

METR is hiring ML Re­search Eng­ineers and Scientists

Xodarap5 Jun 2024 21:27 UTC
5 points
0 comments1 min readLW link
(metr.org)

CoT May Be Highly In­for­ma­tive De­spite “Un­faith­ful­ness” [METR]

GradientDissenter11 Aug 2025 21:47 UTC
64 points
3 comments24 min readLW link
(metr.org)

METR is hiring!

Beth Barnes26 Dec 2023 21:00 UTC
65 points
1 comment1 min readLW link

Clar­ify­ing METR’s Au­dit­ing Role

Beth Barnes30 May 2024 18:41 UTC
108 points
1 comment2 min readLW link

METR: Mea­sur­ing AI Abil­ity to Com­plete Long Tasks

Zach Stein-Perlman19 Mar 2025 16:00 UTC
241 points
106 comments5 min readLW link
(metr.org)

[Question] How far along Metr’s law can AI start au­tomat­ing or helping with al­ign­ment re­search?

Christopher King20 Mar 2025 15:58 UTC
20 points
21 comments1 min readLW link

In­tro­duc­ing METR’s Au­ton­omy Eval­u­a­tion Resources

15 Mar 2024 23:16 UTC
90 points
0 comments1 min readLW link
(metr.github.io)

METR’s pre­limi­nary eval­u­a­tion of o3 and o4-mini

Christopher King16 Apr 2025 20:23 UTC
14 points
7 comments1 min readLW link
(metr.github.io)

Re­ac­tions to METR task length pa­per are insane

Cole Wyeth10 Apr 2025 17:13 UTC
59 points
43 comments4 min readLW link

METR: AI mod­els can be dan­ger­ous be­fore pub­lic deployment

UnofficialLinkpostBot26 Feb 2025 20:19 UTC
16 points
0 comments3 min readLW link
(metr.org)

ARC Evals: Re­spon­si­ble Scal­ing Policies

Zach Stein-Perlman28 Sep 2023 4:30 UTC
40 points
10 comments2 min readLW link1 review
(evals.alignment.org)
No comments.