METR (org)
Tag
Last edit: Jul 1, 2024, 6:47 PM by Ruby
Formerly ARC Evals
Review of METR’s public evaluation protocol
nahoj and JaimeRV | Jun 30, 2024, 10:03 PM | 10 points | 0 comments | 5 min read
Interpreting the METR Time Horizons Post
snewman | Apr 30, 2025, 3:03 AM | 66 points | 12 comments | 10 min read | (amistrongeryet.substack.com)
Improved visualizations of METR Time Horizons paper.
LDJ | Mar 19, 2025, 11:36 PM | 20 points | 4 comments | 2 min read
ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks
Beth Barnes | Aug 1, 2023, 6:30 PM | 153 points | 12 comments | 5 min read | (evals.alignment.org)
METR is hiring ML Research Engineers and Scientists
Xodarap | Jun 5, 2024, 9:27 PM | 5 points | 0 comments | 1 min read | (metr.org)
METR is hiring!
Beth Barnes | Dec 26, 2023, 9:00 PM | 65 points | 1 comment | 1 min read
Clarifying METR’s Auditing Role
Beth Barnes | May 30, 2024, 6:41 PM | 108 points | 1 comment | 2 min read
METR: Measuring AI Ability to Complete Long Tasks
Zach Stein-Perlman | Mar 19, 2025, 4:00 PM | 241 points | 104 comments | 5 min read | (metr.org)
[Question] How far along Metr’s law can AI start automating or helping with alignment research?
Christopher King | Mar 20, 2025, 3:58 PM | 20 points | 21 comments | 1 min read
Introducing METR’s Autonomy Evaluation Resources
Megan Kinniment and Beth Barnes | Mar 15, 2024, 11:16 PM | 90 points | 0 comments | 1 min read | (metr.github.io)
METR’s preliminary evaluation of o3 and o4-mini
Christopher King | Apr 16, 2025, 8:23 PM | 14 points | 7 comments | 1 min read | (metr.github.io)
Reactions to METR task length paper are insane
Cole Wyeth | Apr 10, 2025, 5:13 PM | 58 points | 43 comments | 4 min read
METR: AI models can be dangerous before public deployment
UnofficialLinkpostBot | Feb 26, 2025, 8:19 PM | 16 points | 0 comments | 3 min read | (metr.org)
ARC Evals: Responsible Scaling Policies
Zach Stein-Perlman | Sep 28, 2023, 4:30 AM | 40 points | 10 comments | 2 min read | 1 review | (evals.alignment.org)