METR’s preliminary evaluation of o3 and o4-mini
Christopher King
16 Apr 2025 20:23 UTC
AI
AI Risk
METR (org)
AI Evaluations
Link post