RSS

Zach Stein-Perlman

Karma: 9,419

AI strategy & governance. ailabwatch.org. ailabwatch.substack.com.

AI com­pa­nies’ eval re­ports mostly don’t sup­port their claims

Zach Stein-PerlmanJun 9, 2025, 1:00 PM
120 points
3 comments4 min readLW link

New web­site an­a­lyz­ing AI com­pa­nies’ model evals

Zach Stein-PerlmanMay 26, 2025, 4:00 PM
58 points
0 comments4 min readLW link

New score­card eval­u­at­ing AI com­pa­nies on safety

Zach Stein-PerlmanMay 26, 2025, 4:00 PM
72 points
8 comments1 min readLW link

Claude 4

Zach Stein-PerlmanMay 22, 2025, 5:00 PM
71 points
24 comments1 min readLW link
(www.anthropic.com)

OpenAI rewrote its Pre­pared­ness Framework

Zach Stein-PerlmanApr 15, 2025, 8:00 PM
36 points
1 comment6 min readLW link

METR: Mea­sur­ing AI Abil­ity to Com­plete Long Tasks

Zach Stein-PerlmanMar 19, 2025, 4:00 PM
241 points
104 comments5 min readLW link
(metr.org)

Meta: Fron­tier AI Framework

Zach Stein-PerlmanFeb 3, 2025, 10:00 PM
33 points
2 comments1 min readLW link
(ai.meta.com)

Dario Amodei: On Deep­Seek and Ex­port Controls

Zach Stein-PerlmanJan 29, 2025, 5:15 PM
53 points
3 comments1 min readLW link
(darioamodei.com)

List of AI safety pa­pers from com­pa­nies, 2023–2024

Zach Stein-PerlmanJan 15, 2025, 6:00 PM
11 points
0 comments1 min readLW link

An­thropic lead­er­ship conversation

Zach Stein-PerlmanDec 20, 2024, 10:00 PM
67 points
17 comments6 min readLW link
(www.youtube.com)

o3

Zach Stein-PerlmanDec 20, 2024, 6:30 PM
154 points
164 comments1 min readLW link

Deep­Seek beats o1-pre­view on math, ties on cod­ing; will re­lease weights

Zach Stein-PerlmanNov 20, 2024, 11:50 PM
113 points
26 comments1 min readLW link

An­thropic: Three Sketches of ASL-4 Safety Case Components

Zach Stein-PerlmanNov 6, 2024, 4:00 PM
95 points
33 comments1 min readLW link
(alignment.anthropic.com)

The cur­rent state of RSPs

Zach Stein-PerlmanNov 4, 2024, 4:00 PM
23 points
2 comments9 min readLW link

Miles Brundage: Find­ing Ways to Cred­ibly Sig­nal the Benign­ness of AI Devel­op­ment and De­ploy­ment is an Ur­gent Priority

Zach Stein-PerlmanOct 28, 2024, 5:00 PM
22 points
4 comments3 min readLW link
(milesbrundage.substack.com)

UK AISI: Early les­sons from eval­u­at­ing fron­tier AI systems

Zach Stein-PerlmanOct 25, 2024, 7:00 PM
26 points
0 comments2 min readLW link
(www.aisi.gov.uk)

Lab gov­er­nance read­ing list

Zach Stein-PerlmanOct 25, 2024, 6:00 PM
20 points
3 comments1 min readLW link

IAPS: Map­ping Tech­ni­cal Safety Re­search at AI Companies

Zach Stein-PerlmanOct 24, 2024, 8:30 PM
42 points
13 commentsLW link
(www.iaps.ai)

What AI com­pa­nies should do: Some rough ideas

Zach Stein-PerlmanOct 21, 2024, 2:00 PM
33 points
10 comments5 min readLW link

An­thropic rewrote its RSP

Zach Stein-PerlmanOct 15, 2024, 2:25 PM
46 points
19 comments6 min readLW link