
Zach Stein-Perlman (Zachary Stein-Perlman)

Karma: 4,576

AI strategy & governance. ailabwatch.org. Looking for new projects.

As of late May 2024, I’m focusing on blogging. In June I expect to focus on exploring a version of ailabwatch.org that could get more attention. I’m most excited to receive offers to help with projects like ailabwatch.org. I’m also excited to be pitched blogposts/projects.

Ilya Sutskever and Jan Leike resign from OpenAI [updated]

Zach Stein-Perlman · 15 May 2024 0:45 UTC
246 points
95 comments · 3 min read · LW link

AI companies aren’t really using external evaluators

Zach Stein-Perlman · 24 May 2024 16:01 UTC
231 points
12 comments · 4 min read · LW link

Introducing AI Lab Watch

Zach Stein-Perlman · 30 Apr 2024 17:00 UTC
219 points
31 comments · 1 min read · LW link
(ailabwatch.org)

DeepMind’s “Frontier Safety Framework” is weak and unambitious

Zach Stein-Perlman · 18 May 2024 3:00 UTC
156 points
13 comments · 4 min read · LW link

Maybe Anthropic’s Long-Term Benefit Trust is powerless

Zach Stein-Perlman · 27 May 2024 13:00 UTC
143 points
11 comments · 2 min read · LW link

FLI open letter: Pause giant AI experiments

Zach Stein-Perlman · 29 Mar 2023 4:04 UTC
126 points
123 comments · 2 min read · LW link
(futureoflife.org)

The public supports regulating AI for safety

Zach Stein-Perlman · 17 Feb 2023 4:10 UTC
114 points
9 comments · 1 min read · LW link
(aiimpacts.org)

DeepMind: Model evaluation for extreme risks

Zach Stein-Perlman · 25 May 2023 3:00 UTC
94 points
11 comments · 1 min read · LW link
(arxiv.org)

Questions for labs

Zach Stein-Perlman · 30 Apr 2024 22:15 UTC
76 points
11 comments · 8 min read · LW link

New voluntary commitments (AI Seoul Summit)

Zach Stein-Perlman · 21 May 2024 11:00 UTC
75 points
16 comments · 7 min read · LW link
(www.gov.uk)

OpenAI: Preparedness framework

Zach Stein-Perlman · 18 Dec 2023 18:30 UTC
70 points
23 comments · 4 min read · LW link
(openai.com)

DeepMind: Frontier Safety Framework

Zach Stein-Perlman · 17 May 2024 17:30 UTC
64 points
0 comments · 3 min read · LW link
(deepmind.google)

DeepMind: Evaluating Frontier Models for Dangerous Capabilities

Zach Stein-Perlman · 21 Mar 2024 3:00 UTC
61 points
1 comment · 1 min read · LW link
(arxiv.org)

OpenAI-Microsoft partnership

Zach Stein-Perlman · 3 Oct 2023 20:01 UTC
51 points
18 comments · 1 min read · LW link

Slowing AI: Reading list

Zach Stein-Perlman · 17 Apr 2023 14:30 UTC
45 points
3 comments · 4 min read · LW link

Slowing AI: Foundations

Zach Stein-Perlman · 17 Apr 2023 14:30 UTC
45 points
11 comments · 17 min read · LW link

[Question] Which possible AI systems are relatively safe?

Zach Stein-Perlman · 21 Aug 2023 17:00 UTC
42 points
20 comments · 1 min read · LW link

ARC Evals: Responsible Scaling Policies

Zach Stein-Perlman · 28 Sep 2023 4:30 UTC
40 points
9 comments · 2 min read · LW link
(evals.alignment.org)

Framing AI strategy

Zach Stein-Perlman · 7 Feb 2023 19:20 UTC
33 points
1 comment · 18 min read · LW link
(aiimpacts.org)

Anthropic, Google, Microsoft & OpenAI announce Executive Director of the Frontier Model Forum & over $10 million for a new AI Safety Fund

Zach Stein-Perlman · 25 Oct 2023 15:20 UTC
31 points
8 comments · 4 min read · LW link
(www.frontiermodelforum.org)