Zach Stein-Perlman(Zachary Stein-Perlman)

Karma: 4,925

AI strategy & governance. ailabwatch.org. Looking for new projects.

As of 1 June 2024, I’ve recently focused on blogging, but I expect to soon focus on exploring a version of ailabwatch.org that could get more attention. I’m most excited to receive offers to help with projects like ailabwatch.org. I’m also excited to be pitched blogposts/projects.

Companies’ safety plans neglect risks from scheming AI

Zach Stein-Perlman3 Jun 2024 15:00 UTC

73 points

4 comments6 min readLW link

AI companies’ commitments

Zach Stein-Perlman29 May 2024 11:00 UTC

36 points

0 comments1 min readLW link

Maybe Anthropic’s Long-Term Benefit Trust is powerless

Zach Stein-Perlman27 May 2024 13:00 UTC

198 points

20 comments2 min readLW link

AI companies aren’t really using external evaluators

Zach Stein-Perlman24 May 2024 16:01 UTC

238 points

15 comments4 min readLW link

New voluntary commitments (AI Seoul Summit)

Zach Stein-Perlman21 May 2024 11:00 UTC

81 points

17 comments7 min readLW link

(www.gov.uk)

DeepMind’s “Frontier Safety Framework” is weak and unambitious

Zach Stein-Perlman18 May 2024 3:00 UTC

157 points

14 comments4 min readLW link

DeepMind: Frontier Safety Framework

Zach Stein-Perlman17 May 2024 17:30 UTC

64 points

0 comments3 min readLW link

(deepmind.google)

Ilya Sutskever and Jan Leike resign from OpenAI [updated]

Zach Stein-Perlman15 May 2024 0:45 UTC

246 points

95 comments3 min readLW link

Questions for labs

Zach Stein-Perlman30 Apr 2024 22:15 UTC

77 points

11 comments8 min readLW link

Introducing AI Lab Watch

Zach Stein-Perlman30 Apr 2024 17:00 UTC

219 points

30 comments1 min readLW link

(ailabwatch.org)

Staged release

Zach Stein-Perlman17 Apr 2024 16:00 UTC

9 points

4 comments2 min readLW link

DeepMind: Evaluating Frontier Models for Dangerous Capabilities

Zach Stein-Perlman21 Mar 2024 3:00 UTC

61 points

5 comments1 min readLW link

(arxiv.org)

OpenAI: Preparedness framework

Zach Stein-Perlman18 Dec 2023 18:30 UTC

70 points

23 comments4 min readLW link

(openai.com)

Anthropic, Google, Microsoft & OpenAI announce Executive Director of the Frontier Model Forum & over $10 million for a new AI Safety Fund

Zach Stein-Perlman25 Oct 2023 15:20 UTC

31 points

8 comments4 min readLW link

(www.frontiermodelforum.org)

OpenAI-Microsoft partnership

Zach Stein-Perlman3 Oct 2023 20:01 UTC

51 points

18 comments1 min readLW link

[Question] Current AI safety techniques?

Zach Stein-Perlman3 Oct 2023 19:30 UTC

30 points

2 comments2 min readLW link

ARC Evals: Responsible Scaling Policies

Zach Stein-Perlman28 Sep 2023 4:30 UTC

40 points

9 comments2 min readLW link

(evals.alignment.org)

How to think about slowing AI

Zach Stein-Perlman17 Sep 2023 16:00 UTC

14 points

2 comments3 min readLW link

(forum.effectivealtruism.org)

Cruxes for overhang

Zach Stein-Perlman14 Sep 2023 17:00 UTC

12 points

5 comments6 min readLW link

(blog.aiimpacts.org)

Cruxes on US lead for some domestic AI regulation

Zach Stein-Perlman10 Sep 2023 18:00 UTC

26 points

3 comments2 min readLW link