RSS

Zach Stein-Perlman(Zachary Stein-Perlman)

Karma: 4,925

AI strategy & governance. ailabwatch.org. Looking for new projects.

As of 1 June 2024, I’ve recently focused on blogging, but I expect to soon focus on exploring a version of ailabwatch.org that could get more attention. I’m most excited to receive offers to help with projects like ailabwatch.org. I’m also excited to be pitched blogposts/​projects.

Com­pa­nies’ safety plans ne­glect risks from schem­ing AI

Zach Stein-Perlman3 Jun 2024 15:00 UTC
73 points
4 comments6 min readLW link

AI com­pa­nies’ commitments

Zach Stein-Perlman29 May 2024 11:00 UTC
36 points
0 comments1 min readLW link

Maybe An­thropic’s Long-Term Benefit Trust is powerless

Zach Stein-Perlman27 May 2024 13:00 UTC
198 points
20 comments2 min readLW link

AI com­pa­nies aren’t re­ally us­ing ex­ter­nal evaluators

Zach Stein-Perlman24 May 2024 16:01 UTC
238 points
15 comments4 min readLW link

New vol­un­tary com­mit­ments (AI Seoul Sum­mit)

Zach Stein-Perlman21 May 2024 11:00 UTC
81 points
17 comments7 min readLW link
(www.gov.uk)

Deep­Mind’s “​​Fron­tier Safety Frame­work” is weak and unambitious

Zach Stein-Perlman18 May 2024 3:00 UTC
157 points
14 comments4 min readLW link

Deep­Mind: Fron­tier Safety Framework

Zach Stein-Perlman17 May 2024 17:30 UTC
64 points
0 comments3 min readLW link
(deepmind.google)

Ilya Sutskever and Jan Leike re­sign from OpenAI [up­dated]

Zach Stein-Perlman15 May 2024 0:45 UTC
246 points
95 comments3 min readLW link

Ques­tions for labs

Zach Stein-Perlman30 Apr 2024 22:15 UTC
77 points
11 comments8 min readLW link

In­tro­duc­ing AI Lab Watch

Zach Stein-Perlman30 Apr 2024 17:00 UTC
219 points
30 comments1 min readLW link
(ailabwatch.org)

Staged release

Zach Stein-Perlman17 Apr 2024 16:00 UTC
9 points
4 comments2 min readLW link

Deep­Mind: Eval­u­at­ing Fron­tier Models for Danger­ous Capabilities

Zach Stein-Perlman21 Mar 2024 3:00 UTC
61 points
5 comments1 min readLW link
(arxiv.org)

OpenAI: Pre­pared­ness framework

Zach Stein-Perlman18 Dec 2023 18:30 UTC
70 points
23 comments4 min readLW link
(openai.com)

An­thropic, Google, Microsoft & OpenAI an­nounce Ex­ec­u­tive Direc­tor of the Fron­tier Model Fo­rum & over $10 mil­lion for a new AI Safety Fund

Zach Stein-Perlman25 Oct 2023 15:20 UTC
31 points
8 comments4 min readLW link
(www.frontiermodelforum.org)

OpenAI-Microsoft partnership

Zach Stein-Perlman3 Oct 2023 20:01 UTC
51 points
18 comments1 min readLW link

[Question] Cur­rent AI safety tech­niques?

Zach Stein-Perlman3 Oct 2023 19:30 UTC
30 points
2 comments2 min readLW link

ARC Evals: Re­spon­si­ble Scal­ing Policies

Zach Stein-Perlman28 Sep 2023 4:30 UTC
40 points
9 comments2 min readLW link
(evals.alignment.org)

How to think about slow­ing AI

Zach Stein-Perlman17 Sep 2023 16:00 UTC
14 points
2 comments3 min readLW link
(forum.effectivealtruism.org)

Cruxes for overhang

Zach Stein-Perlman14 Sep 2023 17:00 UTC
12 points
5 comments6 min readLW link
(blog.aiimpacts.org)

Cruxes on US lead for some do­mes­tic AI regulation

Zach Stein-Perlman10 Sep 2023 18:00 UTC
26 points
3 comments2 min readLW link