Zach Stein-Perlman

Karma: 9,853

AI strategy & governance. ailabwatch.org. ailabwatch.substack.com.

What AI companies should do: Some rough ideas

Zach Stein-Perlman · 21 Oct 2024 14:00 UTC

33 points

10 comments · 5 min read · LW link

Anthropic rewrote its RSP

Zach Stein-Perlman · 15 Oct 2024 14:25 UTC

46 points

19 comments · 6 min read · LW link

Model evals for dangerous capabilities

Zach Stein-Perlman · 23 Sep 2024 11:00 UTC

51 points

11 comments · 3 min read · LW link

OpenAI o1

Zach Stein-Perlman · 12 Sep 2024 17:30 UTC

147 points

41 comments · 1 min read · LW link

Demis Hassabis — Google DeepMind: The Podcast

Zach Stein-Perlman · 16 Aug 2024 0:00 UTC

55 points

8 comments · 3 min read · LW link

(www.youtube.com)

GPT-4o System Card

Zach Stein-Perlman · 8 Aug 2024 20:30 UTC

68 points

11 comments · 2 min read · LW link

(openai.com)

AI labs can boost external safety research

Zach Stein-Perlman · 31 Jul 2024 19:30 UTC

31 points

1 comment · 1 min read · LW link

Safety consultations for AI lab employees

Zach Stein-Perlman · 27 Jul 2024 15:00 UTC

181 points

4 comments · 1 min read · LW link

New page: Integrity

Zach Stein-Perlman · 10 Jul 2024 15:00 UTC

91 points

3 comments · 1 min read · LW link

Claude 3.5 Sonnet

Zach Stein-Perlman · 20 Jun 2024 18:00 UTC

75 points

41 comments · 1 min read · LW link

(www.anthropic.com)

Anthropic’s Certificate of Incorporation

Zach Stein-Perlman · 12 Jun 2024 13:00 UTC

115 points

7 comments · 4 min read · LW link

Companies’ safety plans neglect risks from scheming AI

Zach Stein-Perlman · 3 Jun 2024 15:00 UTC

73 points

4 comments · 6 min read · LW link

AI companies’ commitments

Zach Stein-Perlman · 29 May 2024 11:00 UTC

36 points

0 comments · 1 min read · LW link

Maybe Anthropic’s Long-Term Benefit Trust is powerless

Zach Stein-Perlman · 27 May 2024 13:00 UTC

202 points

21 comments · 2 min read · LW link

AI companies aren’t really using external evaluators

Zach Stein-Perlman · 24 May 2024 16:01 UTC

242 points

15 comments · 4 min read · LW link

New voluntary commitments (AI Seoul Summit)

Zach Stein-Perlman · 21 May 2024 11:00 UTC

81 points

17 comments · 7 min read · LW link

(www.gov.uk)

DeepMind’s “Frontier Safety Framework” is weak and unambitious

Zach Stein-Perlman · 18 May 2024 3:00 UTC

159 points

14 comments · 4 min read · LW link

DeepMind: Frontier Safety Framework

Zach Stein-Perlman · 17 May 2024 17:30 UTC

64 points

0 comments · 3 min read · LW link

(deepmind.google)

Ilya Sutskever and Jan Leike resign from OpenAI [updated]

Zach Stein-Perlman · 15 May 2024 0:45 UTC

246 points

95 comments · 2 min read · LW link

Questions for labs

Zach Stein-Perlman · 30 Apr 2024 22:15 UTC

77 points

11 comments · 8 min read · LW link