Zach Stein-Perlman

Karma: 10,667

AI strategy & governance. ailabwatch.org. ailabwatch.substack.com.

Miles Brundage: Finding Ways to Credibly Signal the Benignness of AI Development and Deployment is an Urgent Priority

Zach Stein-Perlman · Oct 28, 2024, 5:00 PM
22 points

14 votes

Overall karma indicates overall quality.

4 comments · 3 min read · LW link
(milesbrundage.substack.com)

UK AISI: Early lessons from evaluating frontier AI systems

Zach Stein-Perlman · Oct 25, 2024, 7:00 PM
26 points

9 votes

0 comments · 2 min read · LW link
(www.aisi.gov.uk)

Lab governance reading list

Zach Stein-Perlman · Oct 25, 2024, 6:00 PM
20 points

8 votes

3 comments · 1 min read · LW link

IAPS: Mapping Technical Safety Research at AI Companies

Zach Stein-Perlman · Oct 24, 2024, 8:30 PM
42 points

14 votes

13 comments · 1 min read · LW link
(www.iaps.ai)

What AI companies should do: Some rough ideas

Zach Stein-Perlman · Oct 21, 2024, 2:00 PM
33 points

11 votes

10 comments · 5 min read · LW link

Anthropic rewrote its RSP

Zach Stein-Perlman · Oct 15, 2024, 2:25 PM
46 points

19 votes

19 comments · 6 min read · LW link

Model evals for dangerous capabilities

Zach Stein-Perlman · Sep 23, 2024, 11:00 AM
51 points

15 votes

11 comments · 3 min read · LW link

OpenAI o1

Zach Stein-Perlman · Sep 12, 2024, 5:30 PM
146 points

69 votes

41 comments · 1 min read · LW link

Demis Hassabis — Google DeepMind: The Podcast

Zach Stein-Perlman · Aug 16, 2024, 12:00 AM
55 points

20 votes

8 comments · 3 min read · LW link
(www.youtube.com)

GPT-4o System Card

Zach Stein-Perlman · Aug 8, 2024, 8:30 PM
68 points

17 votes

11 comments · 2 min read · LW link
(openai.com)

AI labs can boost external safety research

Zach Stein-Perlman · Jul 31, 2024, 7:30 PM
31 points

19 votes

1 comment · 1 min read · LW link

Safety consultations for AI lab employees

Zach Stein-Perlman · Jul 27, 2024, 3:00 PM
183 points

55 votes

4 comments · 1 min read · LW link

New page: Integrity

Zach Stein-Perlman · Jul 10, 2024, 3:00 PM
91 points

29 votes

3 comments · 1 min read · LW link

Claude 3.5 Sonnet

Zach Stein-Perlman · Jun 20, 2024, 6:00 PM
75 points

29 votes

41 comments · 1 min read · LW link
(www.anthropic.com)

Anthropic’s Certificate of Incorporation

Zach Stein-Perlman · Jun 12, 2024, 1:00 PM
115 points

39 votes

7 comments · 4 min read · LW link

Companies’ safety plans neglect risks from scheming AI

Zach Stein-Perlman · Jun 3, 2024, 3:00 PM
73 points

21 votes

4 comments · 6 min read · LW link

AI companies’ commitments

Zach Stein-Perlman · May 29, 2024, 11:00 AM
36 points

10 votes

0 comments · 1 min read · LW link

Maybe Anthropic’s Long-Term Benefit Trust is powerless

Zach Stein-Perlman · May 27, 2024, 1:00 PM
206 points

76 votes

21 comments · 2 min read · LW link

AI companies aren’t really using external evaluators

Zach Stein-Perlman · May 24, 2024, 4:01 PM
242 points

90 votes

15 comments · 4 min read · LW link

New voluntary commitments (AI Seoul Summit)

Zach Stein-Perlman · May 21, 2024, 11:00 AM
81 points

26 votes

17 comments · 7 min read · LW link
(www.gov.uk)