Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Zach Stein-Perlman
Karma:
9,853
AI strategy & governance.
ailabwatch.org
.
ailabwatch.substack.com
.
All
Posts
Comments
New
Top
Old
Page
2
What AI companies should do: Some rough ideas
Zach Stein-Perlman
21 Oct 2024 14:00 UTC
33
points
10
comments
5
min read
LW
link
Anthropic rewrote its RSP
Zach Stein-Perlman
15 Oct 2024 14:25 UTC
46
points
19
comments
6
min read
LW
link
Model evals for dangerous capabilities
Zach Stein-Perlman
23 Sep 2024 11:00 UTC
51
points
11
comments
3
min read
LW
link
OpenAI o1
Zach Stein-Perlman
12 Sep 2024 17:30 UTC
147
points
41
comments
1
min read
LW
link
Demis Hassabis — Google DeepMind: The Podcast
Zach Stein-Perlman
16 Aug 2024 0:00 UTC
55
points
8
comments
3
min read
LW
link
(www.youtube.com)
GPT-4o System Card
Zach Stein-Perlman
8 Aug 2024 20:30 UTC
68
points
11
comments
2
min read
LW
link
(openai.com)
AI labs can boost external safety research
Zach Stein-Perlman
31 Jul 2024 19:30 UTC
31
points
1
comment
1
min read
LW
link
Safety consultations for AI lab employees
Zach Stein-Perlman
27 Jul 2024 15:00 UTC
181
points
4
comments
1
min read
LW
link
New page: Integrity
Zach Stein-Perlman
10 Jul 2024 15:00 UTC
91
points
3
comments
1
min read
LW
link
Claude 3.5 Sonnet
Zach Stein-Perlman
20 Jun 2024 18:00 UTC
75
points
41
comments
1
min read
LW
link
(www.anthropic.com)
Anthropic’s Certificate of Incorporation
Zach Stein-Perlman
12 Jun 2024 13:00 UTC
115
points
7
comments
4
min read
LW
link
Companies’ safety plans neglect risks from scheming AI
Zach Stein-Perlman
3 Jun 2024 15:00 UTC
73
points
4
comments
6
min read
LW
link
AI companies’ commitments
Zach Stein-Perlman
29 May 2024 11:00 UTC
36
points
0
comments
1
min read
LW
link
Maybe Anthropic’s Long-Term Benefit Trust is powerless
Zach Stein-Perlman
27 May 2024 13:00 UTC
202
points
21
comments
2
min read
LW
link
AI companies aren’t really using external evaluators
Zach Stein-Perlman
24 May 2024 16:01 UTC
242
points
15
comments
4
min read
LW
link
New voluntary commitments (AI Seoul Summit)
Zach Stein-Perlman
21 May 2024 11:00 UTC
81
points
17
comments
7
min read
LW
link
(www.gov.uk)
DeepMind’s “Frontier Safety Framework” is weak and unambitious
Zach Stein-Perlman
18 May 2024 3:00 UTC
159
points
14
comments
4
min read
LW
link
DeepMind: Frontier Safety Framework
Zach Stein-Perlman
17 May 2024 17:30 UTC
64
points
0
comments
3
min read
LW
link
(deepmind.google)
Ilya Sutskever and Jan Leike resign from OpenAI [updated]
Zach Stein-Perlman
15 May 2024 0:45 UTC
246
points
95
comments
2
min read
LW
link
Questions for labs
Zach Stein-Perlman
30 Apr 2024 22:15 UTC
77
points
11
comments
8
min read
LW
link
Previous
Back to top
Next