Zach Stein-Perlman
Karma: 7,850
AI strategy & governance. ailabwatch.org. ailabwatch.substack.com.
DeepSeek beats o1-preview on math, ties on coding; will release weights
Zach Stein-Perlman, 20 Nov 2024 23:50 UTC, 111 points, 23 comments, 1 min read
Anthropic: Three Sketches of ASL-4 Safety Case Components
Zach Stein-Perlman, 6 Nov 2024 16:00 UTC, 94 points, 33 comments, 1 min read (alignment.anthropic.com)
The current state of RSPs
Zach Stein-Perlman, 4 Nov 2024 16:00 UTC, 12 points, 0 comments, 9 min read
Miles Brundage: Finding Ways to Credibly Signal the Benignness of AI Development and Deployment is an Urgent Priority
Zach Stein-Perlman, 28 Oct 2024 17:00 UTC, 22 points, 3 comments, 3 min read (milesbrundage.substack.com)
UK AISI: Early lessons from evaluating frontier AI systems
Zach Stein-Perlman, 25 Oct 2024 19:00 UTC, 26 points, 0 comments, 2 min read (www.aisi.gov.uk)
Lab governance reading list
Zach Stein-Perlman, 25 Oct 2024 18:00 UTC, 20 points, 3 comments, 1 min read
IAPS: Mapping Technical Safety Research at AI Companies
Zach Stein-Perlman, 24 Oct 2024 20:30 UTC, 42 points, 13 comments, 1 min read (www.iaps.ai)
What AI companies should do: Some rough ideas
Zach Stein-Perlman, 21 Oct 2024 14:00 UTC, 33 points, 10 comments, 5 min read
Anthropic rewrote its RSP
Zach Stein-Perlman, 15 Oct 2024 14:25 UTC, 39 points, 19 comments, 6 min read
Model evals for dangerous capabilities
Zach Stein-Perlman, 23 Sep 2024 11:00 UTC, 51 points, 11 comments, 3 min read
OpenAI o1
Zach Stein-Perlman, 12 Sep 2024 17:30 UTC, 147 points, 41 comments, 1 min read
Demis Hassabis — Google DeepMind: The Podcast
Zach Stein-Perlman, 16 Aug 2024 0:00 UTC, 55 points, 8 comments, 3 min read (www.youtube.com)
GPT-4o System Card
Zach Stein-Perlman, 8 Aug 2024 20:30 UTC, 68 points, 11 comments, 2 min read (openai.com)
AI labs can boost external safety research
Zach Stein-Perlman, 31 Jul 2024 19:30 UTC, 31 points, 1 comment, 1 min read
Safety consultations for AI lab employees
Zach Stein-Perlman, 27 Jul 2024 15:00 UTC, 181 points, 4 comments, 1 min read
New page: Integrity
Zach Stein-Perlman, 10 Jul 2024 15:00 UTC, 91 points, 3 comments, 1 min read
Claude 3.5 Sonnet
Zach Stein-Perlman, 20 Jun 2024 18:00 UTC, 75 points, 41 comments, 1 min read (www.anthropic.com)
Anthropic’s Certificate of Incorporation
Zach Stein-Perlman, 12 Jun 2024 13:00 UTC, 115 points, 4 comments, 4 min read
Companies’ safety plans neglect risks from scheming AI
Zach Stein-Perlman, 3 Jun 2024 15:00 UTC, 73 points, 4 comments, 6 min read
AI companies’ commitments
Zach Stein-Perlman, 29 May 2024 11:00 UTC, 36 points, 0 comments, 1 min read