Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Davidmanheim
Karma:
4,479
All
Posts
Comments
New
Top
Old
Page
1
Biorisk is an Unhelpful Analogy for AI Risk
Davidmanheim
6 May 2024 6:20 UTC
4
points
17
comments
1
min read
LW
link
A Dozen Ways to Get More Dakka
Davidmanheim
8 Apr 2024 4:45 UTC
110
points
12
comments
3
min read
LW
link
“Open Source AI” isn’t Open Source
Davidmanheim
15 Feb 2024 8:59 UTC
16
points
15
comments
1
min read
LW
link
(davidmanheim.substack.com)
Technologies and Terminology: AI isn’t Software, it’s… Deepware?
Davidmanheim
and
abramdemski
13 Feb 2024 13:37 UTC
40
points
9
comments
8
min read
LW
link
Safe Stasis Fallacy
Davidmanheim
5 Feb 2024 10:54 UTC
54
points
2
comments
1
min read
LW
link
AI Is Not Software
Davidmanheim
2 Jan 2024 7:58 UTC
56
points
29
comments
5
min read
LW
link
Public Call for Interest in Mathematical Alignment
Davidmanheim
22 Nov 2023 13:22 UTC
89
points
9
comments
1
min read
LW
link
What is autonomy, and how does it lead to greater risk from AI?
Davidmanheim
1 Aug 2023 7:58 UTC
30
points
0
comments
6
min read
LW
link
A Defense of Work on Mathematical AI Safety
Davidmanheim
6 Jul 2023 14:15 UTC
28
points
13
comments
3
min read
LW
link
(forum.effectivealtruism.org)
“Safety Culture for AI” is important, but isn’t going to be easy
Davidmanheim
26 Jun 2023 12:52 UTC
47
points
2
comments
2
min read
LW
link
(forum.effectivealtruism.org)
“LLMs Don’t Have a Coherent Model of the World”—What it Means, Why it Matters
Davidmanheim
1 Jun 2023 7:46 UTC
31
points
2
comments
7
min read
LW
link
Systems that cannot be unsafe cannot be safe
Davidmanheim
2 May 2023 8:53 UTC
62
points
27
comments
2
min read
LW
link
Beyond a better world
Davidmanheim
14 Dec 2022 10:18 UTC
14
points
7
comments
4
min read
LW
link
(progressforum.org)
Far-UVC Light Update: No, LEDs are not around the corner (tweetstorm)
Davidmanheim
2 Nov 2022 12:57 UTC
70
points
27
comments
4
min read
LW
link
(twitter.com)
Announcing AISIC 2022 - the AI Safety Israel Conference, October 19-20
Davidmanheim
21 Sep 2022 19:32 UTC
13
points
0
comments
1
min read
LW
link
Rehovot, Israel – ACX Meetups Everywhere 2022
Davidmanheim
25 Aug 2022 18:01 UTC
3
points
0
comments
1
min read
LW
link
AI Governance across Slow/Fast Takeoff and Easy/Hard Alignment spectra
Davidmanheim
3 Apr 2022 7:45 UTC
27
points
6
comments
3
min read
LW
link
Arguments about Highly Reliable Agent Designs as a Useful Path to Artificial Intelligence Safety
riceissa
and
Davidmanheim
27 Jan 2022 13:13 UTC
27
points
0
comments
1
min read
LW
link
(arxiv.org)
Elicitation for Modeling Transformative AI Risks
Davidmanheim
16 Dec 2021 15:24 UTC
30
points
2
comments
9
min read
LW
link
Modelling Transformative AI Risks (MTAIR) Project: Introduction
Davidmanheim
and
Aryeh Englander
16 Aug 2021 7:12 UTC
91
points
0
comments
9
min read
LW
link
Back to top
Next