Davidmanheim

Karma: 4,479

Biorisk is an Unhelpful Analogy for AI Risk

Davidmanheim6 May 2024 6:20 UTC

4 points

17 comments1 min readLW link

A Dozen Ways to Get More Dakka

Davidmanheim8 Apr 2024 4:45 UTC

110 points

12 comments3 min readLW link

“Open Source AI” isn’t Open Source

Davidmanheim15 Feb 2024 8:59 UTC

16 points

15 comments1 min readLW link

(davidmanheim.substack.com)

Technologies and Terminology: AI isn’t Software, it’s… Deepware?

Davidmanheim and abramdemski

13 Feb 2024 13:37 UTC

40 points

9 comments8 min readLW link

Safe Stasis Fallacy

Davidmanheim5 Feb 2024 10:54 UTC

54 points

2 comments1 min readLW link

AI Is Not Software

Davidmanheim2 Jan 2024 7:58 UTC

56 points

29 comments5 min readLW link

Public Call for Interest in Mathematical Alignment

Davidmanheim22 Nov 2023 13:22 UTC

89 points

9 comments1 min readLW link

What is autonomy, and how does it lead to greater risk from AI?

Davidmanheim1 Aug 2023 7:58 UTC

30 points

0 comments6 min readLW link

A Defense of Work on Mathematical AI Safety

Davidmanheim6 Jul 2023 14:15 UTC

28 points

13 comments3 min readLW link

(forum.effectivealtruism.org)

“Safety Culture for AI” is important, but isn’t going to be easy

Davidmanheim26 Jun 2023 12:52 UTC

47 points

2 comments2 min readLW link

(forum.effectivealtruism.org)

“LLMs Don’t Have a Coherent Model of the World”—What it Means, Why it Matters

Davidmanheim1 Jun 2023 7:46 UTC

31 points

2 comments7 min readLW link

Systems that cannot be unsafe cannot be safe

Davidmanheim2 May 2023 8:53 UTC

62 points

27 comments2 min readLW link

Beyond a better world

Davidmanheim14 Dec 2022 10:18 UTC

14 points

7 comments4 min readLW link

(progressforum.org)

Far-UVC Light Update: No, LEDs are not around the corner (tweetstorm)

Davidmanheim2 Nov 2022 12:57 UTC

70 points

27 comments4 min readLW link

(twitter.com)

Announcing AISIC 2022 - the AI Safety Israel Conference, October 19-20

Davidmanheim21 Sep 2022 19:32 UTC

13 points

0 comments1 min readLW link

Rehovot, Israel – ACX Meetups Everywhere 2022

Davidmanheim25 Aug 2022 18:01 UTC

3 points

0 comments1 min readLW link

AI Governance across Slow/Fast Takeoff and Easy/Hard Alignment spectra

Davidmanheim3 Apr 2022 7:45 UTC

27 points

6 comments3 min readLW link

Arguments about Highly Reliable Agent Designs as a Useful Path to Artificial Intelligence Safety

riceissa and Davidmanheim

27 Jan 2022 13:13 UTC

27 points

0 comments1 min readLW link

(arxiv.org)

Elicitation for Modeling Transformative AI Risks

Davidmanheim16 Dec 2021 15:24 UTC

30 points

2 comments9 min readLW link

Modelling Transformative AI Risks (MTAIR) Project: Introduction

Davidmanheim and Aryeh Englander

16 Aug 2021 7:12 UTC

91 points

0 comments9 min readLW link