Davidmanheim

Karma: 5,841

Security Complacency Meets Frontier AI: The Coming Collapse of ‘Secure by Apathy’

Davidmanheim25 Nov 2025 9:39 UTC

20 points

2 comments5 min readLW link

AIs should also refuse to work on capabilities research

Davidmanheim27 Oct 2025 8:42 UTC

150 points

20 comments3 min readLW link

12 Angry Agents, or: A Plan for AI Empathy

Ram Rachum and Davidmanheim

14 Oct 2025 15:24 UTC

21 points

4 comments12 min readLW link

Messy on Purpose: Part 2 of A Conservative Vision for the Future

Davidmanheim and Ram Rachum

7 Oct 2025 17:00 UTC

16 points

3 comments12 min readLW link

The Counterfactual Quiet AGI Timeline

Davidmanheim5 Oct 2025 9:09 UTC

71 points

5 comments9 min readLW link

A Conservative Vision For AI Alignment

Davidmanheim and Ram Rachum

21 Aug 2025 18:14 UTC

25 points

34 comments12 min readLW link

Semiotic Grounding as a Precondition for Safe and Cooperative AI

Davidmanheim27 Jul 2025 16:11 UTC

22 points

0 comments6 min readLW link

No, We’re Not Getting Meaningful Oversight of AI

Davidmanheim9 Jul 2025 11:10 UTC

48 points

4 comments1 min readLW link

(arxiv.org)

The Fragility of Naive Dynamism

Davidmanheim19 May 2025 7:51 UTC

20 points

1 comment17 min readLW link

Therapist in the Weights: Risks of Hyper-Introspection in Future AI Systems

Davidmanheim28 Apr 2025 6:42 UTC

15 points

1 comment5 min readLW link

Grounded Ghosts in the Machine—Friston Blankets, Mirror Neurons, and the Quest for Cooperative AI

Davidmanheim10 Apr 2025 10:15 UTC

9 points

0 comments9 min readLW link

(davidmanheim.com)

Davidmanheim’s Shortform

Davidmanheim16 Jan 2025 8:23 UTC

7 points

18 comments1 min readLW link

Exploring Cooperation: The Path to Utopia

Davidmanheim25 Dec 2024 18:31 UTC

11 points

0 comments14 min readLW link

(exploringcooperation.substack.com)

Moderately Skeptical of “Risks of Mirror Biology”

Davidmanheim20 Dec 2024 12:57 UTC

31 points

3 comments9 min readLW link

(substack.com)

Most Minds are Irrational

Davidmanheim10 Dec 2024 9:36 UTC

17 points

4 comments10 min readLW link

Refuting Searle’s wall, Putnam’s rock, and Johnson’s popcorn

Davidmanheim9 Dec 2024 8:24 UTC

9 points

31 comments1 min readLW link

Mitigating Geomagnetic Storm and EMP Risks to the Electrical Grid (Shallow Dive)

Davidmanheim26 Nov 2024 8:00 UTC

16 points

4 comments6 min readLW link

Proveably Safe Self Driving Cars [Modulo Assumptions]

Davidmanheim15 Sep 2024 13:58 UTC

27 points

29 comments8 min readLW link

Are LLMs on the Path to AGI?

Davidmanheim30 Aug 2024 3:14 UTC

14 points

2 comments5 min readLW link

Scaling Laws and Likely Limits to AI

Davidmanheim18 Aug 2024 17:19 UTC

19 points

0 comments3 min readLW link