Davidmanheim

Karma: 5,514

12 Angry Agents, or: A Plan for AI Empathy

14 Oct 2025 15:24 UTC
21 points
4 comments, 12 min read, LW link

Messy on Purpose: Part 2 of A Conservative Vision for the Future

7 Oct 2025 17:00 UTC
14 points
3 comments, 12 min read, LW link

The Counterfactual Quiet AGI Timeline

Davidmanheim 5 Oct 2025 9:09 UTC
66 points
5 comments, 9 min read, LW link

A Conservative Vision For AI Alignment

21 Aug 2025 18:14 UTC
25 points
34 comments, 12 min read, LW link

Semiotic Grounding as a Precondition for Safe and Cooperative AI

Davidmanheim 27 Jul 2025 16:11 UTC
22 points
0 comments, 6 min read, LW link

No, We’re Not Getting Meaningful Oversight of AI

Davidmanheim 9 Jul 2025 11:10 UTC
42 points
4 comments, 1 min read, LW link
(arxiv.org)

The Fragility of Naive Dynamism

Davidmanheim 19 May 2025 7:51 UTC
20 points
1 comment, 17 min read, LW link

Therapist in the Weights: Risks of Hyper-Introspection in Future AI Systems

Davidmanheim 28 Apr 2025 6:42 UTC
15 points
1 comment, 5 min read, LW link

Grounded Ghosts in the Machine - Friston Blankets, Mirror Neurons, and the Quest for Cooperative AI

Davidmanheim 10 Apr 2025 10:15 UTC
9 points
0 comments, 9 min read, LW link
(davidmanheim.com)

Davidmanheim’s Shortform

Davidmanheim 16 Jan 2025 8:23 UTC
7 points
18 comments, 1 min read, LW link

Exploring Cooperation: The Path to Utopia

Davidmanheim 25 Dec 2024 18:31 UTC
11 points
0 comments, 14 min read, LW link
(exploringcooperation.substack.com)

Moderately Skeptical of “Risks of Mirror Biology”

Davidmanheim 20 Dec 2024 12:57 UTC
31 points
3 comments, 9 min read, LW link
(substack.com)

Most Minds are Irrational

Davidmanheim 10 Dec 2024 9:36 UTC
17 points
4 comments, 10 min read, LW link

Refuting Searle’s wall, Putnam’s rock, and Johnson’s popcorn

Davidmanheim 9 Dec 2024 8:24 UTC
9 points
31 comments, 1 min read, LW link

Mitigating Geomagnetic Storm and EMP Risks to the Electrical Grid (Shallow Dive)

Davidmanheim 26 Nov 2024 8:00 UTC
16 points
4 comments, 6 min read, LW link

Proveably Safe Self Driving Cars [Modulo Assumptions]

Davidmanheim 15 Sep 2024 13:58 UTC
27 points
29 comments, 8 min read, LW link

Are LLMs on the Path to AGI?

Davidmanheim 30 Aug 2024 3:14 UTC
14 points
2 comments, 5 min read, LW link

Scaling Laws and Likely Limits to AI

Davidmanheim 18 Aug 2024 17:19 UTC
19 points
0 comments, 3 min read, LW link

Misnaming and Other Issues with OpenAI’s “Human Level” Superintelligence Hierarchy

Davidmanheim 15 Jul 2024 5:50 UTC
49 points
2 comments, 3 min read, LW link

Biorisk is an Unhelpful Analogy for AI Risk

Davidmanheim 6 May 2024 6:20 UTC
4 points
17 comments, 3 min read, LW link