RSS

David Duvenaud

Karma: 1,201

My website is https://​​www.cs.toronto.edu/​​~duvenaud/​​

Per­sona Self-repli­ca­tion experiment

2 Apr 2026 18:18 UTC
39 points
0 comments8 min readLW link
(theartificialself.ai)

Per­sona self-repli­ca­tion experiment

2 Apr 2026 18:10 UTC
8 points
0 comments8 min readLW link

Models differ in iden­tity propensities

16 Mar 2026 10:45 UTC
58 points
0 comments14 min readLW link

The Ar­tifi­cial Self

15 Mar 2026 1:37 UTC
118 points
13 comments29 min readLW link

Disem­pow­er­ment pat­terns in real-world AI usage

29 Jan 2026 16:36 UTC
49 points
3 comments2 min readLW link
(www.anthropic.com)

When does com­pe­ti­tion lead to recog­nis­able val­ues?

12 Jan 2026 23:13 UTC
65 points
18 comments25 min readLW link
(postagi.org)

The Eco­nomics of Trans­for­ma­tive AI

8 Jan 2026 22:22 UTC
64 points
4 comments18 min readLW link
(post-agi.org)

Up­com­ing Work­shop on Post-AGI Eco­nomics, Cul­ture, and Governance

28 Oct 2025 21:55 UTC
43 points
1 comment2 min readLW link

Sum­mary of our Work­shop on Post-AGI Outcomes

29 Aug 2025 17:14 UTC
110 points
3 comments3 min readLW link

Up­com­ing work­shop on Post-AGI Civ­i­liza­tional Equilibria

21 Jun 2025 15:57 UTC
25 points
0 comments1 min readLW link

Grad­ual Disem­pow­er­ment: Sys­temic Ex­is­ten­tial Risks from In­cre­men­tal AI Development

30 Jan 2025 17:03 UTC
189 points
65 comments2 min readLW link
(gradual-disempowerment.ai)

Sab­o­tage Eval­u­a­tions for Fron­tier Models

18 Oct 2024 22:33 UTC
95 points
56 comments6 min readLW link
(assets.anthropic.com)

Sim­ple probes can catch sleeper agents

23 Apr 2024 21:10 UTC
133 points
21 comments1 min readLW link
(www.anthropic.com)

Sleeper Agents: Train­ing De­cep­tive LLMs that Per­sist Through Safety Training

12 Jan 2024 19:51 UTC
310 points
95 comments3 min readLW link
(arxiv.org)