Joe Carlsmith

Karma: 4,068

Senior research analyst at Open Philanthropy. Recently completed a doctorate in philosophy at the University of Oxford. Opinions my own.

A framework for thinking about AI power-seeking

Joe Carlsmith, 24 Jul 2024 22:41 UTC
53 points
6 comments, 16 min read, LW link

Loving a world you don’t trust

Joe Carlsmith, 18 Jun 2024 19:31 UTC
126 points
13 comments, 33 min read, LW link

On “first critical tries” in AI alignment

Joe Carlsmith, 5 Jun 2024 0:19 UTC
54 points
4 comments, 14 min read, LW link

On attunement

Joe Carlsmith, 25 Mar 2024 12:47 UTC
92 points
8 comments, 22 min read, LW link

Video and transcript of presentation on Scheming AIs

Joe Carlsmith, 22 Mar 2024 15:52 UTC
31 points
1 comment, 32 min read, LW link

On green

Joe Carlsmith, 21 Mar 2024 17:38 UTC
261 points
35 comments, 31 min read, LW link

On the abolition of man

Joe Carlsmith, 18 Jan 2024 18:17 UTC
88 points
18 comments, 41 min read, LW link

Being nicer than Clippy

Joe Carlsmith, 16 Jan 2024 19:44 UTC
109 points
23 comments, 27 min read, LW link

An even deeper atheism

Joe Carlsmith, 11 Jan 2024 17:28 UTC
125 points
47 comments, 15 min read, LW link

Does AI risk “other” the AIs?

Joe Carlsmith, 9 Jan 2024 17:51 UTC
59 points
3 comments, 8 min read, LW link

When “yang” goes wrong

Joe Carlsmith, 8 Jan 2024 16:35 UTC
72 points
6 comments, 13 min read, LW link

Deep atheism and AI risk

Joe Carlsmith, 4 Jan 2024 18:58 UTC
141 points
22 comments, 27 min read, LW link

Gentleness and the artificial Other

Joe Carlsmith, 2 Jan 2024 18:21 UTC
282 points
33 comments, 11 min read, LW link