RSS

Joe Carlsmith

Karma: 5,159

Senior research analyst at Open Philanthropy. Doctorate in philosophy from the University of Oxford. Opinions my own.

A frame­work for think­ing about AI power-seeking

Joe CarlsmithJul 24, 2024, 10:41 PM
62 points
15 comments16 min readLW link

Lov­ing a world you don’t trust

Joe CarlsmithJun 18, 2024, 7:31 PM
135 points
13 comments33 min readLW link

On “first crit­i­cal tries” in AI alignment

Joe CarlsmithJun 5, 2024, 12:19 AM
54 points
8 comments14 min readLW link

On attunement

Joe CarlsmithMar 25, 2024, 12:47 PM
100 points
12 comments22 min readLW link

Video and tran­script of pre­sen­ta­tion on Schem­ing AIs

Joe CarlsmithMar 22, 2024, 3:52 PM
32 points
1 comment32 min readLW link

On green

Joe CarlsmithMar 21, 2024, 5:38 PM
269 points
35 comments31 min readLW link

On the abo­li­tion of man

Joe CarlsmithJan 18, 2024, 6:17 PM
90 points
18 comments41 min readLW link

Be­ing nicer than Clippy

Joe CarlsmithJan 16, 2024, 7:44 PM
109 points
32 comments27 min readLW link

An even deeper atheism

Joe CarlsmithJan 11, 2024, 5:28 PM
125 points
47 comments15 min readLW link

Does AI risk “other” the AIs?

Joe CarlsmithJan 9, 2024, 5:51 PM
60 points
3 comments8 min readLW link

When “yang” goes wrong

Joe CarlsmithJan 8, 2024, 4:35 PM
73 points
6 comments13 min readLW link

Deep athe­ism and AI risk

Joe CarlsmithJan 4, 2024, 6:58 PM
153 points
22 comments27 min readLW link

Gentle­ness and the ar­tifi­cial Other

Joe CarlsmithJan 2, 2024, 6:21 PM
313 points
33 comments11 min readLW link

Oth­er­ness and con­trol in the age of AGI

Joe CarlsmithJan 2, 2024, 6:15 PM
43 points
0 comments7 min readLW link