RSS

Joe Carlsmith

Karma: 3,751

Senior research analyst at Open Philanthropy. Recently completed a doctorate in philosophy at the University of Oxford. Opinions my own.

On attunement

Joe Carlsmith25 Mar 2024 12:47 UTC
91 points
8 comments22 min readLW link

Video and tran­script of pre­sen­ta­tion on Schem­ing AIs

Joe Carlsmith22 Mar 2024 15:52 UTC
31 points
1 comment32 min readLW link

On green

Joe Carlsmith21 Mar 2024 17:38 UTC
249 points
33 comments31 min readLW link

On the abo­li­tion of man

Joe Carlsmith18 Jan 2024 18:17 UTC
88 points
18 comments41 min readLW link

Be­ing nicer than Clippy

Joe Carlsmith16 Jan 2024 19:44 UTC
106 points
22 comments27 min readLW link

An even deeper atheism

Joe Carlsmith11 Jan 2024 17:28 UTC
124 points
47 comments15 min readLW link

Does AI risk “other” the AIs?

Joe Carlsmith9 Jan 2024 17:51 UTC
59 points
3 comments8 min readLW link

When “yang” goes wrong

Joe Carlsmith8 Jan 2024 16:35 UTC
72 points
6 comments13 min readLW link

Deep athe­ism and AI risk

Joe Carlsmith4 Jan 2024 18:58 UTC
130 points
22 comments27 min readLW link

Gentle­ness and the ar­tifi­cial Other

Joe Carlsmith2 Jan 2024 18:21 UTC
265 points
32 comments11 min readLW link

Oth­er­ness and con­trol in the age of AGI

Joe Carlsmith2 Jan 2024 18:15 UTC
30 points
0 comments7 min readLW link

Em­piri­cal work that might shed light on schem­ing (Sec­tion 6 of “Schem­ing AIs”)

Joe Carlsmith11 Dec 2023 16:30 UTC
8 points
0 comments21 min readLW link

Sum­ming up “Schem­ing AIs” (Sec­tion 5)

Joe Carlsmith9 Dec 2023 15:48 UTC
2 points
0 comments11 min readLW link

Speed ar­gu­ments against schem­ing (Sec­tion 4.4-4.7 of “Schem­ing AIs”)

Joe Carlsmith8 Dec 2023 21:09 UTC
9 points
0 comments15 min readLW link