RSS

Joe Carlsmith

Karma: 3,751

Senior research analyst at Open Philanthropy. Recently completed a doctorate in philosophy at the University of Oxford. Opinions my own.

On attunement

Joe Carlsmith25 Mar 2024 12:47 UTC
91 points
8 comments22 min readLW link

Video and tran­script of pre­sen­ta­tion on Schem­ing AIs

Joe Carlsmith22 Mar 2024 15:52 UTC
31 points
1 comment32 min readLW link

On green

Joe Carlsmith21 Mar 2024 17:38 UTC
249 points
33 comments31 min readLW link

On the abo­li­tion of man

Joe Carlsmith18 Jan 2024 18:17 UTC
88 points
18 comments41 min readLW link

Be­ing nicer than Clippy

Joe Carlsmith16 Jan 2024 19:44 UTC
106 points
22 comments27 min readLW link

An even deeper atheism

Joe Carlsmith11 Jan 2024 17:28 UTC
124 points
47 comments15 min readLW link

Does AI risk “other” the AIs?

Joe Carlsmith9 Jan 2024 17:51 UTC
59 points
3 comments8 min readLW link

When “yang” goes wrong

Joe Carlsmith8 Jan 2024 16:35 UTC
72 points
6 comments13 min readLW link

Deep athe­ism and AI risk

Joe Carlsmith4 Jan 2024 18:58 UTC
130 points
22 comments27 min readLW link

Gentle­ness and the ar­tifi­cial Other

Joe Carlsmith2 Jan 2024 18:21 UTC
265 points
32 comments11 min readLW link

Oth­er­ness and con­trol in the age of AGI

Joe Carlsmith2 Jan 2024 18:15 UTC
30 points
0 comments7 min readLW link

Em­piri­cal work that might shed light on schem­ing (Sec­tion 6 of “Schem­ing AIs”)

Joe Carlsmith11 Dec 2023 16:30 UTC
8 points
0 comments21 min readLW link

Sum­ming up “Schem­ing AIs” (Sec­tion 5)

Joe Carlsmith9 Dec 2023 15:48 UTC
2 points
0 comments11 min readLW link

Speed ar­gu­ments against schem­ing (Sec­tion 4.4-4.7 of “Schem­ing AIs”)

Joe Carlsmith8 Dec 2023 21:09 UTC
9 points
0 comments15 min readLW link

Sim­plic­ity ar­gu­ments for schem­ing (Sec­tion 4.3 of “Schem­ing AIs”)

Joe Carlsmith7 Dec 2023 15:05 UTC
10 points
1 comment19 min readLW link

The count­ing ar­gu­ment for schem­ing (Sec­tions 4.1 and 4.2 of “Schem­ing AIs”)

Joe Carlsmith6 Dec 2023 19:28 UTC
8 points
0 comments10 min readLW link

Ar­gu­ments for/​against schem­ing that fo­cus on the path SGD takes (Sec­tion 3 of “Schem­ing AIs”)

Joe Carlsmith5 Dec 2023 18:48 UTC
10 points
0 comments23 min readLW link

Non-clas­sic sto­ries about schem­ing (Sec­tion 2.3.2 of “Schem­ing AIs”)

Joe Carlsmith4 Dec 2023 18:44 UTC
9 points
0 comments20 min readLW link

Does schem­ing lead to ad­e­quate fu­ture em­pow­er­ment? (Sec­tion 2.3.1.2 of “Schem­ing AIs”)

Joe Carlsmith3 Dec 2023 18:32 UTC
9 points
0 comments17 min readLW link

The goal-guard­ing hy­poth­e­sis (Sec­tion 2.3.1.1 of “Schem­ing AIs”)

Joe Carlsmith2 Dec 2023 15:20 UTC
8 points
1 comment15 min readLW link