Power Buys You Dis­tance From The Crime

ElizabethAug 2, 2019, 8:50 PM
213 points
75 comments7 min readLW link1 review
(acesounderglass.com)

Why Subagents?

johnswentworthAug 1, 2019, 10:17 PM
175 points
48 comments7 min readLW link1 review

The Com­mit­ment Races problem

Daniel KokotajloAug 23, 2019, 1:58 AM
159 points
56 comments5 min readLW link

Soft take­off can still lead to de­ci­sive strate­gic advantage

Daniel KokotajloAug 23, 2019, 4:39 PM
122 points
47 comments8 min readLW link4 reviews

Subagents, trauma and rationality

Kaj_SotalaAug 14, 2019, 1:14 PM
113 points
4 comments19 min readLW link

Trauma, Med­i­ta­tion, and a Cool Scar

Logan RiggsAug 6, 2019, 4:17 PM
102 points
17 comments5 min readLW link1 review

[Question] Can we re­ally pre­vent all warm­ing for less than 10B$ with the mostly side-effect free geo­eng­ineer­ing tech­nique of Marine Cloud Bright­en­ing?

mako yassAug 5, 2019, 12:12 AM
94 points
55 comments2 min readLW link

Par­tial sum­mary of de­bate with Ben­quo and Jes­si­cata [pt 1]

RaemonAug 14, 2019, 8:02 PM
89 points
63 comments22 min readLW link3 reviews

Subagents, neu­ral Tur­ing ma­chines, thought se­lec­tion, and blindspots

Kaj_SotalaAug 6, 2019, 9:15 PM
87 points
3 comments12 min readLW link

Troll Bridge

abramdemskiAug 23, 2019, 6:36 PM
86 points
59 comments12 min readLW link

2-D Robustness

Vlad MikulikAug 30, 2019, 8:27 PM
85 points
8 comments2 min readLW link

Prob­lems in AI Align­ment that philoso­phers could po­ten­tially con­tribute to

Wei DaiAug 17, 2019, 5:38 PM
79 points
14 comments2 min readLW link

Clar­ify­ing some key hy­pothe­ses in AI alignment

Aug 15, 2019, 9:29 PM
79 points
12 comments9 min readLW link

Mar­kets are Univer­sal for Log­i­cal Induction

johnswentworthAug 22, 2019, 6:44 AM
75 points
2 comments5 min readLW link

Six AI Risk/​Strat­egy Ideas

Wei DaiAug 27, 2019, 12:40 AM
73 points
17 comments4 min readLW link1 review

Clas­sify­ing speci­fi­ca­tion prob­lems as var­i­ants of Good­hart’s Law

VikaAug 19, 2019, 8:40 PM
72 points
5 comments5 min readLW link1 review

[Question] Does Agent-like Be­hav­ior Im­ply Agent-like Ar­chi­tec­ture?

Scott GarrabrantAug 23, 2019, 2:01 AM
69 points
8 comments1 min readLW link

Re­sponse to Glen Weyl on Tech­noc­racy and the Ra­tion­al­ist Community

John_MaxwellAug 22, 2019, 11:14 PM
66 points
9 comments10 min readLW link

[Question] Why so much var­i­ance in hu­man in­tel­li­gence?

Ben PaceAug 22, 2019, 10:36 PM
65 points
28 comments4 min readLW link

Book Re­view: Sec­u­lar Cycles

Scott AlexanderAug 13, 2019, 4:10 AM
62 points
10 comments16 min readLW link1 review
(slatestarcodex.com)

Dual Wielding

ZviAug 27, 2019, 2:10 PM
60 points
23 comments2 min readLW link3 reviews
(thezvi.wordpress.com)

How to Make Billions of Dol­lars Re­duc­ing Loneliness

John_MaxwellAug 30, 2019, 5:30 PM
60 points
32 comments7 min readLW link

Schel­ling Cat­e­gories, and Sim­ple Mem­ber­ship Tests

Zack_M_DavisAug 26, 2019, 2:43 AM
59 points
10 comments8 min readLW link

Ta­boo­ing ‘Agent’ for Pro­saic Alignment

Hjalmar_WijkAug 23, 2019, 2:55 AM
57 points
10 comments6 min readLW link

Ac­tu­ally updating

SaraHaxAug 23, 2019, 5:46 PM
56 points
10 comments4 min readLW link

In­ten­tional Bucket Errors

Scott GarrabrantAug 22, 2019, 8:02 PM
55 points
6 comments3 min readLW link

Com­pu­ta­tional Model: Causal Di­a­grams with Symmetry

johnswentworthAug 22, 2019, 5:54 PM
53 points
29 comments4 min readLW link

Zeno walks into a bar

lsusrAug 4, 2019, 7:00 AM
53 points
4 comments2 min readLW link

Per­mis­sions in Governance

sarahconstantinAug 2, 2019, 7:50 PM
53 points
12 comments8 min readLW link
(srconstantin.wordpress.com)

A Per­sonal Ra­tion­al­ity Wishlist

DanielFilanAug 27, 2019, 3:40 AM
53 points
54 comments4 min readLW link
(danielfilan.com)

AI Fore­cast­ing Dic­tionary (Fore­cast­ing in­fras­truc­ture, part 1)

Aug 8, 2019, 4:10 PM
50 points
0 comments5 min readLW link

Vaniver’s View on Fac­tored Cognition

VaniverAug 23, 2019, 2:54 AM
48 points
4 comments8 min readLW link

Sta­tus 451 on Di­ag­no­sis: Rus­sell Aphasia

Zack_M_DavisAug 6, 2019, 4:43 AM
48 points
1 comment1 min readLW link
(status451.com)

Towards a mechanis­tic un­der­stand­ing of corrigibility

evhubAug 22, 2019, 11:20 PM
47 points
26 comments4 min readLW link

Septem­ber Brag­ging Thread

RaemonAug 30, 2019, 9:58 PM
47 points
12 comments1 min readLW link

[Link] Book Re­view: Refram­ing Su­per­in­tel­li­gence (SSC)

ioannesAug 28, 2019, 10:57 PM
46 points
9 comments2 min readLW link

[Question] How Can Peo­ple Eval­u­ate Com­plex Ques­tions Con­sis­tently?

ElizabethAug 26, 2019, 8:33 PM
46 points
12 comments1 min readLW link

New pa­per: Cor­rigi­bil­ity with Utility Preservation

Koen.HoltmanAug 6, 2019, 7:04 PM
44 points
11 comments2 min readLW link

Embed­ded Agency via Abstraction

johnswentworthAug 26, 2019, 11:03 PM
42 points
20 comments11 min readLW link

My recom­men­da­tions for grat­i­tude exercises

MaxCarpendaleAug 5, 2019, 7:04 PM
40 points
3 comments5 min readLW link

The Miss­ing Math of Map-Making

johnswentworthAug 28, 2019, 9:18 PM
40 points
8 comments2 min readLW link

LW Team Up­dates—Septem­ber 2019

RubyAug 29, 2019, 10:12 PM
39 points
13 comments2 min readLW link

Epistemic Spot Check: The Fate of Rome (Kyle Harper)

ElizabethAug 24, 2019, 9:40 PM
39 points
3 comments5 min readLW link
(acesounderglass.com)

Call for con­trib­u­tors to the Align­ment Newsletter

Rohin ShahAug 21, 2019, 6:21 PM
39 points
0 comments4 min readLW link

Cephaloponderings

Jacob FalkovichAug 4, 2019, 4:45 PM
39 points
4 comments7 min readLW link

Op­ti­miza­tion Provenance

Adele LopezAug 23, 2019, 8:08 PM
38 points
5 comments5 min readLW link

Unstriving

Jacob FalkovichAug 19, 2019, 2:31 PM
38 points
7 comments6 min readLW link

Di­ana Fleischman and Ge­offrey Miller—Au­di­ence Q&A

Jacob FalkovichAug 10, 2019, 10:37 PM
38 points
6 comments9 min readLW link

Mis­take Ver­sus Con­flict The­ory of Against Billion­aire Philanthropy

ZviAug 1, 2019, 1:10 PM
37 points
34 comments3 min readLW link
(thezvi.wordpress.com)

Two senses of “op­ti­mizer”

Joar SkalseAug 21, 2019, 4:02 PM
35 points
41 comments3 min readLW link