Page 1

What failure looks like

paulfchristiano
17 Mar 2019 20:18 UTC
198 points
42 comments8 min readLW link

Per­son­al­ized Medicine For Real

sarahconstantin
4 Mar 2019 22:40 UTC
197 points
15 comments5 min readLW link

Align­ment Re­search Field Guide

abramdemski
8 Mar 2019 19:57 UTC
194 points
5 comments17 min readLW link

“Other peo­ple are wrong” vs “I am right”

Buck
22 Feb 2019 20:01 UTC
188 points
14 comments9 min readLW link

Build­ing up to an In­ter­nal Fam­ily Sys­tems model

Kaj_Sotala
26 Jan 2019 12:25 UTC
148 points
27 comments28 min readLW link

The Re­la­tion­ship Between the Village and the Mission

Raemon
12 May 2019 21:09 UTC
132 points
66 comments18 min readLW link

The 3 Books Tech­nique for Learn­ing a New Skilll

mr-hire
9 Jan 2019 12:45 UTC
126 points
29 comments2 min readLW link

Thoughts on Hu­man Models

xrchz
21 Feb 2019 9:10 UTC
119 points
21 comments10 min readLW link

Hu­mans Who Are Not Con­cen­trat­ing Are Not Gen­eral Intelligences

sarahconstantin
25 Feb 2019 20:40 UTC
119 points
29 comments6 min readLW link

Asym­met­ric Justice

Zvi
25 Apr 2019 16:00 UTC
118 points
81 comments5 min readLW link

The Amish, and Strate­gic Norms around Technology

Raemon
24 Mar 2019 22:16 UTC
114 points
9 comments3 min readLW link

Disen­tan­gling ar­gu­ments for the im­por­tance of AI safety

ricraz
21 Jan 2019 12:41 UTC
113 points
21 comments8 min readLW link

Rest Days vs Re­cov­ery Days

Unreal
19 Mar 2019 22:37 UTC
113 points
20 comments6 min readLW link

Some Thoughts on My Psy­chi­a­try Practice

Laura B
16 Jan 2019 23:16 UTC
106 points
31 comments4 min readLW link

An­nounc­ing the Cen­ter for Ap­plied Postrationality

DonyChristie
2 Apr 2019 1:17 UTC
105 points
14 comments1 min readLW link

Co­her­ent de­ci­sions im­ply con­sis­tent utilities

Eliezer Yudkowsky
12 May 2019 21:33 UTC
104 points
49 comments26 min readLW link

Offer of col­lab­o­ra­tion and/​or mentorship

Vanessa Kosoy
16 May 2019 14:16 UTC
103 points
10 comments2 min readLW link

De­grees of Freedom

sarahconstantin
2 Apr 2019 21:10 UTC
101 points
26 comments11 min readLW link

Rule Thinkers In, Not Out

Scott Alexander
27 Feb 2019 2:40 UTC
99 points
41 comments4 min readLW link

Yes Re­quires the Pos­si­bil­ity of No

Scott Garrabrant
17 May 2019 22:39 UTC
98 points
11 comments2 min readLW link

S-Curves for Trend Forecasting

mr-hire
23 Jan 2019 18:17 UTC
97 points
14 comments3 min readLW link

Karma-Change Notifications

jimrandomh
2 Mar 2019 2:52 UTC
95 points
42 comments1 min readLW link

Liter­a­ture Re­view: Distributed Teams

Elizabeth
16 Apr 2019 1:19 UTC
94 points
32 comments6 min readLW link

Align­ment Newslet­ter One Year Retrospective

rohinmshah
10 Apr 2019 6:58 UTC
93 points
31 comments21 min readLW link

Refram­ing Su­per­in­tel­li­gence: Com­pre­hen­sive AI Ser­vices as Gen­eral Intelligence

rohinmshah
8 Jan 2019 7:12 UTC
91 points
67 comments5 min readLW link
(www.fhi.ox.ac.uk)

Book Sum­mary: Con­scious­ness and the Brain

Kaj_Sotala
16 Jan 2019 14:43 UTC
90 points
13 comments26 min readLW link

The Hard Work of Trans­la­tion (Bud­dhism)

romeostevensit
7 Apr 2019 21:04 UTC
90 points
122 comments5 min readLW link

Prob­a­bil­ity space has 2 metrics

Donald Hobson
10 Feb 2019 0:28 UTC
88 points
11 comments1 min readLW link

RAISE is launch­ing their MVP

toonalfrink
26 Feb 2019 11:45 UTC
85 points
1 comment1 min readLW link

Subagents, akra­sia, and co­her­ence in humans

Kaj_Sotala
25 Mar 2019 14:24 UTC
85 points
27 comments17 min readLW link

Se­quence in­tro­duc­tion: non-agent and mul­ti­a­gent mod­els of mind

Kaj_Sotala
7 Jan 2019 14:12 UTC
83 points
4 comments7 min readLW link

Counterspells

Virgil Kurkjian
27 Apr 2019 23:37 UTC
83 points
24 comments10 min readLW link

[Question] Why is so much dis­cus­sion hap­pen­ing in pri­vate Google Docs?

Wei_Dai
12 Jan 2019 2:19 UTC
82 points
21 comments1 min readLW link

Less Com­pe­ti­tion, More Mer­i­toc­racy?

Zvi
20 Jan 2019 2:00 UTC
81 points
13 comments20 min readLW link

The Forces of Bland­ness and the Disagree­able Majority

sarahconstantin
28 Apr 2019 19:44 UTC
81 points
20 comments3 min readLW link

An­nounce­ment: AI al­ign­ment prize round 4 winners

cousin_it
20 Jan 2019 14:46 UTC
80 points
41 comments1 min readLW link

Privacy

Zvi
15 Mar 2019 20:20 UTC
79 points
78 comments6 min readLW link

From Per­sonal to Pri­son Gangs: En­forc­ing Proso­cial Behavior

johnswentworth
24 Jan 2019 18:07 UTC
78 points
8 comments5 min readLW link

Com­plex Be­hav­ior from Sim­ple (Sub)Agents

moridinamael
10 May 2019 21:44 UTC
78 points
6 comments9 min readLW link

Book Re­view: The Struc­ture Of Scien­tific Revolutions

Scott Alexander
9 Jan 2019 7:10 UTC
75 points
24 comments19 min readLW link

[Question] Best rea­sons for pes­simism about im­pact of im­pact mea­sures?

TurnTrout
10 Apr 2019 17:22 UTC
75 points
51 comments3 min readLW link

Strat­egy is the De­con­fu­sion of Action

ryan_b
2 Jan 2019 20:56 UTC
73 points
4 comments6 min readLW link

[Question] How does Gra­di­ent Des­cent In­ter­act with Good­hart?

Scott Garrabrant
2 Feb 2019 0:14 UTC
70 points
15 comments4 min readLW link

He­len Toner on China, CSET, and AI

Rob Bensinger
21 Apr 2019 4:10 UTC
70 points
3 comments7 min readLW link
(rationallyspeakingpodcast.org)

Com­ments on CAIS

ricraz
12 Jan 2019 15:20 UTC
69 points
12 comments7 min readLW link

Un­con­scious Economies

jacobjacob
27 Feb 2019 12:58 UTC
69 points
19 comments4 min readLW link

Three ways that “Suffi­ciently op­ti­mized agents ap­pear co­her­ent” can be false

Wei_Dai
5 Mar 2019 21:52 UTC
68 points
2 comments3 min readLW link

Ac­tive Cu­ri­os­ity vs Open Curiosity

Unreal
15 Mar 2019 16:54 UTC
68 points
19 comments3 min readLW link

Blackmail

Zvi
19 Feb 2019 3:50 UTC
67 points
45 comments15 min readLW link

Co­or­di­na­tion Sur­veys: why we should sur­vey to or­ga­nize re­spon­si­bil­ities, not just predictions

Academian
7 May 2019 17:43 UTC
67 points
3 comments3 min readLW link