What failure looks like

paulfchristiano
17 Mar 2019 20:18 UTC
204 points
42 comments · 8 min read · LW link

Personalized Medicine For Real

sarahconstantin
4 Mar 2019 22:40 UTC
199 points
15 comments · 5 min read · LW link
(srconstantin.wordpress.com)

Alignment Research Field Guide

abramdemski
8 Mar 2019 19:57 UTC
195 points
5 comments · 17 min read · LW link

“Other people are wrong” vs “I am right”

Buck
22 Feb 2019 20:01 UTC
190 points
15 comments · 9 min read · LW link

Building up to an Internal Family Systems model

Kaj_Sotala
26 Jan 2019 12:25 UTC
148 points
27 comments · 28 min read · LW link

Asymmetric Justice

Zvi
25 Apr 2019 16:00 UTC
145 points
85 comments · 5 min read · LW link
(thezvi.wordpress.com)

Steelmanning Divination

Vaniver
5 Jun 2019 22:53 UTC
137 points
34 comments · 6 min read · LW link

The 3 Books Technique for Learning a New Skill

mr-hire
9 Jan 2019 12:45 UTC
135 points
31 comments · 2 min read · LW link

Mistakes with Conservation of Expected Evidence

abramdemski
8 Jun 2019 23:07 UTC
132 points
14 comments · 12 min read · LW link

The Relationship Between the Village and the Mission

Raemon
12 May 2019 21:09 UTC
129 points
67 comments · 18 min read · LW link

Yes Requires the Possibility of No

Scott Garrabrant
17 May 2019 22:39 UTC
126 points
39 comments · 2 min read · LW link

Thoughts on Human Models

xrchz
21 Feb 2019 9:10 UTC
122 points
21 comments · 10 min read · LW link

Being the (Pareto) Best in the World

johnswentworth
24 Jun 2019 18:36 UTC
121 points
18 comments · 3 min read · LW link

Humans Who Are Not Concentrating Are Not General Intelligences

sarahconstantin
25 Feb 2019 20:40 UTC
120 points
29 comments · 6 min read · LW link
(srconstantin.wordpress.com)

Coherent decisions imply consistent utilities

Eliezer Yudkowsky
12 May 2019 21:33 UTC
118 points
53 comments · 26 min read · LW link

Rest Days vs Recovery Days

Unreal
19 Mar 2019 22:37 UTC
114 points
20 comments · 6 min read · LW link

The Amish, and Strategic Norms around Technology

Raemon
24 Mar 2019 22:16 UTC
114 points
11 comments · 3 min read · LW link

Book Review: The Secret Of Our Success

Scott Alexander
5 Jun 2019 6:50 UTC
114 points
5 comments · 25 min read · LW link
(slatestarcodex.com)

Disentangling arguments for the importance of AI safety

ricraz
21 Jan 2019 12:41 UTC
113 points
21 comments · 8 min read · LW link

Risks from Learned Optimization: Introduction

evhub
31 May 2019 23:44 UTC
110 points
30 comments · 12 min read · LW link

Offer of collaboration and/or mentorship

Vanessa Kosoy
16 May 2019 14:16 UTC
109 points
12 comments · 2 min read · LW link

Integrity and accountability are core parts of rationality

habryka
15 Jul 2019 20:22 UTC
108 points
58 comments · 6 min read · LW link

Announcing the Center for Applied Postrationality

DonyChristie
2 Apr 2019 1:17 UTC
107 points
14 comments · 1 min read · LW link

Some Thoughts on My Psychiatry Practice

Laura B
16 Jan 2019 23:16 UTC
106 points
31 comments · 4 min read · LW link

Reason isn’t magic

Benquo
18 Jun 2019 4:04 UTC
104 points
9 comments · 2 min read · LW link
(benjaminrosshoffman.com)

Degrees of Freedom

sarahconstantin
2 Apr 2019 21:10 UTC
103 points
27 comments · 11 min read · LW link
(srconstantin.wordpress.com)

Jeff Hawkins on neuromorphic AGI within 20 years

steve2152
15 Jul 2019 19:16 UTC
103 points
6 comments · 12 min read · LW link

Rule Thinkers In, Not Out

Scott Alexander
27 Feb 2019 2:40 UTC
101 points
43 comments · 4 min read · LW link
(slatestarcodex.com)

S-Curves for Trend Forecasting

mr-hire
23 Jan 2019 18:17 UTC
100 points
14 comments · 3 min read · LW link

Selection vs Control

abramdemski
2 Jun 2019 7:01 UTC
98 points
14 comments · 11 min read · LW link

Literature Review: Distributed Teams

Elizabeth
16 Apr 2019 1:19 UTC
96 points
33 comments · 6 min read · LW link

Arbital scrape

emmab
6 Jun 2019 23:11 UTC
96 points
23 comments · 1 min read · LW link

A Key Power of the President is to Coordinate the Execution of Existing Concrete Plans

Benito
16 Jul 2019 5:06 UTC
96 points
13 comments · 10 min read · LW link

Karma-Change Notifications

jimrandomh
2 Mar 2019 2:52 UTC
95 points
42 comments · 1 min read · LW link

Alignment Newsletter One Year Retrospective

rohinmshah
10 Apr 2019 6:58 UTC
93 points
31 comments · 21 min read · LW link

[Question] Where are people thinking and talking about global coordination for AI safety?

Wei_Dai
22 May 2019 6:24 UTC
93 points
9 comments · 1 min read · LW link

Book Summary: Consciousness and the Brain

Kaj_Sotala
16 Jan 2019 14:43 UTC
92 points
14 comments · 26 min read · LW link

Complex Behavior from Simple (Sub)Agents

moridinamael
10 May 2019 21:44 UTC
92 points
8 comments · 9 min read · LW link

Reframing Superintelligence: Comprehensive AI Services as General Intelligence

rohinmshah
8 Jan 2019 7:12 UTC
91 points
67 comments · 5 min read · LW link
(www.fhi.ox.ac.uk)

The Schelling Choice is “Rabbit”, not “Stag”

Raemon
8 Jun 2019 0:24 UTC
91 points
25 comments · 12 min read · LW link

Writing children’s picture books

jessicata
25 Jun 2019 21:43 UTC
91 points
14 comments · 5 min read · LW link
(unstableontology.com)

The Hard Work of Translation (Buddhism)

romeostevensit
7 Apr 2019 21:04 UTC
90 points
122 comments · 5 min read · LW link

Probability space has 2 metrics

Donald Hobson
10 Feb 2019 0:28 UTC
89 points
11 comments · 1 min read · LW link

Say Wrong Things

G Gordon Worley III
24 May 2019 22:11 UTC
89 points
10 comments · 4 min read · LW link

No, it’s not The Incentives—it’s you

Zack_M_Davis
11 Jun 2019 7:09 UTC
89 points
92 comments · 1 min read · LW link
(www.talyarkoni.org)

[Question] Why is so much discussion happening in private Google Docs?

Wei_Dai
12 Jan 2019 2:19 UTC
86 points
21 comments · 1 min read · LW link

The Forces of Blandness and the Disagreeable Majority

sarahconstantin
28 Apr 2019 19:44 UTC
86 points
20 comments · 3 min read · LW link
(srconstantin.wordpress.com)

Sequence introduction: non-agent and multiagent models of mind

Kaj_Sotala
7 Jan 2019 14:12 UTC
85 points
4 comments · 7 min read · LW link

RAISE is launching their MVP

toonalfrink
26 Feb 2019 11:45 UTC
85 points
1 comment · 1 min read · LW link

Subagents, akrasia, and coherence in humans

Kaj_Sotala
25 Mar 2019 14:24 UTC
85 points
27 comments · 17 min read · LW link