What failure looks like

paulfchristiano
17 Mar 2019 20:18 UTC
212 points
43 comments · 8 min read · LW link

Heads I Win, Tails?—Never Heard of Her; Or, Selective Reporting and the Tragedy of the Green Rationalists

Zack_M_Davis
24 Sep 2019 4:12 UTC
202 points
29 comments · 8 min read · LW link

Alignment Research Field Guide

abramdemski
8 Mar 2019 19:57 UTC
201 points
5 comments · 17 min read · LW link

Personalized Medicine For Real

sarahconstantin
4 Mar 2019 22:40 UTC
199 points
15 comments · 5 min read · LW link
(srconstantin.wordpress.com)

“Other people are wrong” vs “I am right”

Buck
22 Feb 2019 20:01 UTC
195 points
15 comments · 9 min read · LW link

The Parable of Predict-O-Matic

abramdemski
15 Oct 2019 0:49 UTC
188 points
16 comments · 14 min read · LW link

Book summary: Unlocking the Emotional Brain

Kaj_Sotala
8 Oct 2019 19:11 UTC
183 points
25 comments · 21 min read · LW link

Being the (Pareto) Best in the World

johnswentworth
24 Jun 2019 18:36 UTC
176 points
35 comments · 3 min read · LW link

Debate on Instrumental Convergence between LeCun, Russell, Bengio, Zador, and More

Ben Pace
4 Oct 2019 4:08 UTC
172 points
47 comments · 15 min read · LW link

Building up to an Internal Family Systems model

Kaj_Sotala
26 Jan 2019 12:25 UTC
160 points
65 comments · 28 min read · LW link

Is Rationalist Self-Improvement Real?

Jacobian
9 Dec 2019 17:11 UTC
159 points
60 comments · 11 min read · LW link

Jeff Hawkins on neuromorphic AGI within 20 years

steve2152
15 Jul 2019 19:16 UTC
156 points
11 comments · 12 min read · LW link

Asymmetric Justice

Zvi
25 Apr 2019 16:00 UTC
149 points
87 comments · 5 min read · LW link
(thezvi.wordpress.com)

How to Ignore Your Emotions (while also thinking you’re awesome at emotions)

Hazard
31 Jul 2019 13:34 UTC
149 points
57 comments · 4 min read · LW link

Mistakes with Conservation of Expected Evidence

abramdemski
8 Jun 2019 23:07 UTC
148 points
15 comments · 12 min read · LW link

Steelmanning Divination

Vaniver
5 Jun 2019 22:53 UTC
146 points
40 comments · 6 min read · LW link

RAISE post-mortem

toonalfrink
24 Nov 2019 16:19 UTC
142 points
12 comments · 4 min read · LW link

The 3 Books Technique for Learning a New Skill

mr-hire
9 Jan 2019 12:45 UTC
141 points
46 comments · 2 min read · LW link

Noticing Frame Differences

Raemon
30 Sep 2019 1:24 UTC
140 points
29 comments · 8 min read · LW link

The Relationship Between the Village and the Mission

Raemon
12 May 2019 21:09 UTC
139 points
67 comments · 18 min read · LW link

Honoring Petrov Day on LessWrong, in 2019

Ben Pace
26 Sep 2019 9:10 UTC
138 points
168 comments · 4 min read · LW link

Chris Olah’s views on AGI safety

evhub
1 Nov 2019 20:13 UTC
137 points
33 comments · 12 min read · LW link

Book Review: The Secret Of Our Success

Scott Alexander
5 Jun 2019 6:50 UTC
136 points
8 comments · 25 min read · LW link
(slatestarcodex.com)

Integrity and accountability are core parts of rationality

habryka
15 Jul 2019 20:22 UTC
135 points
62 comments · 6 min read · LW link

The Zettelkasten Method

abramdemski
20 Sep 2019 13:15 UTC
133 points
60 comments · 40 min read · LW link

The unexpected difficulty of comparing AlphaStar to humans

Richard Korzekwa
18 Sep 2019 2:20 UTC
129 points
32 comments · 26 min read · LW link
(aiimpacts.org)

2019 AI Alignment Literature Review and Charity Comparison

Larks
19 Dec 2019 3:00 UTC
129 points
17 comments · 62 min read · LW link

Yes Requires the Possibility of No

Scott Garrabrant
17 May 2019 22:39 UTC
127 points
39 comments · 2 min read · LW link

Risks from Learned Optimization: Introduction

31 May 2019 23:44 UTC
126 points
32 comments · 12 min read · LW link

Book Review: Design Principles of Biological Circuits

johnswentworth
5 Nov 2019 6:49 UTC
126 points
19 comments · 12 min read · LW link

Thoughts on Human Models

21 Feb 2019 9:10 UTC
124 points
22 comments · 10 min read · LW link

Disentangling arguments for the importance of AI safety

ricraz
21 Jan 2019 12:41 UTC
122 points
23 comments · 8 min read · LW link

The Amish, and Strategic Norms around Technology

Raemon
24 Mar 2019 22:16 UTC
122 points
11 comments · 3 min read · LW link

Coherent decisions imply consistent utilities

Eliezer Yudkowsky
12 May 2019 21:33 UTC
122 points
54 comments · 26 min read · LW link

CO2 Stripper Postmortem Thoughts

Diffractor
30 Nov 2019 21:20 UTC
121 points
33 comments · 8 min read · LW link

Humans Who Are Not Concentrating Are Not General Intelligences

sarahconstantin
25 Feb 2019 20:40 UTC
120 points
29 comments · 6 min read · LW link
(srconstantin.wordpress.com)

We run the Center for Applied Rationality, AMA

AnnaSalamon
19 Dec 2019 16:34 UTC
118 points
329 comments · 1 min read · LW link

A Key Power of the President is to Coordinate the Execution of Existing Concrete Plans

Ben Pace
16 Jul 2019 5:06 UTC
117 points
13 comments · 10 min read · LW link

Introduction to Introduction to Category Theory

countedblessings
6 Oct 2019 14:43 UTC
117 points
12 comments · 2 min read · LW link

What I’ll be doing at MIRI

evhub
12 Nov 2019 23:19 UTC
117 points
6 comments · 1 min read · LW link

Moloch Hasn’t Won

Zvi
28 Dec 2019 16:30 UTC
117 points
28 comments · 7 min read · LW link
(thezvi.wordpress.com)

Rest Days vs Recovery Days

Unreal
19 Mar 2019 22:37 UTC
116 points
20 comments · 6 min read · LW link

Reason isn’t magic

Benquo
18 Jun 2019 4:04 UTC
116 points
11 comments · 2 min read · LW link
(benjaminrosshoffman.com)

What Comes After Epistemic Spot Checks?

Elizabeth
22 Oct 2019 17:00 UTC
116 points
8 comments · 3 min read · LW link
(acesounderglass.com)

Paper-Reading for Gears

johnswentworth
4 Dec 2019 21:02 UTC
116 points
3 comments · 4 min read · LW link

Power Buys You Distance From The Crime

Elizabeth
2 Aug 2019 20:50 UTC
115 points
64 comments · 7 min read · LW link
(acesounderglass.com)

Some Thoughts on My Psychiatry Practice

Laura B
16 Jan 2019 23:16 UTC
113 points
31 comments · 4 min read · LW link

The Costs of Reliability

sarahconstantin
20 Jul 2019 1:20 UTC
113 points
4 comments · 3 min read · LW link
(srconstantin.wordpress.com)

Soft takeoff can still lead to decisive strategic advantage

Daniel Kokotajlo
23 Aug 2019 16:39 UTC
113 points
32 comments · 8 min read · LW link

Degrees of Freedom

sarahconstantin
2 Apr 2019 21:10 UTC
112 points
31 comments · 11 min read · LW link
(srconstantin.wordpress.com)