RSS

What failure looks like

paulfchristiano17 Mar 2019 20:18 UTC
401 points
54 comments8 min readLW link2 reviews

Be­ing the (Pareto) Best in the World

johnswentworth24 Jun 2019 18:36 UTC
398 points
57 comments3 min readLW link3 reviews

Wel­come to LessWrong!

14 Jun 2019 19:42 UTC
397 points
48 comments2 min readLW link

The Parable of Pre­dict-O-Matic

abramdemski15 Oct 2019 0:49 UTC
342 points
41 comments14 min readLW link2 reviews

How to Ig­nore Your Emo­tions (while also think­ing you’re awe­some at emo­tions)

Hazard31 Jul 2019 13:34 UTC
335 points
72 comments4 min readLW link4 reviews

Book sum­mary: Un­lock­ing the Emo­tional Brain

Kaj_Sotala8 Oct 2019 19:11 UTC
313 points
48 comments21 min readLW link3 reviews

Heads I Win, Tails?—Never Heard of Her; Or, Selec­tive Re­port­ing and the Tragedy of the Green Rationalists

Zack_M_Davis24 Sep 2019 4:12 UTC
296 points
40 comments8 min readLW link2 reviews

Align­ment Re­search Field Guide

abramdemski8 Mar 2019 19:57 UTC
263 points
9 comments17 min readLW link2 reviews

Build­ing up to an In­ter­nal Fam­ily Sys­tems model

Kaj_Sotala26 Jan 2019 12:25 UTC
263 points
86 comments28 min readLW link2 reviews

Is Ra­tion­al­ist Self-Im­prove­ment Real?

Jacob Falkovich9 Dec 2019 17:11 UTC
256 points
78 comments11 min readLW link3 reviews

“Other peo­ple are wrong” vs “I am right”

Buck22 Feb 2019 20:01 UTC
246 points
20 comments9 min readLW link2 reviews

Yes Re­quires the Pos­si­bil­ity of No

Scott Garrabrant17 May 2019 22:39 UTC
241 points
55 comments2 min readLW link2 reviews

Asym­met­ric Justice

Zvi25 Apr 2019 16:00 UTC
223 points
101 comments5 min readLW link2 reviews
(thezvi.wordpress.com)

De­bate on In­stru­men­tal Con­ver­gence be­tween LeCun, Rus­sell, Ben­gio, Zador, and More

Ben Pace4 Oct 2019 4:08 UTC
221 points
61 comments15 min readLW link2 reviews

Rule Thinkers In, Not Out

Scott Alexander27 Feb 2019 2:40 UTC
219 points
67 comments4 min readLW link4 reviews
(slatestarcodex.com)

The Zet­telkas­ten Method

abramdemski20 Sep 2019 13:15 UTC
216 points
90 comments42 min readLW link3 reviews

Per­son­al­ized Medicine For Real

sarahconstantin4 Mar 2019 22:40 UTC
213 points
16 comments5 min readLW link
(srconstantin.wordpress.com)

Mis­takes with Con­ser­va­tion of Ex­pected Evidence

abramdemski8 Jun 2019 23:07 UTC
212 points
25 comments12 min readLW link1 review

Book Re­view: De­sign Prin­ci­ples of Biolog­i­cal Circuits

johnswentworth5 Nov 2019 6:49 UTC
209 points
24 comments12 min readLW link1 review

Notic­ing Frame Differences

Raemon30 Sep 2019 1:24 UTC
207 points
39 comments9 min readLW link2 reviews

Chris Olah’s views on AGI safety

evhub1 Nov 2019 20:13 UTC
206 points
38 comments12 min readLW link2 reviews

You Get About Five Words

Raemon12 Mar 2019 20:30 UTC
197 points
76 comments1 min readLW link6 reviews

The 3 Books Tech­nique for Learn­ing a New Skilll

Matt Goldenberg9 Jan 2019 12:45 UTC
196 points
48 comments2 min readLW link

Steel­man­ning Divination

Vaniver5 Jun 2019 22:53 UTC
191 points
48 comments6 min readLW link2 reviews

Rest Days vs Re­cov­ery Days

Unreal19 Mar 2019 22:37 UTC
191 points
35 comments6 min readLW link1 review

Power Buys You Dis­tance From The Crime

Elizabeth2 Aug 2019 20:50 UTC
189 points
75 comments7 min readLW link1 review
(acesounderglass.com)

Hu­mans Who Are Not Con­cen­trat­ing Are Not Gen­eral Intelligences

sarahconstantin25 Feb 2019 20:40 UTC
186 points
35 comments6 min readLW link1 review
(srconstantin.wordpress.com)

Risks from Learned Op­ti­miza­tion: Introduction

31 May 2019 23:44 UTC
183 points
42 comments12 min readLW link3 reviews

hu­man psy­chol­in­guists: a crit­i­cal appraisal

nostalgebraist31 Dec 2019 0:20 UTC
179 points
59 comments16 min readLW link2 reviews
(nostalgebraist.tumblr.com)

Moloch Hasn’t Won

Zvi28 Dec 2019 16:30 UTC
178 points
40 comments7 min readLW link1 review
(thezvi.wordpress.com)

Why Subagents?

johnswentworth1 Aug 2019 22:17 UTC
174 points
48 comments7 min readLW link1 review

Evolu­tion of Modularity

johnswentworth14 Nov 2019 6:49 UTC
173 points
12 comments2 min readLW link1 review

Jeff Hawk­ins on neu­ro­mor­phic AGI within 20 years

Steven Byrnes15 Jul 2019 19:16 UTC
170 points
24 comments12 min readLW link

Gears-Level Models are Cap­i­tal Investments

johnswentworth22 Nov 2019 22:41 UTC
170 points
28 comments7 min readLW link1 review

Selec­tion vs Control

abramdemski2 Jun 2019 7:01 UTC
168 points
25 comments11 min readLW link2 reviews

Book Sum­mary: Con­scious­ness and the Brain

Kaj_Sotala16 Jan 2019 14:43 UTC
164 points
20 comments26 min readLW link1 review

Seek­ing Power is Often Con­ver­gently In­stru­men­tal in MDPs

5 Dec 2019 2:33 UTC
161 points
39 comments17 min readLW link2 reviews
(arxiv.org)

Paper-Read­ing for Gears

johnswentworth4 Dec 2019 21:02 UTC
159 points
6 comments4 min readLW link1 review

Book Re­view: The Se­cret Of Our Success

Scott Alexander5 Jun 2019 6:50 UTC
158 points
19 comments25 min readLW link2 reviews
(slatestarcodex.com)

The Schel­ling Choice is “Rab­bit”, not “Stag”

Raemon8 Jun 2019 0:24 UTC
157 points
52 comments12 min readLW link3 reviews

Some Thoughts on My Psy­chi­a­try Practice

Laura B16 Jan 2019 23:16 UTC
154 points
43 comments4 min readLW link2 reviews

In­tegrity and ac­countabil­ity are core parts of rationality

habryka15 Jul 2019 20:22 UTC
152 points
66 comments6 min readLW link1 review

A Key Power of the Pres­i­dent is to Co­or­di­nate the Ex­e­cu­tion of Ex­ist­ing Con­crete Plans

Ben Pace16 Jul 2019 5:06 UTC
152 points
13 comments10 min readLW link

Rea­son isn’t magic

Benquo18 Jun 2019 4:04 UTC
151 points
19 comments2 min readLW link3 reviews
(benjaminrosshoffman.com)

The Com­mit­ment Races problem

Daniel Kokotajlo23 Aug 2019 1:58 UTC
150 points
56 comments5 min readLW link

Co­her­ent de­ci­sions im­ply con­sis­tent utilities

Eliezer Yudkowsky12 May 2019 21:33 UTC
147 points
81 comments26 min readLW link3 reviews

The Costs of Reliability

sarahconstantin20 Jul 2019 1:20 UTC
147 points
13 comments3 min readLW link2 reviews
(srconstantin.wordpress.com)

Fo­rum par­ti­ci­pa­tion as a re­search strategy

Wei Dai30 Jul 2019 18:09 UTC
147 points
44 comments3 min readLW link1 review

Un­der­stand­ing “Deep Dou­ble Des­cent”

evhub6 Dec 2019 0:00 UTC
146 points
51 comments5 min readLW link4 reviews

From Per­sonal to Pri­son Gangs: En­forc­ing Proso­cial Behavior

johnswentworth24 Jan 2019 18:07 UTC
146 points
26 comments5 min readLW link2 reviews

The un­ex­pected difficulty of com­par­ing AlphaS­tar to humans

Richard Korzekwa 18 Sep 2019 2:20 UTC
145 points
36 comments26 min readLW link
(aiimpacts.org)

Men­tal Mountains

Scott Alexander27 Nov 2019 5:30 UTC
143 points
14 comments15 min readLW link1 review
(slatestarcodex.com)

Con­ver­sa­tional Cul­tures: Com­bat vs Nur­ture (V2)

Ruby31 Dec 2019 20:23 UTC
142 points
92 comments9 min readLW link5 reviews

In­te­grat­ing dis­agree­ing subagents

Kaj_Sotala14 May 2019 14:06 UTC
141 points
15 comments21 min readLW link

Honor­ing Petrov Day on LessWrong, in 2019

Ben Pace26 Sep 2019 9:10 UTC
137 points
168 comments4 min readLW link

The Amish, and Strate­gic Norms around Technology

Raemon24 Mar 2019 22:16 UTC
136 points
18 comments3 min readLW link2 reviews

Un­con­scious Economics

jacobjacob27 Feb 2019 12:58 UTC
135 points
30 comments4 min readLW link3 reviews

Subagents, akra­sia, and co­her­ence in humans

Kaj_Sotala25 Mar 2019 14:24 UTC
134 points
31 comments16 min readLW link

Disen­tan­gling ar­gu­ments for the im­por­tance of AI safety

Richard_Ngo21 Jan 2019 12:41 UTC
133 points
23 comments8 min readLW link

The Forces of Bland­ness and the Disagree­able Majority

sarahconstantin28 Apr 2019 19:44 UTC
132 points
27 comments3 min readLW link2 reviews
(srconstantin.wordpress.com)

A mechanis­tic model of meditation

Kaj_Sotala6 Nov 2019 21:37 UTC
130 points
11 comments21 min readLW link

2019 AI Align­ment Liter­a­ture Re­view and Char­ity Comparison

Larks19 Dec 2019 3:00 UTC
130 points
18 comments62 min readLW link

The Real Rules Have No Exceptions

Said Achmiz23 Jul 2019 3:38 UTC
129 points
57 comments1 min readLW link2 reviews

Thoughts on Hu­man Models

21 Feb 2019 9:10 UTC
126 points
32 comments10 min readLW link1 review

Writ­ing chil­dren’s pic­ture books

jessicata25 Jun 2019 21:43 UTC
125 points
22 comments5 min readLW link
(unstableontology.com)

Every­body Knows

Zvi2 Jul 2019 12:20 UTC
125 points
21 comments4 min readLW link1 review
(thezvi.wordpress.com)

The Curse Of The Counterfactual

pjeby1 Nov 2019 18:34 UTC
124 points
34 comments19 min readLW link1 review

Soft take­off can still lead to de­ci­sive strate­gic advantage

Daniel Kokotajlo23 Aug 2019 16:39 UTC
122 points
47 comments8 min readLW link4 reviews

Firm­ing Up Not-Ly­ing Around Its Edge-Cases Is Less Broadly Use­ful Than One Might Ini­tially Think

Zack_M_Davis27 Dec 2019 5:09 UTC
122 points
43 comments8 min readLW link2 reviews

Refram­ing Su­per­in­tel­li­gence: Com­pre­hen­sive AI Ser­vices as Gen­eral Intelligence

Rohin Shah8 Jan 2019 7:12 UTC
121 points
77 comments5 min readLW link2 reviews
(www.fhi.ox.ac.uk)

Se­quence in­tro­duc­tion: non-agent and mul­ti­a­gent mod­els of mind

Kaj_Sotala7 Jan 2019 14:12 UTC
121 points
15 comments7 min readLW link1 review

What Comes After Epistemic Spot Checks?

Elizabeth22 Oct 2019 17:00 UTC
121 points
9 comments3 min readLW link
(acesounderglass.com)

Utility ≠ Reward

Vlad Mikulik5 Sep 2019 17:28 UTC
121 points
24 comments1 min readLW link2 reviews

Where to Draw the Boundaries?

Zack_M_Davis13 Apr 2019 21:34 UTC
120 points
108 comments13 min readLW link3 reviews

The Main Sources of AI Risk?

21 Mar 2019 18:28 UTC
119 points
26 comments2 min readLW link

De­cep­tive Alignment

5 Jun 2019 20:16 UTC
117 points
20 comments17 min readLW link

Quotes from Mo­ral Mazes

Zvi30 May 2019 11:50 UTC
116 points
27 comments53 min readLW link
(thezvi.wordpress.com)

AI Safety “Suc­cess Sto­ries”

Wei Dai7 Sep 2019 2:54 UTC
116 points
27 comments4 min readLW link1 review

No non­sense ver­sion of the “racial al­gorithm bias”

Yuxi_Liu13 Jul 2019 15:39 UTC
115 points
20 comments2 min readLW link

Say Wrong Things

Gordon Seidoh Worley24 May 2019 22:11 UTC
114 points
13 comments4 min readLW link

In­tro­duc­tion to In­tro­duc­tion to Cat­e­gory Theory

countedblessings6 Oct 2019 14:43 UTC
113 points
19 comments2 min readLW link

CO2 Strip­per Post­mortem Thoughts

Diffractor30 Nov 2019 21:20 UTC
113 points
37 comments8 min readLW link

AlphaS­tar: Im­pres­sive for RL progress, not for AGI progress

orthonormal2 Nov 2019 1:50 UTC
113 points
58 comments2 min readLW link1 review

S-Curves for Trend Forecasting

Matt Goldenberg23 Jan 2019 18:17 UTC
112 points
23 comments7 min readLW link4 reviews

[Question] Where are peo­ple think­ing and talk­ing about global co­or­di­na­tion for AI safety?

Wei Dai22 May 2019 6:24 UTC
112 points
22 comments1 min readLW link

What is op­er­a­tions?

Swimmer963 (Miranda Dixon-Luinenburg) 26 Sep 2019 14:16 UTC
110 points
9 comments7 min readLW link

Com­plex Be­hav­ior from Sim­ple (Sub)Agents

moridinamael10 May 2019 21:44 UTC
110 points
13 comments9 min readLW link1 review

Prop­a­gat­ing Facts into Aesthetics

Raemon19 Dec 2019 4:09 UTC
109 points
35 comments11 min readLW link1 review

The AI Timelines Scam

jessicata11 Jul 2019 2:52 UTC
108 points
105 comments7 min readLW link3 reviews
(unstableontology.com)

We run the Cen­ter for Ap­plied Ra­tion­al­ity, AMA

AnnaSalamon19 Dec 2019 16:34 UTC
108 points
324 comments1 min readLW link

The Hard Work of Trans­la­tion (Bud­dhism)

romeostevensit7 Apr 2019 21:04 UTC
107 points
139 comments5 min readLW link3 reviews

Sys­tem 2 as work­ing-mem­ory aug­mented Sys­tem 1 reasoning

Kaj_Sotala25 Sep 2019 8:39 UTC
107 points
23 comments16 min readLW link

Liter­a­ture Re­view: Distributed Teams

Elizabeth16 Apr 2019 1:19 UTC
106 points
37 comments6 min readLW link1 review

Book Re­view: The Struc­ture Of Scien­tific Revolutions

Scott Alexander9 Jan 2019 7:10 UTC
104 points
30 comments19 min readLW link1 review
(slatestarcodex.com)

Gra­di­ent hacking

evhub16 Oct 2019 0:53 UTC
104 points
39 comments3 min readLW link2 reviews

The In­ner Align­ment Problem

4 Jun 2019 1:20 UTC
103 points
17 comments13 min readLW link

Turn­ing air into bread

jasoncrawford21 Oct 2019 17:50 UTC
103 points
12 comments6 min readLW link1 review
(rootsofprogress.org)

1960: The Year The Sin­gu­lar­ity Was Cancelled

Scott Alexander23 Apr 2019 1:30 UTC
103 points
15 comments11 min readLW link1 review
(slatestarcodex.com)

De­grees of Freedom

sarahconstantin2 Apr 2019 21:10 UTC
103 points
31 comments11 min readLW link
(srconstantin.wordpress.com)

The LessWrong 2018 Review

Raemon21 Nov 2019 2:50 UTC
101 points
91 comments7 min readLW link