RSS

Anti-so­cial Punishment

Martin Sustrik27 Sep 2018 7:08 UTC
296 points
66 comments6 min readLW link3 reviews

The Pavlov Strategy

sarahconstantin20 Dec 2018 16:20 UTC
247 points
13 comments4 min readLW link
(srconstantin.wordpress.com)

Embed­ded Agents

29 Oct 2018 19:53 UTC
222 points
41 comments1 min readLW link2 reviews

The Rocket Align­ment Problem

Eliezer Yudkowsky4 Oct 2018 0:38 UTC
216 points
41 comments15 min readLW link2 reviews

A vot­ing the­ory primer for rationalists

Jameson Quinn12 Apr 2018 15:15 UTC
229 points
98 comments17 min readLW link2 reviews

Norms of Mem­ber­ship for Vol­un­tary Groups

sarahconstantin11 Dec 2018 22:10 UTC
192 points
10 comments7 min readLW link
(srconstantin.wordpress.com)

2018 AI Align­ment Liter­a­ture Re­view and Char­ity Comparison

Larks18 Dec 2018 4:46 UTC
190 points
26 comments62 min readLW link1 review

Ar­bital postmortem

alexei30 Jan 2018 13:48 UTC
227 points
110 comments19 min readLW link

Spaghetti Towers

eukaryote22 Dec 2018 5:29 UTC
187 points
28 comments3 min readLW link1 review
(eukaryotewritesblog.com)

My at­tempt to ex­plain Look­ing, in­sight med­i­ta­tion, and en­light­en­ment in non-mys­te­ri­ous terms

Kaj_Sotala8 Mar 2018 7:37 UTC
222 points
135 comments17 min readLW link2 reviews

Act of Charity

jessicata17 Nov 2018 5:19 UTC
186 points
49 comments8 min readLW link1 review

The In­tel­li­gent So­cial Web

Valentine22 Feb 2018 18:55 UTC
224 points
112 comments12 min readLW link2 reviews

Embed­ded Agency (full-text ver­sion)

15 Nov 2018 19:49 UTC
180 points
17 comments54 min readLW link

Notic­ing the Taste of Lotus

Valentine27 Apr 2018 20:05 UTC
203 points
81 comments3 min readLW link3 reviews

Is Click­bait De­stroy­ing Our Gen­eral In­tel­li­gence?

Eliezer Yudkowsky16 Nov 2018 23:06 UTC
189 points
61 comments5 min readLW link2 reviews

Un­rol­ling so­cial metacog­ni­tion: Three lev­els of meta are not enough.

Academian25 Aug 2018 12:00 UTC
187 points
44 comments7 min readLW link1 review

Real­ism about rationality

Richard_Ngo16 Sep 2018 10:46 UTC
184 points
146 comments4 min readLW link3 reviews
(thinkingcomplete.blogspot.com)

A LessWrong Crypto Autopsy

Scott Alexander28 Jan 2018 9:01 UTC
216 points
129 comments4 min readLW link4 reviews

A Sketch of Good Communication

Ben Pace31 Mar 2018 22:48 UTC
198 points
35 comments3 min readLW link1 review

Lo­cal Val­idity as a Key to San­ity and Civilization

Eliezer Yudkowsky7 Apr 2018 4:25 UTC
194 points
67 comments13 min readLW link5 reviews

The Costly Co­or­di­na­tion Mechanism of Com­mon Knowledge

Ben Pace15 Mar 2018 20:20 UTC
194 points
31 comments19 min readLW link2 reviews

Babble

alkjash10 Jan 2018 21:56 UTC
200 points
32 comments5 min readLW link2 reviews
(radimentary.wordpress.com)

Inad­e­quate Equil­ibria vs. Gover­nance of the Commons

Martin Sustrik25 May 2018 13:17 UTC
182 points
17 comments14 min readLW link2 reviews

Tran­shu­man­ism as Sim­plified Humanism

Eliezer Yudkowsky5 Dec 2018 20:12 UTC
170 points
34 comments5 min readLW link

Some cruxes on im­pact­ful al­ter­na­tives to AI policy work

Richard_Ngo10 Oct 2018 13:35 UTC
165 points
13 comments12 min readLW link

In­cor­rect hy­pothe­ses point to cor­rect observations

Kaj_Sotala20 Nov 2018 21:10 UTC
160 points
37 comments4 min readLW link
(kajsotala.fi)

“Cheat to Win”: Eng­ineer­ing Pos­i­tive So­cial Feedback

sarahconstantin5 Feb 2018 23:16 UTC
184 points
36 comments2 min readLW link

Pre­dic­tion Mar­kets: When Do They Work?

Zvi26 Jul 2018 12:30 UTC
162 points
17 comments10 min readLW link
(thezvi.wordpress.com)

Toolbox-think­ing and Law-thinking

Eliezer Yudkowsky31 May 2018 21:28 UTC
161 points
49 comments12 min readLW link

The Loud­est Alarm Is Prob­a­bly False

orthonormal2 Jan 2018 16:38 UTC
171 points
28 comments2 min readLW link1 review

The Tails Com­ing Apart As Me­taphor For Life

Scott Alexander25 Sep 2018 19:10 UTC
155 points
38 comments7 min readLW link4 reviews
(slatestarcodex.com)

De­cou­pling vs Con­tex­tu­al­is­ing Norms

Chris_Leong14 May 2018 22:44 UTC
155 points
51 comments2 min readLW link3 reviews

Prob­lem Solv­ing with Mazes and Crayon

johnswentworth19 Jun 2018 6:15 UTC
149 points
28 comments7 min readLW link

An Un­trol­lable Math­e­mat­i­cian Illustrated

abramdemski20 Mar 2018 0:00 UTC
157 points
38 comments1 min readLW link1 review

Oops on Com­mod­ity Prices

sarahconstantin10 Jun 2018 15:40 UTC
148 points
8 comments2 min readLW link
(srconstantin.wordpress.com)

His­tor­i­cal math­e­mat­i­ci­ans ex­hibit a birth or­der effect too

Eli Tyre21 Aug 2018 1:52 UTC
141 points
19 comments6 min readLW link2 reviews

Be­ing a Ro­bust Agent

Raemon18 Oct 2018 7:00 UTC
145 points
32 comments7 min readLW link2 reviews

Strate­gies for Per­sonal Growth

Raemon28 Jul 2018 18:27 UTC
142 points
27 comments4 min readLW link

Ex­pres­sive Vocabulary

Alicorn24 May 2018 6:59 UTC
143 points
71 comments5 min readLW link1 review

Is Science Slow­ing Down?

Scott Alexander27 Nov 2018 3:30 UTC
125 points
77 comments9 min readLW link1 review
(slatestarcodex.com)

Good Sa­mar­i­tans in experiments

Bucky30 Oct 2018 23:34 UTC
125 points
14 comments9 min readLW link

Meta-Hon­esty: Firm­ing Up Hon­esty Around Its Edge-Cases

Eliezer Yudkowsky29 May 2018 0:59 UTC
134 points
152 comments27 min readLW link4 reviews

Ter­ror­ism, Tylenol, and dan­ger­ous information

Davis_Kingsley12 May 2018 10:20 UTC
145 points
46 comments3 min readLW link

[Question] What makes peo­ple in­tel­lec­tu­ally ac­tive?

abramdemski29 Dec 2018 22:29 UTC
116 points
71 comments1 min readLW link

Con­trite Strate­gies and The Need For Standards

sarahconstantin24 Dec 2018 18:30 UTC
125 points
5 comments4 min readLW link
(srconstantin.wordpress.com)

On Do­ing the Improbable

Eliezer Yudkowsky28 Oct 2018 20:09 UTC
128 points
36 comments1 min readLW link1 review

Paul’s re­search agenda FAQ

zhukeepa1 Jul 2018 6:25 UTC
126 points
74 comments19 min readLW link1 review

Co­her­ence ar­gu­ments do not en­tail goal-di­rected behavior

Rohin Shah3 Dec 2018 3:26 UTC
123 points
69 comments7 min readLW link3 reviews

Un­known Knowns

Zvi28 Aug 2018 13:20 UTC
120 points
17 comments2 min readLW link1 review
(thezvi.wordpress.com)

Beyond Astro­nom­i­cal Waste

Wei Dai7 Jun 2018 21:04 UTC
125 points
41 comments3 min readLW link

Pri­son­ers’ Dilemma with Costs to Modeling

Scott Garrabrant5 Jun 2018 4:51 UTC
123 points
20 comments7 min readLW link

Challenges to Chris­ti­ano’s ca­pa­bil­ity am­plifi­ca­tion proposal

Eliezer Yudkowsky19 May 2018 18:18 UTC
124 points
54 comments23 min readLW link1 review

Me­la­tonin: Much More Than You Wanted To Know

Scott Alexander11 Jul 2018 17:40 UTC
119 points
16 comments15 min readLW link
(slatestarcodex.com)

Ro­bust­ness to Scale

Scott Garrabrant21 Feb 2018 22:55 UTC
128 points
23 comments2 min readLW link1 review

Mak­ing your­self small

Helen8 Mar 2018 14:26 UTC
127 points
53 comments11 min readLW link

De­ci­sion Theory

31 Oct 2018 18:41 UTC
117 points
45 comments1 min readLW link

Ro­bust Delegation

4 Nov 2018 16:38 UTC
116 points
10 comments1 min readLW link

Med­i­ta­tions on Momentum

Richard Meadows14 Dec 2018 10:53 UTC
103 points
32 comments10 min readLW link

Critch on ca­reer ad­vice for ju­nior AI-x-risk-con­cerned researchers

Rob Bensinger12 May 2018 2:13 UTC
118 points
25 comments4 min readLW link

Co­or­di­na­tion Prob­lems in Evolu­tion: Ei­gen’s Paradox

Martin Sustrik12 Oct 2018 12:40 UTC
102 points
6 comments8 min readLW link
(250bpm.com)

Are eth­i­cal asym­me­tries from prop­erty rights?

KatjaGrace2 Jul 2018 3:00 UTC
108 points
37 comments3 min readLW link
(meteuphoric.com)

Y Couchinator

Alicorn18 Aug 2018 3:41 UTC
111 points
33 comments4 min readLW link

Op­ti­miza­tion Amplifies

Scott Garrabrant27 Jun 2018 1:51 UTC
114 points
12 comments4 min readLW link

Nam­ing the Nameless

sarahconstantin22 Mar 2018 0:35 UTC
120 points
43 comments13 min readLW link3 reviews

[Question] How did academia en­sure pa­pers were cor­rect in the early 20th Cen­tury?

Ben Pace29 Dec 2018 23:37 UTC
99 points
17 comments2 min readLW link1 review

Why ev­ery­thing might have taken so long

KatjaGrace1 Jan 2018 1:00 UTC
112 points
16 comments3 min readLW link1 review
(meteuphoric.wordpress.com)

Coun­ter­fac­tual Mug­ging Poker Game

Scott Garrabrant13 Jun 2018 23:34 UTC
111 points
3 comments1 min readLW link

Sub­si­diz­ing Pre­dic­tion Markets

Zvi17 Aug 2018 15:40 UTC
96 points
8 comments11 min readLW link
(thezvi.wordpress.com)

Two Ne­glected Prob­lems in Hu­man-AI Safety

Wei Dai16 Dec 2018 22:13 UTC
98 points
24 comments2 min readLW link

Sub­sys­tem Alignment

6 Nov 2018 16:16 UTC
99 points
12 comments1 min readLW link

Sam Har­ris and the Is–Ought Gap

Tyrrell_McAllister16 Nov 2018 1:04 UTC
89 points
46 comments6 min readLW link

The Kelly Criterion

Zvi15 Oct 2018 21:20 UTC
101 points
24 comments3 min readLW link
(thezvi.wordpress.com)

Ma­chine Learn­ing Anal­ogy for Med­i­ta­tion (illus­trated)

abramdemski28 Jun 2018 22:51 UTC
97 points
48 comments1 min readLW link

Play­ing Politics

sarahconstantin5 Dec 2018 0:30 UTC
97 points
45 comments12 min readLW link
(srconstantin.wordpress.com)

Pre­limi­nary thoughts on moral weight

lukeprog13 Aug 2018 23:45 UTC
93 points
49 comments8 min readLW link2 reviews

Write a Thou­sand Roads to Rome

Screwtape8 Feb 2018 18:09 UTC
105 points
17 comments4 min readLW link

Tran­shu­man­ists Don’t Need Spe­cial Dispositions

Eliezer Yudkowsky7 Dec 2018 22:24 UTC
96 points
18 comments5 min readLW link

Towards a New Im­pact Measure

TurnTrout18 Sep 2018 17:21 UTC
100 points
159 comments33 min readLW link2 reviews

His­tory of the Devel­op­ment of Log­i­cal Induction

Scott Garrabrant29 Aug 2018 3:15 UTC
100 points
4 comments5 min readLW link

Zetetic explanation

Benquo27 Aug 2018 0:12 UTC
90 points
138 comments6 min readLW link
(benjaminrosshoffman.com)

Trust Me I’m Ly­ing: A Sum­mary and Review

quanticle13 Aug 2018 2:55 UTC
100 points
11 comments7 min readLW link
(quanticle.net)

Public Po­si­tions and Pri­vate Guts

Vaniver11 Oct 2018 19:38 UTC
85 points
13 comments8 min readLW link

Should ethi­cists be in­side or out­side a pro­fes­sion?

Eliezer Yudkowsky12 Dec 2018 1:40 UTC
91 points
7 comments9 min readLW link

Bot­tle Caps Aren’t Optimisers

DanielFilan31 Aug 2018 18:30 UTC
97 points
22 comments3 min readLW link1 review
(danielfilan.com)

Up­date the best text­books on ev­ery sub­ject list

ryan_b8 Nov 2018 20:54 UTC
92 points
14 comments1 min readLW link

Of Two Minds

Valentine17 May 2018 4:34 UTC
93 points
12 comments2 min readLW link

Va­ri­eties Of Ar­gu­men­ta­tive Experience

Scott Alexander8 May 2018 8:20 UTC
93 points
11 comments18 min readLW link2 reviews
(slatestarcodex.com)

Un­der­stand­ing is translation

cousin_it28 May 2018 13:56 UTC
92 points
23 comments1 min readLW link

Embed­ded World-Models

2 Nov 2018 16:07 UTC
92 points
16 comments1 min readLW link

The fun­nel of hu­man experience

eukaryote10 Oct 2018 2:46 UTC
83 points
31 comments3 min readLW link1 review
(eukaryotewritesblog.com)

Embed­ded Curiosities

8 Nov 2018 14:19 UTC
91 points
1 comment2 min readLW link

In Log­i­cal Time, All Games are Iter­ated Games

abramdemski20 Sep 2018 2:01 UTC
93 points
10 comments5 min readLW link

Player vs. Char­ac­ter: A Two-Level Model of Ethics

sarahconstantin14 Dec 2018 19:40 UTC
88 points
27 comments7 min readLW link3 reviews
(srconstantin.wordpress.com)

Dou­ble-Dip­ping in Dun­ning—Kruger

isovector28 Nov 2018 3:40 UTC
88 points
31 comments3 min readLW link

Ham­mer­time Day 1: Bug Hunt

alkjash30 Jan 2018 6:40 UTC
105 points
25 comments5 min readLW link
(radimentary.wordpress.com)

An­nounce­ment: AI al­ign­ment prize round 3 win­ners and next round

cousin_it15 Jul 2018 7:40 UTC
93 points
7 comments1 min readLW link

New edi­tion of “Ra­tion­al­ity: From AI to Zom­bies”

Rob Bensinger15 Dec 2018 21:33 UTC
84 points
27 comments2 min readLW link

In­tro­duc­ing the AI Align­ment Fo­rum (FAQ)

29 Oct 2018 21:07 UTC
86 points
8 comments6 min readLW link

Coun­ter­in­tu­itive Com­par­a­tive Advantage

Wei Dai28 Nov 2018 20:33 UTC
84 points
8 comments2 min readLW link

Ar­gu­ments about fast takeoff

paulfchristiano25 Feb 2018 4:53 UTC
89 points
65 comments2 min readLW link1 review
(sideways-view.com)