Heads I Win, Tails?—Never Heard of Her; Or, Selective Reporting and the Tragedy of the Green Rationalists

Zack_M_Davis 24 Sep 2019 4:12 UTC
299 points
40 comments 8 min read LW link 2 reviews

The Zettelkasten Method

abramdemski 20 Sep 2019 13:15 UTC
216 points
90 comments 42 min read LW link 3 reviews

The unexpected difficulty of comparing AlphaStar to humans

Richard Korzekwa 18 Sep 2019 2:20 UTC
145 points
36 comments 26 min read LW link
(aiimpacts.org)

Honoring Petrov Day on LessWrong, in 2019

Ben Pace 26 Sep 2019 9:10 UTC
137 points
168 comments 4 min read LW link

Utility ≠ Reward

Vlad Mikulik 5 Sep 2019 17:28 UTC
121 points
24 comments 1 min read LW link 2 reviews

AI Safety “Success Stories”

Wei Dai 7 Sep 2019 2:54 UTC
117 points
27 comments 4 min read LW link 1 review

What is operations?

Swimmer963 (Miranda Dixon-Luinenburg) 26 Sep 2019 14:16 UTC
110 points
9 comments 7 min read LW link

Gears vs Behavior

johnswentworth 19 Sep 2019 6:50 UTC
107 points
13 comments 7 min read LW link 1 review

System 2 as working-memory augmented System 1 reasoning

Kaj_Sotala 25 Sep 2019 8:39 UTC
107 points
23 comments 16 min read LW link

The Power to Demolish Bad Arguments

Liron 2 Sep 2019 12:57 UTC
97 points
83 comments 11 min read LW link 6 reviews

Reframing Impact

TurnTrout 20 Sep 2019 19:03 UTC
96 points
15 comments 3 min read LW link 1 review

The Power to Judge Startup Ideas

Liron 4 Sep 2019 15:07 UTC
93 points
28 comments 8 min read LW link

Meetups as Institutions for Intellectual Progress

mingyuan 17 Sep 2019 5:23 UTC
91 points
26 comments 7 min read LW link

Reframing the evolutionary benefit of sex

paulfchristiano 14 Sep 2019 17:00 UTC
91 points
21 comments 2 min read LW link
(sideways-view.com)

Rationality Exercises Prize of September 2019 ($1,000)

Ben Pace 11 Sep 2019 0:19 UTC
89 points
18 comments 5 min read LW link

The Power to Teach Concepts Better

Liron 23 Sep 2019 0:21 UTC
89 points
22 comments 8 min read LW link 1 review

How Specificity Works

Liron 3 Sep 2019 12:11 UTC
88 points
47 comments 7 min read LW link

Bioinfohazards

Spiracular 17 Sep 2019 2:41 UTC
87 points
14 comments 18 min read LW link 2 reviews

The strategy-stealing assumption

paulfchristiano 16 Sep 2019 15:23 UTC
86 points
53 comments 12 min read LW link 3 reviews

A Critique of Functional Decision Theory

wdmacaskill 13 Sep 2019 19:23 UTC
86 points
56 comments 20 min read LW link

Novum Organum: Introduction

Ruby 19 Sep 2019 22:34 UTC
86 points
5 comments 6 min read LW link

Specificity: Your Brain’s Superpower

Liron 2 Sep 2019 12:53 UTC
82 points
8 comments 1 min read LW link

Follow-Up to Petrov Day, 2019

Ben Pace 27 Sep 2019 23:47 UTC
78 points
20 comments 3 min read LW link

How Much is Your Time Worth?

lynettebye 2 Sep 2019 6:19 UTC
77 points
22 comments 6 min read LW link 1 review

Are minimal circuits deceptive?

evhub 7 Sep 2019 18:11 UTC
77 points
11 comments 8 min read LW link

Don’t depend on others to ask for explanations

Wei Dai 18 Sep 2019 19:12 UTC
77 points
10 comments 1 min read LW link

Partial Agency

abramdemski 27 Sep 2019 22:04 UTC
72 points
18 comments 9 min read LW link

Deducing Impact

TurnTrout 24 Sep 2019 21:14 UTC
72 points
28 comments 1 min read LW link

Concrete experiments in inner alignment

evhub 6 Sep 2019 22:16 UTC
71 points
12 comments 6 min read LW link

A simple environment for showing mesa misalignment

Matthew Barnett 26 Sep 2019 4:44 UTC
71 points
9 comments 2 min read LW link

Value Impact

TurnTrout 23 Sep 2019 0:47 UTC
70 points
10 comments 1 min read LW link

Relaxed adversarial training for inner alignment

evhub 10 Sep 2019 23:03 UTC
69 points
27 comments 27 min read LW link

Modes of Petrov Day

Raemon 17 Sep 2019 2:47 UTC
69 points
30 comments 1 min read LW link

The YouTube Revolution in Knowledge Transfer

Samo Burja 17 Sep 2019 20:10 UTC
67 points
7 comments 4 min read LW link
(medium.com)

Attainable Utility Theory: Why Things Matter

TurnTrout 27 Sep 2019 16:48 UTC
65 points
24 comments 1 min read LW link

Christiano decision theory excerpt

Rob Bensinger 29 Sep 2019 2:55 UTC
65 points
0 comments 5 min read LW link

Shallow Review of Consistency in Statement Evaluation

Elizabeth 9 Sep 2019 23:21 UTC
65 points
6 comments 9 min read LW link

Candy for Nets

jefftk 29 Sep 2019 11:10 UTC
62 points
1 comment 2 min read LW link
(www.jefftk.com)

Age gaps and Birth order: Failed reproduction of results

Bucky 7 Sep 2019 19:22 UTC
57 points
1 comment 5 min read LW link

Novum Organum: Preface

Francis Bacon 19 Sep 2019 22:51 UTC
54 points
5 comments 7 min read LW link

Integrating the Lindy Effect

lsusr 7 Sep 2019 17:38 UTC
53 points
11 comments 2 min read LW link 1 review

Free Money at PredictIt?

Zvi 26 Sep 2019 16:10 UTC
49 points
17 comments 6 min read LW link
(thezvi.wordpress.com)

Probability as Minimal Map

johnswentworth 1 Sep 2019 19:19 UTC
48 points
10 comments 5 min read LW link

Counterfactual Oracles = online supervised learning with random selection of training episodes

Wei Dai 10 Sep 2019 8:29 UTC
48 points
26 comments 3 min read LW link

Age gaps and Birth order: Reanalysis

Bucky 7 Sep 2019 19:33 UTC
48 points
4 comments 8 min read LW link

Do Sufficiently Advanced Agents Use Logic?

abramdemski 13 Sep 2019 19:53 UTC
47 points
10 comments 9 min read LW link

Free-to-Play Games: Three Key Trade-Offs

Zvi 10 Sep 2019 12:10 UTC
47 points
4 comments 5 min read LW link
(thezvi.wordpress.com)

Realism and Rationality

bmgarfinkel 16 Sep 2019 3:09 UTC
45 points
49 comments 23 min read LW link

Conversation with Paul Christiano

abergal 11 Sep 2019 23:20 UTC
44 points
6 comments 30 min read LW link
(aiimpacts.org)

Towards an empirical investigation of inner alignment

evhub 23 Sep 2019 20:43 UTC
44 points
9 comments 6 min read LW link