RSS

Best of LessWrong

TagLast edit: 9 Feb 2023 2:01 UTC by Raemon

ARC’s first tech­ni­cal re­port: Elic­it­ing La­tent Knowledge

14 Dec 2021 20:09 UTC
225 points
90 comments1 min readLW link3 reviews
(docs.google.com)

Sav­ing Time

Scott Garrabrant18 May 2021 20:11 UTC
151 points
20 comments4 min readLW link1 review

Lars Doucet’s Ge­or­gism se­ries on As­tral Codex Ten

Sune4 Dec 2021 19:43 UTC
13 points
2 comments1 min readLW link1 review
(astralcodexten.substack.com)

Worst-case think­ing in AI alignment

Buck23 Dec 2021 1:29 UTC
162 points
18 comments6 min readLW link2 reviews

The Point of Trade

Eliezer Yudkowsky22 Jun 2021 17:56 UTC
171 points
76 comments4 min readLW link1 review

Spe­cial­iz­ing in Prob­lems We Don’t Understand

johnswentworth10 Apr 2021 22:40 UTC
159 points
29 comments8 min readLW link1 review

You are prob­a­bly un­der­es­ti­mat­ing how good self-love can be

Charlie Rogers-Smith14 Nov 2021 0:41 UTC
144 points
19 comments12 min readLW link1 review

Cup-Stack­ing Skills (or, Reflex­ive In­vol­un­tary Men­tal Mo­tions)

[DEACTIVATED] Duncan Sabien11 Oct 2021 7:16 UTC
117 points
36 comments7 min readLW link2 reviews

“PR” is cor­ro­sive; “rep­u­ta­tion” is not.

AnnaSalamon14 Feb 2021 3:32 UTC
307 points
93 comments2 min readLW link3 reviews

Nu­clear war is un­likely to cause hu­man extinction

Jeffrey Ladish7 Nov 2020 5:42 UTC
123 points
47 comments11 min readLW link3 reviews

Can crimes be dis­cussed liter­ally?

Benquo22 Mar 2020 20:17 UTC
102 points
38 comments2 min readLW link3 reviews
(benjaminrosshoffman.com)

The Road to Mazedom

Zvi18 Jan 2020 14:10 UTC
94 points
25 comments7 min readLW link2 reviews
(thezvi.wordpress.com)

Notes from “Don’t Shoot the Dog”

juliawise2 Apr 2021 16:34 UTC
244 points
11 comments12 min readLW link1 review

Strong Ev­i­dence is Common

Mark Xu13 Mar 2021 22:04 UTC
244 points
49 comments1 min readLW link4 reviews
(markxu.com)

The date of AI Takeover is not the day the AI takes over

Daniel Kokotajlo22 Oct 2020 10:41 UTC
144 points
32 comments2 min readLW link1 review

In­ner Align­ment: Ex­plain like I’m 12 Edition

Rafael Harth1 Aug 2020 15:24 UTC
179 points
46 comments13 min readLW link2 reviews

Frame Control

Aella27 Nov 2021 22:59 UTC
314 points
282 comments23 min readLW link2 reviews

What Do GDP Growth Curves Really Mean?

johnswentworth7 Oct 2021 21:58 UTC
219 points
64 comments8 min readLW link2 reviews

Anti-Aging: State of the Art

JackH31 Dec 2020 19:07 UTC
371 points
176 comments11 min readLW link1 review

Covid-19: My Cur­rent Model

Zvi31 May 2020 17:40 UTC
188 points
74 comments19 min readLW link1 review
(thezvi.wordpress.com)

Shoulder Ad­vi­sors 101

[DEACTIVATED] Duncan Sabien9 Oct 2021 5:30 UTC
193 points
124 comments14 min readLW link2 reviews

Utility Max­i­miza­tion = De­scrip­tion Length Minimization

johnswentworth18 Feb 2021 18:04 UTC
202 points
44 comments5 min readLW link

A non-mys­ti­cal ex­pla­na­tion of “no-self” (three char­ac­ter­is­tics se­ries)

Kaj_Sotala8 May 2020 10:37 UTC
105 points
65 comments20 min readLW link1 review

What Mul­tipo­lar Failure Looks Like, and Ro­bust Agent-Ag­nos­tic Pro­cesses (RAAPs)

Andrew_Critch31 Mar 2021 23:50 UTC
272 points
64 comments22 min readLW link1 review

Elephant seal 2

KatjaGrace2 Feb 2021 9:40 UTC
57 points
5 comments1 min readLW link2 reviews
(worldspiritsockpuppet.com)

The Poin­t­ers Prob­lem: Hu­man Values Are A Func­tion Of Hu­mans’ La­tent Variables

johnswentworth18 Nov 2020 17:47 UTC
123 points
49 comments11 min readLW link2 reviews

Stud­ies On Slack

Scott Alexander13 May 2020 5:00 UTC
151 points
34 comments24 min readLW link1 review
(slatestarcodex.com)

The Death of Be­hav­ioral Economics

habryka22 Aug 2021 22:39 UTC
152 points
24 comments1 min readLW link2 reviews
(www.thebehavioralscientist.com)

Fea­ture Selection

Zack_M_Davis1 Nov 2021 0:22 UTC
315 points
24 comments16 min readLW link1 review

Draft re­port on AI timelines

Ajeya Cotra18 Sep 2020 23:47 UTC
214 points
56 comments1 min readLW link1 review

Search ver­sus design

Alex Flint16 Aug 2020 16:53 UTC
100 points
40 comments36 min readLW link1 review

Rul­ing Out Every­thing Else

[DEACTIVATED] Duncan Sabien27 Oct 2021 21:50 UTC
190 points
51 comments21 min readLW link2 reviews

There’s no such thing as a tree (phy­lo­ge­net­i­cally)

eukaryote3 May 2021 3:47 UTC
333 points
58 comments8 min readLW link2 reviews
(eukaryotewritesblog.com)

Si­mu­lacrum 3 As Stag-Hunt Strategy

johnswentworth26 Jan 2021 19:40 UTC
178 points
37 comments4 min readLW link3 reviews

Lies, Damn Lies, and Fabri­cated Options

[DEACTIVATED] Duncan Sabien17 Oct 2021 2:47 UTC
288 points
131 comments14 min readLW link2 reviews

Catch­ing the Spark

LoganStrohl30 Jan 2021 23:23 UTC
111 points
21 comments36 min readLW link1 review

Is Suc­cess the Enemy of Free­dom? (Full)

alkjash26 Oct 2020 20:25 UTC
286 points
68 comments9 min readLW link1 review
(radimentary.wordpress.com)

What cog­ni­tive bi­ases feel like from the inside

chaosmage3 Jan 2020 14:24 UTC
249 points
32 comments4 min readLW link

Swiss Poli­ti­cal Sys­tem: More than You ever Wanted to Know (I.)

Martin Sustrik19 Jul 2020 1:11 UTC
171 points
39 comments24 min readLW link2 reviews

My re­search methodology

paulfchristiano22 Mar 2021 21:20 UTC
159 points
38 comments16 min readLW link1 review
(ai-alignment.com)

The Plan

johnswentworth10 Dec 2021 23:41 UTC
254 points
78 comments14 min readLW link1 review

Trapped Pri­ors As A Ba­sic Prob­lem Of Rationality

Scott Alexander12 Mar 2021 20:02 UTC
141 points
32 comments14 min readLW link3 reviews

The Align­ment Prob­lem: Ma­chine Learn­ing and Hu­man Values

Rohin Shah6 Oct 2020 17:41 UTC
120 points
7 comments6 min readLW link1 review
(www.amazon.com)

In­tro­duc­tion to Carte­sian Frames

Scott Garrabrant22 Oct 2020 13:00 UTC
153 points
32 comments22 min readLW link1 review

Fun with +12 OOMs of Compute

Daniel Kokotajlo1 Mar 2021 13:30 UTC
223 points
86 comments12 min readLW link2 reviews

AGI safety from first prin­ci­ples: Introduction

Richard_Ngo28 Sep 2020 19:53 UTC
120 points
18 comments2 min readLW link1 review

An overview of 11 pro­pos­als for build­ing safe ad­vanced AI

evhub29 May 2020 20:38 UTC
205 points
36 comments38 min readLW link2 reviews

Finite Fac­tored Sets

Scott Garrabrant23 May 2021 20:52 UTC
146 points
95 comments24 min readLW link1 review

Your Cheer­ful Price

Eliezer Yudkowsky13 Feb 2021 5:41 UTC
262 points
82 comments17 min readLW link6 reviews

In­tro­duc­tion To The In­fra-Bayesi­anism Sequence

26 Aug 2020 20:31 UTC
108 points
62 comments14 min readLW link2 reviews

Jean Mon­net: The Guerilla Bureaucrat

Martin Sustrik20 Mar 2021 10:37 UTC
174 points
25 comments18 min readLW link1 review

Cry­on­ics signup guide #1: Overview

mingyuan6 Jan 2021 0:25 UTC
150 points
33 comments6 min readLW link1 review

The Solomonoff Prior is Malign

Mark Xu14 Oct 2020 1:33 UTC
168 points
52 comments16 min readLW link3 reviews

Si­mu­lacra Levels and their Interactions

Zvi15 Jun 2020 13:10 UTC
195 points
50 comments17 min readLW link1 review
(thezvi.wordpress.com)

Grokking the In­ten­tional Stance

jbkjr31 Aug 2021 15:49 UTC
43 points
22 comments20 min readLW link1 review

The Treach­er­ous Path to Rationality

Jacob Falkovich9 Oct 2020 15:34 UTC
204 points
115 comments11 min readLW link1 review

The ground of optimization

Alex Flint20 Jun 2020 0:38 UTC
245 points
80 comments27 min readLW link1 review

Real­ity-Re­veal­ing and Real­ity-Mask­ing Puzzles

AnnaSalamon16 Jan 2020 16:15 UTC
258 points
57 comments13 min readLW link1 review

What 2026 looks like

Daniel Kokotajlo6 Aug 2021 16:14 UTC
470 points
150 comments16 min readLW link1 review

Effi­cien­tZero: How It Works

1a3orn26 Nov 2021 15:17 UTC
292 points
50 comments29 min readLW link1 review

How fac­to­ries were made safe

jasoncrawford12 Sep 2021 19:58 UTC
179 points
46 comments18 min readLW link1 review
(rootsofprogress.org)

Ngo and Yud­kowsky on al­ign­ment difficulty

15 Nov 2021 20:31 UTC
250 points
148 comments99 min readLW link1 review

What Money Can­not Buy

johnswentworth1 Feb 2020 20:11 UTC
316 points
49 comments4 min readLW link1 review

Leaky Del­e­ga­tion: You are not a Commodity

Darmani25 Jan 2021 2:04 UTC
297 points
34 comments15 min readLW link1 review

Seven Years of Spaced Rep­e­ti­tion Soft­ware in the Classroom

tanagrabeast4 Mar 2021 2:42 UTC
265 points
38 comments34 min readLW link1 review

Some AI re­search ar­eas and their rele­vance to ex­is­ten­tial safety

Andrew_Critch19 Nov 2020 3:18 UTC
204 points
37 comments50 min readLW link2 reviews

Why haven’t we cel­e­brated any ma­jor achieve­ments lately?

jasoncrawford17 Aug 2020 20:34 UTC
194 points
69 comments12 min readLW link2 reviews
(rootsofprogress.org)

Co­or­di­na­tion as a Scarce Resource

johnswentworth25 Jan 2020 23:32 UTC
230 points
22 comments4 min readLW link2 reviews

Trans­porta­tion as a Constraint

johnswentworth6 Apr 2020 4:58 UTC
176 points
27 comments6 min readLW link1 review

Self-In­tegrity and the Drown­ing Child

Eliezer Yudkowsky24 Oct 2021 20:57 UTC
327 points
85 comments5 min readLW link1 review

The Ra­tion­al­ists of the 1950s (and be­fore) also called them­selves “Ra­tion­al­ists”

Owain_Evans28 Nov 2021 20:17 UTC
187 points
30 comments3 min readLW link1 review

Split and Commit

[DEACTIVATED] Duncan Sabien21 Nov 2021 6:27 UTC
178 points
33 comments7 min readLW link1 review

Com­ments on Car­l­smith’s “Is power-seek­ing AI an ex­is­ten­tial risk?”

So8res13 Nov 2021 4:29 UTC
138 points
14 comments40 min readLW link1 review

The First Sam­ple Gives the Most Information

Mark Xu24 Dec 2020 20:39 UTC
132 points
16 comments1 min readLW link1 review
(markxu.com)

My com­pu­ta­tional frame­work for the brain

Steven Byrnes14 Sep 2020 14:19 UTC
150 points
26 comments13 min readLW link1 review

Most Pri­soner’s Dilem­mas are Stag Hunts; Most Stag Hunts are Schel­ling Problems

abramdemski14 Sep 2020 22:13 UTC
177 points
36 comments10 min readLW link3 reviews

Cred­i­bil­ity of the CDC on SARS-CoV-2

7 Mar 2020 2:00 UTC
226 points
119 comments6 min readLW link1 review

How uniform is the neo­cor­tex?

zhukeepa4 May 2020 2:16 UTC
79 points
23 comments11 min readLW link1 review

High­lights from The Au­to­bi­og­ra­phy of An­drew Carnegie

jasoncrawford8 Apr 2021 22:03 UTC
92 points
9 comments19 min readLW link1 review
(rootsofprogress.org)

Why Neu­ral Net­works Gen­er­al­ise, and Why They Are (Kind of) Bayesian

Joar Skalse29 Dec 2020 13:33 UTC
73 points
58 comments1 min readLW link1 review

Against GDP as a met­ric for timelines and take­off speeds

Daniel Kokotajlo29 Dec 2020 17:42 UTC
140 points
18 comments14 min readLW link1 review

microCOVID.org: A tool to es­ti­mate COVID risk from com­mon activities

catherio29 Aug 2020 23:01 UTC
169 points
36 comments1 min readLW link1 review
(microcovid.org)

“Can you keep this con­fi­den­tial? How do you know?”

Raemon21 Jul 2020 0:33 UTC
159 points
41 comments3 min readLW link2 reviews

See­ing the Smoke

Jacob Falkovich28 Feb 2020 18:26 UTC
198 points
29 comments5 min readLW link1 review

In­ter­faces as a Scarce Resource

johnswentworth5 Mar 2020 18:20 UTC
187 points
15 comments12 min readLW link1 review

All Pos­si­ble Views About Hu­man­ity’s Fu­ture Are Wild

HoldenKarnofsky3 Sep 2021 20:19 UTC
146 points
37 comments8 min readLW link1 review

This Can’t Go On

HoldenKarnofsky18 Sep 2021 23:50 UTC
72 points
55 comments7 min readLW link2 reviews

Ta­boo “Out­side View”

Daniel Kokotajlo17 Jun 2021 9:36 UTC
348 points
33 comments8 min readLW link3 reviews

Another (outer) al­ign­ment failure story

paulfchristiano7 Apr 2021 20:12 UTC
241 points
38 comments12 min readLW link1 review

To listen well, get curious

benkuhn13 Dec 2020 0:20 UTC
350 points
37 comments4 min readLW link1 review
(www.benkuhn.net)

Align­ment By Default

johnswentworth12 Aug 2020 18:54 UTC
173 points
94 comments11 min readLW link2 reviews

Cortés, Pizarro, and Afonso as Prece­dents for Takeover

Daniel Kokotajlo1 Mar 2020 3:49 UTC
168 points
78 comments11 min readLW link1 review

Selec­tion The­o­rems: A Pro­gram For Un­der­stand­ing Agents

johnswentworth28 Sep 2021 5:03 UTC
123 points
28 comments6 min readLW link2 reviews

When Money Is Abun­dant, Knowl­edge Is The Real Wealth

johnswentworth3 Nov 2020 17:34 UTC
314 points
61 comments5 min readLW link3 reviews

CFAR Par­ti­ci­pant Hand­book now available to all

[DEACTIVATED] Duncan Sabien3 Jan 2020 15:43 UTC
248 points
40 comments1 min readLW link2 reviews

An Ortho­dox Case Against Utility Functions

abramdemski7 Apr 2020 19:18 UTC
152 points
65 comments8 min readLW link2 reviews

How To Write Quickly While Main­tain­ing Epistemic Rigor

johnswentworth28 Aug 2021 17:52 UTC
427 points
38 comments4 min readLW link3 reviews

Mo­tive Ambiguity

Zvi15 Dec 2020 18:10 UTC
172 points
58 comments4 min readLW link2 reviews
(thezvi.wordpress.com)

Inac­cessible information

paulfchristiano3 Jun 2020 5:10 UTC
83 points
17 comments14 min readLW link2 reviews
(ai-alignment.com)

Dis­con­tin­u­ous progress in his­tory: an update

KatjaGrace14 Apr 2020 0:00 UTC
186 points
25 comments31 min readLW link1 review
(aiimpacts.org)

Fre­quent ar­gu­ments about alignment

John Schulman23 Jun 2021 0:46 UTC
99 points
17 comments5 min readLW link

Pain is not the unit of Effort

alkjash24 Nov 2020 20:00 UTC
508 points
89 comments5 min readLW link2 reviews
(radimentary.wordpress.com)

Rad­i­cal Probabilism

abramdemski18 Aug 2020 21:14 UTC
176 points
47 comments35 min readLW link1 review

Slack Has Pos­i­tive Ex­ter­nal­ities For Groups

johnswentworth29 Jul 2021 15:03 UTC
90 points
11 comments5 min readLW link2 reviews

Science in a High-Di­men­sional World

johnswentworth8 Jan 2021 17:52 UTC
285 points
53 comments7 min readLW link1 review

The Felt Sense: What, Why and How

Kaj_Sotala5 Oct 2020 15:57 UTC
149 points
23 comments14 min readLW link1 review

Choos­ing the Zero Point

orthonormal6 Apr 2020 23:44 UTC
170 points
24 comments3 min readLW link2 reviews

Ra­tion­al­ism be­fore the Sequences

Eric Raymond30 Mar 2021 14:04 UTC
577 points
81 comments11 min readLW link2 reviews

Mak­ing Vaccine

johnswentworth3 Feb 2021 20:24 UTC
574 points
249 comments6 min readLW link3 reviews

A Sketch of Good Communication

Ben Pace31 Mar 2018 22:48 UTC
185 points
35 comments3 min readLW link1 review

Lo­cal Val­idity as a Key to San­ity and Civilization

Eliezer Yudkowsky7 Apr 2018 4:25 UTC
193 points
67 comments13 min readLW link5 reviews

The Loud­est Alarm Is Prob­a­bly False

orthonormal2 Jan 2018 16:38 UTC
171 points
28 comments2 min readLW link1 review

Va­ri­eties Of Ar­gu­men­ta­tive Experience

Scott Alexander8 May 2018 8:20 UTC
93 points
11 comments18 min readLW link2 reviews
(slatestarcodex.com)

Babble

alkjash10 Jan 2018 21:56 UTC
194 points
32 comments5 min readLW link2 reviews
(radimentary.wordpress.com)

Nam­ing the Nameless

sarahconstantin22 Mar 2018 0:35 UTC
119 points
43 comments13 min readLW link3 reviews

Toolbox-think­ing and Law-thinking

Eliezer Yudkowsky31 May 2018 21:28 UTC
160 points
49 comments12 min readLW link

Prune

alkjash12 Jan 2018 22:50 UTC
68 points
10 comments4 min readLW link
(radimentary.wordpress.com)

Towards a New Im­pact Measure

TurnTrout18 Sep 2018 17:21 UTC
100 points
159 comments33 min readLW link2 reviews

Be­ing a Ro­bust Agent

Raemon18 Oct 2018 7:00 UTC
144 points
32 comments7 min readLW link2 reviews

Notic­ing the Taste of Lotus

Valentine27 Apr 2018 20:05 UTC
203 points
81 comments3 min readLW link3 reviews

The Tails Com­ing Apart As Me­taphor For Life

Scott Alexander25 Sep 2018 19:10 UTC
151 points
38 comments7 min readLW link4 reviews
(slatestarcodex.com)

Meta-Hon­esty: Firm­ing Up Hon­esty Around Its Edge-Cases

Eliezer Yudkowsky29 May 2018 0:59 UTC
134 points
152 comments27 min readLW link4 reviews

My at­tempt to ex­plain Look­ing, in­sight med­i­ta­tion, and en­light­en­ment in non-mys­te­ri­ous terms

Kaj_Sotala8 Mar 2018 7:37 UTC
223 points
131 comments17 min readLW link2 reviews

Anti-so­cial Punishment

Martin Sustrik27 Sep 2018 7:08 UTC
296 points
66 comments6 min readLW link3 reviews

The Costly Co­or­di­na­tion Mechanism of Com­mon Knowledge

Ben Pace15 Mar 2018 20:20 UTC
193 points
31 comments19 min readLW link2 reviews

The In­tel­li­gent So­cial Web

Valentine22 Feb 2018 18:55 UTC
224 points
112 comments12 min readLW link2 reviews

Pre­dic­tion Mar­kets: When Do They Work?

Zvi26 Jul 2018 12:30 UTC
162 points
17 comments10 min readLW link
(thezvi.wordpress.com)

Spaghetti Towers

eukaryote22 Dec 2018 5:29 UTC
186 points
28 comments3 min readLW link1 review
(eukaryotewritesblog.com)

On the Loss and Preser­va­tion of Knowledge

Samo Burja8 Mar 2018 18:40 UTC
66 points
20 comments10 min readLW link
(medium.com)

A vot­ing the­ory primer for rationalists

Jameson Quinn12 Apr 2018 15:15 UTC
227 points
97 comments17 min readLW link2 reviews

The Pavlov Strategy

sarahconstantin20 Dec 2018 16:20 UTC
247 points
13 comments4 min readLW link
(srconstantin.wordpress.com)

Inad­e­quate Equil­ibria vs. Gover­nance of the Commons

Martin Sustrik25 May 2018 13:17 UTC
182 points
17 comments14 min readLW link2 reviews

Is Science Slow­ing Down?

Scott Alexander27 Nov 2018 3:30 UTC
125 points
77 comments9 min readLW link1 review
(slatestarcodex.com)

Re­search: Res­cuers dur­ing the Holocaust

Martin Sustrik30 Apr 2018 6:15 UTC
88 points
10 comments9 min readLW link1 review

An Un­trol­lable Mathematician

abramdemski23 Jan 2018 18:46 UTC
23 points
1 comment3 min readLW link

Why did ev­ery­thing take so long?

KatjaGrace29 Dec 2017 1:00 UTC
33 points
17 comments1 min readLW link
(meteuphoric.wordpress.com)

Is Click­bait De­stroy­ing Our Gen­eral In­tel­li­gence?

Eliezer Yudkowsky16 Nov 2018 23:06 UTC
189 points
61 comments5 min readLW link2 reviews

[Question] What makes peo­ple in­tel­lec­tu­ally ac­tive?

abramdemski29 Dec 2018 22:29 UTC
116 points
71 comments1 min readLW link

Open ques­tion: are min­i­mal cir­cuits dae­mon-free?

paulfchristiano5 May 2018 22:40 UTC
84 points
70 comments2 min readLW link1 review

Beyond Astro­nom­i­cal Waste

Wei Dai7 Jun 2018 21:04 UTC
126 points
41 comments3 min readLW link

His­tor­i­cal math­e­mat­i­ci­ans ex­hibit a birth or­der effect too

Eli Tyre21 Aug 2018 1:52 UTC
137 points
18 comments6 min readLW link2 reviews

Birth or­der effect found in No­bel Lau­re­ates in Physics

Bucky4 Sep 2018 12:17 UTC
61 points
25 comments5 min readLW link1 review

Ar­gu­ments about fast takeoff

paulfchristiano25 Feb 2018 4:53 UTC
89 points
65 comments2 min readLW link1 review
(sideways-view.com)

Speci­fi­ca­tion gam­ing ex­am­ples in AI

Vika3 Apr 2018 12:30 UTC
45 points
9 comments1 min readLW link2 reviews

The Rocket Align­ment Problem

Eliezer Yudkowsky4 Oct 2018 0:38 UTC
216 points
41 comments15 min readLW link2 reviews

Embed­ded Agents

29 Oct 2018 19:53 UTC
221 points
41 comments1 min readLW link2 reviews

Paul’s re­search agenda FAQ

zhukeepa1 Jul 2018 6:25 UTC
126 points
74 comments19 min readLW link1 review

Challenges to Chris­ti­ano’s ca­pa­bil­ity am­plifi­ca­tion proposal

Eliezer Yudkowsky19 May 2018 18:18 UTC
124 points
54 comments23 min readLW link1 review

Ro­bust­ness to Scale

Scott Garrabrant21 Feb 2018 22:55 UTC
128 points
23 comments2 min readLW link1 review

Co­her­ence ar­gu­ments do not en­tail goal-di­rected behavior

Rohin Shah3 Dec 2018 3:26 UTC
123 points
69 comments7 min readLW link3 reviews

Rule Thinkers In, Not Out

Scott Alexander27 Feb 2019 2:40 UTC
219 points
67 comments4 min readLW link4 reviews
(slatestarcodex.com)

Gears vs Behavior

johnswentworth19 Sep 2019 6:50 UTC
101 points
13 comments7 min readLW link1 review

Book Re­view: The Se­cret Of Our Success

Scott Alexander5 Jun 2019 6:50 UTC
158 points
19 comments25 min readLW link2 reviews
(slatestarcodex.com)

Rea­son isn’t magic

Benquo18 Jun 2019 4:04 UTC
151 points
19 comments2 min readLW link3 reviews
(benjaminrosshoffman.com)

“Other peo­ple are wrong” vs “I am right”

Buck22 Feb 2019 20:01 UTC
246 points
20 comments9 min readLW link2 reviews

In My Culture

[DEACTIVATED] Duncan Sabien7 Mar 2019 7:22 UTC
66 points
59 comments1 min readLW link2 reviews
(medium.com)

Chris Olah’s views on AGI safety

evhub1 Nov 2019 20:13 UTC
206 points
38 comments12 min readLW link2 reviews

Un­der­stand­ing “Deep Dou­ble Des­cent”

evhub6 Dec 2019 0:00 UTC
146 points
51 comments5 min readLW link4 reviews

How to Ig­nore Your Emo­tions (while also think­ing you’re awe­some at emo­tions)

Hazard31 Jul 2019 13:34 UTC
335 points
72 comments4 min readLW link4 reviews

Paper-Read­ing for Gears

johnswentworth4 Dec 2019 21:02 UTC
159 points
6 comments4 min readLW link1 review

Book sum­mary: Un­lock­ing the Emo­tional Brain

Kaj_Sotala8 Oct 2019 19:11 UTC
313 points
48 comments21 min readLW link3 reviews

Notic­ing Frame Differences

Raemon30 Sep 2019 1:24 UTC
207 points
39 comments9 min readLW link2 reviews

Prop­a­gat­ing Facts into Aesthetics

Raemon19 Dec 2019 4:09 UTC
109 points
35 comments11 min readLW link1 review

Do you fear the rock or the hard place?

Ruby20 Jul 2019 22:01 UTC
67 points
10 comments5 min readLW link3 reviews

Men­tal Mountains

Scott Alexander27 Nov 2019 5:30 UTC
143 points
14 comments15 min readLW link1 review
(slatestarcodex.com)

Steel­man­ning Divination

Vaniver5 Jun 2019 22:53 UTC
191 points
48 comments6 min readLW link2 reviews

Book Re­view: De­sign Prin­ci­ples of Biolog­i­cal Circuits

johnswentworth5 Nov 2019 6:49 UTC
209 points
24 comments12 min readLW link1 review

Refram­ing Su­per­in­tel­li­gence: Com­pre­hen­sive AI Ser­vices as Gen­eral Intelligence

Rohin Shah8 Jan 2019 7:12 UTC
121 points
77 comments5 min readLW link2 reviews
(www.fhi.ox.ac.uk)

Build­ing up to an In­ter­nal Fam­ily Sys­tems model

Kaj_Sotala26 Jan 2019 12:25 UTC
263 points
86 comments28 min readLW link2 reviews

Be­ing the (Pareto) Best in the World

johnswentworth24 Jun 2019 18:36 UTC
398 points
57 comments3 min readLW link3 reviews

The Schel­ling Choice is “Rab­bit”, not “Stag”

Raemon8 Jun 2019 0:24 UTC
157 points
52 comments12 min readLW link3 reviews

Liter­a­ture Re­view: Distributed Teams

Elizabeth16 Apr 2019 1:19 UTC
106 points
37 comments6 min readLW link1 review

Gears-Level Models are Cap­i­tal Investments

johnswentworth22 Nov 2019 22:41 UTC
170 points
28 comments7 min readLW link1 review

Evolu­tion of Modularity

johnswentworth14 Nov 2019 6:49 UTC
173 points
12 comments2 min readLW link1 review

You Get About Five Words

Raemon12 Mar 2019 20:30 UTC
197 points
76 comments1 min readLW link6 reviews

Co­her­ent de­ci­sions im­ply con­sis­tent utilities

Eliezer Yudkowsky12 May 2019 21:33 UTC
147 points
81 comments26 min readLW link3 reviews

Align­ment Re­search Field Guide

abramdemski8 Mar 2019 19:57 UTC
263 points
9 comments17 min readLW link2 reviews

Fo­rum par­ti­ci­pa­tion as a re­search strategy

Wei Dai30 Jul 2019 18:09 UTC
147 points
44 comments3 min readLW link1 review

The Credit As­sign­ment Problem

abramdemski8 Nov 2019 2:50 UTC
97 points
40 comments17 min readLW link1 review

Asym­met­ric Justice

Zvi25 Apr 2019 16:00 UTC
223 points
101 comments5 min readLW link2 reviews
(thezvi.wordpress.com)

Un­con­scious Economics

jacobjacob27 Feb 2019 12:58 UTC
135 points
30 comments4 min readLW link3 reviews

Power Buys You Dis­tance From The Crime

Elizabeth2 Aug 2019 20:50 UTC
189 points
75 comments7 min readLW link1 review
(acesounderglass.com)

Seek­ing Power is Often Con­ver­gently In­stru­men­tal in MDPs

5 Dec 2019 2:33 UTC
161 points
39 comments17 min readLW link2 reviews
(arxiv.org)

Yes Re­quires the Pos­si­bil­ity of No

Scott Garrabrant17 May 2019 22:39 UTC
241 points
55 comments2 min readLW link2 reviews

Mis­takes with Con­ser­va­tion of Ex­pected Evidence

abramdemski8 Jun 2019 23:07 UTC
212 points
25 comments12 min readLW link1 review

Heads I Win, Tails?—Never Heard of Her; Or, Selec­tive Re­port­ing and the Tragedy of the Green Rationalists

Zack_M_Davis24 Sep 2019 4:12 UTC
296 points
40 comments8 min readLW link2 reviews
No comments.