Best of LessWrong

TagLast edit: 30 Apr 2024 3:09 UTC by habryka

The Best of LessWrong tag is applied to all posts which were voted highly enough in the annual LessWrong review to make it onto the Best of LessWrong page.

ARC’s first technical report: Eliciting Latent Knowledge

paulfchristiano, Mark Xu and Ajeya Cotra

14 Dec 2021 20:09 UTC

225 points

90 comments1 min readLW link 3 reviews

(docs.google.com)

Saving Time

Scott Garrabrant18 May 2021 20:11 UTC

156 points

20 comments4 min readLW link 1 review

Lars Doucet’s Georgism series on Astral Codex Ten

Sune4 Dec 2021 19:43 UTC

13 points

2 comments1 min readLW link 1 review

(astralcodexten.substack.com)

Worst-case thinking in AI alignment

Buck23 Dec 2021 1:29 UTC

162 points

18 comments6 min readLW link 2 reviews

The Point of Trade

Eliezer Yudkowsky22 Jun 2021 17:56 UTC

173 points

76 comments4 min readLW link 1 review

Specializing in Problems We Don’t Understand

johnswentworth10 Apr 2021 22:40 UTC

161 points

29 comments8 min readLW link 1 review

You are probably underestimating how good self-love can be

Charlie Rogers-Smith14 Nov 2021 0:41 UTC

151 points

19 comments12 min readLW link 1 review

Cup-Stacking Skills (or, Reflexive Involuntary Mental Motions)

[DEACTIVATED] Duncan Sabien11 Oct 2021 7:16 UTC

119 points

36 comments7 min readLW link 2 reviews

“PR” is corrosive; “reputation” is not.

AnnaSalamon14 Feb 2021 3:32 UTC

309 points

95 comments2 min readLW link 3 reviews

Nuclear war is unlikely to cause human extinction

Jeffrey Ladish7 Nov 2020 5:42 UTC

125 points

47 comments11 min readLW link 3 reviews

Can crimes be discussed literally?

Benquo22 Mar 2020 20:17 UTC

102 points

38 comments2 min readLW link 3 reviews

(benjaminrosshoffman.com)

The Road to Mazedom

Zvi18 Jan 2020 14:10 UTC

94 points

25 comments7 min readLW link 2 reviews

(thezvi.wordpress.com)

Notes from “Don’t Shoot the Dog”

juliawise2 Apr 2021 16:34 UTC

245 points

11 comments12 min readLW link 1 review

Strong Evidence is Common

Mark Xu13 Mar 2021 22:04 UTC

244 points

50 comments1 min readLW link 4 reviews

(markxu.com)

The date of AI Takeover is not the day the AI takes over

Daniel Kokotajlo22 Oct 2020 10:41 UTC

146 points

32 comments2 min readLW link 1 review

Inner Alignment: Explain like I’m 12 Edition

Rafael Harth1 Aug 2020 15:24 UTC

179 points

46 comments13 min readLW link 2 reviews

Frame Control

Aella27 Nov 2021 22:59 UTC

317 points

283 comments23 min readLW link 2 reviews

What Do GDP Growth Curves Really Mean?

johnswentworth7 Oct 2021 21:58 UTC

219 points

64 comments8 min readLW link 2 reviews

Anti-Aging: State of the Art

JackH31 Dec 2020 19:07 UTC

371 points

176 comments11 min readLW link 1 review

Covid-19: My Current Model

Zvi31 May 2020 17:40 UTC

188 points

74 comments19 min readLW link 1 review

(thezvi.wordpress.com)

Shoulder Advisors 101

[DEACTIVATED] Duncan Sabien9 Oct 2021 5:30 UTC

193 points

124 comments14 min readLW link 2 reviews

Utility Maximization = Description Length Minimization

johnswentworth18 Feb 2021 18:04 UTC

210 points

44 comments5 min readLW link

A non-mystical explanation of “no-self” (three characteristics series)

Kaj_Sotala8 May 2020 10:37 UTC

106 points

65 comments20 min readLW link 1 review

What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)

Andrew_Critch31 Mar 2021 23:50 UTC

278 points

65 comments22 min readLW link 1 review

Elephant seal 2

KatjaGrace2 Feb 2021 9:40 UTC

57 points

5 comments1 min readLW link 2 reviews

(worldspiritsockpuppet.com)

The Pointers Problem: Human Values Are A Function Of Humans’ Latent Variables

johnswentworth18 Nov 2020 17:47 UTC

125 points

49 comments11 min readLW link 2 reviews

Studies On Slack

Scott Alexander13 May 2020 5:00 UTC

159 points

34 comments24 min readLW link 1 review

(slatestarcodex.com)

The Death of Behavioral Economics

habryka22 Aug 2021 22:39 UTC

155 points

24 comments1 min readLW link 2 reviews

(www.thebehavioralscientist.com)

Feature Selection

Zack_M_Davis1 Nov 2021 0:22 UTC

317 points

24 comments16 min readLW link 1 review

Draft report on AI timelines

Ajeya Cotra18 Sep 2020 23:47 UTC

214 points

56 comments1 min readLW link 1 review

Search versus design

Alex Flint16 Aug 2020 16:53 UTC

101 points

40 comments36 min readLW link 1 review

Ruling Out Everything Else

[DEACTIVATED] Duncan Sabien27 Oct 2021 21:50 UTC

191 points

51 comments21 min readLW link 2 reviews

There’s no such thing as a tree (phylogenetically)

eukaryote3 May 2021 3:47 UTC

334 points

59 comments8 min readLW link 2 reviews

(eukaryotewritesblog.com)

Simulacrum 3 As Stag-Hunt Strategy

johnswentworth26 Jan 2021 19:40 UTC

183 points

37 comments4 min readLW link 3 reviews

Lies, Damn Lies, and Fabricated Options

[DEACTIVATED] Duncan Sabien17 Oct 2021 2:47 UTC

288 points

132 comments14 min readLW link 2 reviews

Catching the Spark

LoganStrohl30 Jan 2021 23:23 UTC

116 points

21 comments36 min readLW link 1 review

Is Success the Enemy of Freedom? (Full)

alkjash26 Oct 2020 20:25 UTC

292 points

68 comments9 min readLW link 1 review

(radimentary.wordpress.com)

What cognitive biases feel like from the inside

chaosmage3 Jan 2020 14:24 UTC

249 points

32 comments4 min readLW link

Swiss Political System: More than You ever Wanted to Know (I.)

Martin Sustrik19 Jul 2020 1:11 UTC

171 points

39 comments24 min readLW link 2 reviews

My research methodology

paulfchristiano22 Mar 2021 21:20 UTC

159 points

38 comments16 min readLW link 1 review

(ai-alignment.com)

The Plan

johnswentworth10 Dec 2021 23:41 UTC

254 points

78 comments14 min readLW link 1 review

Trapped Priors As A Basic Problem Of Rationality

Scott Alexander12 Mar 2021 20:02 UTC

143 points

32 comments14 min readLW link 3 reviews

The Alignment Problem: Machine Learning and Human Values

Rohin Shah6 Oct 2020 17:41 UTC

120 points

7 comments6 min readLW link 1 review

(www.amazon.com)

Introduction to Cartesian Frames

Scott Garrabrant22 Oct 2020 13:00 UTC

153 points

32 comments22 min readLW link 1 review

Fun with +12 OOMs of Compute

Daniel Kokotajlo1 Mar 2021 13:30 UTC

224 points

86 comments12 min readLW link 2 reviews

AGI safety from first principles: Introduction

Richard_Ngo28 Sep 2020 19:53 UTC

122 points

18 comments2 min readLW link 1 review

An overview of 11 proposals for building safe advanced AI

evhub29 May 2020 20:38 UTC

211 points

36 comments38 min readLW link 2 reviews

Finite Factored Sets

Scott Garrabrant23 May 2021 20:52 UTC

146 points

95 comments24 min readLW link 1 review

Your Cheerful Price

Eliezer Yudkowsky13 Feb 2021 5:41 UTC

263 points

82 comments17 min readLW link 6 reviews

Introduction To The Infra-Bayesianism Sequence

Diffractor and Vanessa Kosoy

26 Aug 2020 20:31 UTC

109 points

62 comments14 min readLW link 2 reviews

Jean Monnet: The Guerilla Bureaucrat

Martin Sustrik20 Mar 2021 10:37 UTC

177 points

25 comments18 min readLW link 1 review

Cryonics signup guide #1: Overview

mingyuan6 Jan 2021 0:25 UTC

150 points

34 comments6 min readLW link 1 review

The Solomonoff Prior is Malign

Mark Xu14 Oct 2020 1:33 UTC

169 points

52 comments16 min readLW link 3 reviews

Simulacra Levels and their Interactions

Zvi15 Jun 2020 13:10 UTC

198 points

50 comments17 min readLW link 1 review

(thezvi.wordpress.com)

Grokking the Intentional Stance

jbkjr31 Aug 2021 15:49 UTC

45 points

22 comments20 min readLW link 1 review

The Treacherous Path to Rationality

Jacob Falkovich9 Oct 2020 15:34 UTC

205 points

115 comments11 min readLW link 1 review

The ground of optimization

Alex Flint20 Jun 2020 0:38 UTC

242 points

80 comments27 min readLW link 1 review

Reality-Revealing and Reality-Masking Puzzles

AnnaSalamon16 Jan 2020 16:15 UTC

259 points

57 comments13 min readLW link 1 review

What 2026 looks like

Daniel Kokotajlo6 Aug 2021 16:14 UTC

477 points

153 comments16 min readLW link 1 review

EfficientZero: How It Works

1a3orn26 Nov 2021 15:17 UTC

297 points

50 comments29 min readLW link 1 review

How factories were made safe

jasoncrawford12 Sep 2021 19:58 UTC

182 points

46 comments18 min readLW link 1 review

(rootsofprogress.org)

Ngo and Yudkowsky on alignment difficulty

Eliezer Yudkowsky and Richard_Ngo

15 Nov 2021 20:31 UTC

250 points

148 comments99 min readLW link 1 review

What Money Cannot Buy

johnswentworth1 Feb 2020 20:11 UTC

323 points

50 comments4 min readLW link 1 review

Leaky Delegation: You are not a Commodity

Darmani25 Jan 2021 2:04 UTC

297 points

34 comments15 min readLW link 1 review

Seven Years of Spaced Repetition Software in the Classroom

tanagrabeast4 Mar 2021 2:42 UTC

266 points

38 comments34 min readLW link 1 review

Some AI research areas and their relevance to existential safety

Andrew_Critch19 Nov 2020 3:18 UTC

204 points

37 comments50 min readLW link 2 reviews

Why haven’t we celebrated any major achievements lately?

jasoncrawford17 Aug 2020 20:34 UTC

194 points

69 comments12 min readLW link 2 reviews

(rootsofprogress.org)

Coordination as a Scarce Resource

johnswentworth25 Jan 2020 23:32 UTC

238 points

22 comments4 min readLW link 2 reviews

Transportation as a Constraint

johnswentworth6 Apr 2020 4:58 UTC

180 points

33 comments6 min readLW link 1 review

Self-Integrity and the Drowning Child

Eliezer Yudkowsky24 Oct 2021 20:57 UTC

331 points

85 comments5 min readLW link 1 review

The Rationalists of the 1950s (and before) also called themselves “Rationalists”

Owain_Evans28 Nov 2021 20:17 UTC

188 points

32 comments3 min readLW link 1 review

Split and Commit

[DEACTIVATED] Duncan Sabien21 Nov 2021 6:27 UTC

182 points

34 comments7 min readLW link 1 review

Comments on Carlsmith’s “Is power-seeking AI an existential risk?”

So8res13 Nov 2021 4:29 UTC

138 points

15 comments40 min readLW link 1 review

The First Sample Gives the Most Information

Mark Xu24 Dec 2020 20:39 UTC

133 points

16 comments1 min readLW link 1 review

(markxu.com)

My computational framework for the brain

Steven Byrnes14 Sep 2020 14:19 UTC

150 points

26 comments13 min readLW link 1 review

Most Prisoner’s Dilemmas are Stag Hunts; Most Stag Hunts are Schelling Problems

abramdemski14 Sep 2020 22:13 UTC

177 points

36 comments10 min readLW link 3 reviews

Credibility of the CDC on SARS-CoV-2

Elizabeth and jimrandomh

7 Mar 2020 2:00 UTC

226 points

119 comments6 min readLW link 1 review

How uniform is the neocortex?

zhukeepa4 May 2020 2:16 UTC

79 points

23 comments11 min readLW link 1 review

Highlights from The Autobiography of Andrew Carnegie

jasoncrawford8 Apr 2021 22:03 UTC

92 points

9 comments19 min readLW link 1 review

(rootsofprogress.org)

Why Neural Networks Generalise, and Why They Are (Kind of) Bayesian

Joar Skalse29 Dec 2020 13:33 UTC

74 points

58 comments1 min readLW link 1 review

Against GDP as a metric for timelines and takeoff speeds

Daniel Kokotajlo29 Dec 2020 17:42 UTC

140 points

19 comments14 min readLW link 1 review

microCOVID.org: A tool to estimate COVID risk from common activities

catherio29 Aug 2020 23:01 UTC

169 points

36 comments1 min readLW link 1 review

(microcovid.org)

“Can you keep this confidential? How do you know?”

Raemon21 Jul 2020 0:33 UTC

161 points

42 comments3 min readLW link 2 reviews

Seeing the Smoke

Jacob Falkovich28 Feb 2020 18:26 UTC

198 points

29 comments5 min readLW link 1 review

Interfaces as a Scarce Resource

johnswentworth5 Mar 2020 18:20 UTC

188 points

15 comments12 min readLW link 1 review

All Possible Views About Humanity’s Future Are Wild

HoldenKarnofsky3 Sep 2021 20:19 UTC

146 points

37 comments8 min readLW link 1 review

This Can’t Go On

HoldenKarnofsky18 Sep 2021 23:50 UTC

74 points

55 comments7 min readLW link 2 reviews

Taboo “Outside View”

Daniel Kokotajlo17 Jun 2021 9:36 UTC

348 points

33 comments8 min readLW link 3 reviews

Another (outer) alignment failure story

paulfchristiano7 Apr 2021 20:12 UTC

244 points

38 comments12 min readLW link 1 review

To listen well, get curious

benkuhn13 Dec 2020 0:20 UTC

356 points

37 comments4 min readLW link 1 review

(www.benkuhn.net)

Alignment By Default

johnswentworth12 Aug 2020 18:54 UTC

174 points

94 comments11 min readLW link 2 reviews

Cortés, Pizarro, and Afonso as Precedents for Takeover

Daniel Kokotajlo1 Mar 2020 3:49 UTC

177 points

78 comments11 min readLW link 1 review

Selection Theorems: A Program For Understanding Agents

johnswentworth28 Sep 2021 5:03 UTC

123 points

28 comments6 min readLW link 2 reviews

When Money Is Abundant, Knowledge Is The Real Wealth

johnswentworth3 Nov 2020 17:34 UTC

321 points

61 comments5 min readLW link 3 reviews

CFAR Participant Handbook now available to all

[DEACTIVATED] Duncan Sabien3 Jan 2020 15:43 UTC

248 points

40 comments1 min readLW link 2 reviews

An Orthodox Case Against Utility Functions

abramdemski7 Apr 2020 19:18 UTC

152 points

65 comments8 min readLW link 2 reviews

How To Write Quickly While Maintaining Epistemic Rigor

johnswentworth28 Aug 2021 17:52 UTC

435 points

38 comments4 min readLW link 3 reviews

Motive Ambiguity

Zvi15 Dec 2020 18:10 UTC

172 points

58 comments4 min readLW link 2 reviews

(thezvi.wordpress.com)

Inaccessible information

paulfchristiano3 Jun 2020 5:10 UTC

83 points

17 comments14 min readLW link 2 reviews

(ai-alignment.com)

Discontinuous progress in history: an update

KatjaGrace14 Apr 2020 0:00 UTC

189 points

25 comments31 min readLW link 1 review

(aiimpacts.org)

Frequent arguments about alignment

John Schulman23 Jun 2021 0:46 UTC

103 points

17 comments5 min readLW link

Pain is not the unit of Effort

alkjash24 Nov 2020 20:00 UTC

526 points

89 comments5 min readLW link 2 reviews

(radimentary.wordpress.com)

Radical Probabilism

abramdemski18 Aug 2020 21:14 UTC

176 points

47 comments35 min readLW link 1 review

Slack Has Positive Externalities For Groups

johnswentworth29 Jul 2021 15:03 UTC

95 points

11 comments5 min readLW link 2 reviews

Science in a High-Dimensional World

johnswentworth8 Jan 2021 17:52 UTC

286 points

53 comments7 min readLW link 1 review

The Felt Sense: What, Why and How

Kaj_Sotala5 Oct 2020 15:57 UTC

152 points

23 comments14 min readLW link 1 review

Choosing the Zero Point

orthonormal6 Apr 2020 23:44 UTC

170 points

24 comments3 min readLW link 2 reviews

Rationalism before the Sequences

Eric Raymond30 Mar 2021 14:04 UTC

582 points

81 comments11 min readLW link 2 reviews

Making Vaccine

johnswentworth3 Feb 2021 20:24 UTC

574 points

249 comments6 min readLW link 3 reviews

A Sketch of Good Communication

Ben Pace31 Mar 2018 22:48 UTC

200 points

35 comments3 min readLW link 1 review

Local Validity as a Key to Sanity and Civilization

Eliezer Yudkowsky7 Apr 2018 4:25 UTC

195 points

67 comments13 min readLW link 5 reviews

The Loudest Alarm Is Probably False

orthonormal2 Jan 2018 16:38 UTC

173 points

28 comments2 min readLW link 1 review

Varieties Of Argumentative Experience

Scott Alexander8 May 2018 8:20 UTC

96 points

11 comments18 min readLW link 2 reviews

(slatestarcodex.com)

Babble

alkjash10 Jan 2018 21:56 UTC

201 points

32 comments5 min readLW link 2 reviews

(radimentary.wordpress.com)

Naming the Nameless

sarahconstantin22 Mar 2018 0:35 UTC

122 points

43 comments13 min readLW link 3 reviews

Toolbox-thinking and Law-thinking

Eliezer Yudkowsky31 May 2018 21:28 UTC

165 points

49 comments12 min readLW link

Prune

alkjash12 Jan 2018 22:50 UTC

69 points

10 comments4 min readLW link

(radimentary.wordpress.com)

Towards a New Impact Measure

TurnTrout18 Sep 2018 17:21 UTC

100 points

159 comments33 min readLW link 2 reviews

Being a Robust Agent

Raemon18 Oct 2018 7:00 UTC

149 points

32 comments7 min readLW link 2 reviews

Noticing the Taste of Lotus

Valentine27 Apr 2018 20:05 UTC

204 points

81 comments3 min readLW link 3 reviews

The Tails Coming Apart As Metaphor For Life

Scott Alexander25 Sep 2018 19:10 UTC

157 points

38 comments7 min readLW link 4 reviews

(slatestarcodex.com)

Meta-Honesty: Firming Up Honesty Around Its Edge-Cases

Eliezer Yudkowsky29 May 2018 0:59 UTC

134 points

152 comments27 min readLW link 4 reviews

My attempt to explain Looking, insight meditation, and enlightenment in non-mysterious terms

Kaj_Sotala8 Mar 2018 7:37 UTC

223 points

135 comments17 min readLW link 2 reviews

Anti-social Punishment

Martin Sustrik27 Sep 2018 7:08 UTC

304 points

66 comments6 min readLW link 3 reviews

The Costly Coordination Mechanism of Common Knowledge

Ben Pace15 Mar 2018 20:20 UTC

196 points

31 comments19 min readLW link 2 reviews

The Intelligent Social Web

Valentine22 Feb 2018 18:55 UTC

228 points

112 comments12 min readLW link 2 reviews

Prediction Markets: When Do They Work?

Zvi26 Jul 2018 12:30 UTC

162 points

17 comments10 min readLW link

(thezvi.wordpress.com)

Spaghetti Towers

eukaryote22 Dec 2018 5:29 UTC

200 points

36 comments3 min readLW link 1 review

(eukaryotewritesblog.com)

On the Loss and Preservation of Knowledge

Samo Burja8 Mar 2018 18:40 UTC

71 points

21 comments10 min readLW link

(medium.com)

A voting theory primer for rationalists

Jameson Quinn12 Apr 2018 15:15 UTC

233 points

98 comments17 min readLW link 2 reviews

The Pavlov Strategy

sarahconstantin20 Dec 2018 16:20 UTC

268 points

14 comments4 min readLW link

(srconstantin.wordpress.com)

Inadequate Equilibria vs. Governance of the Commons

Martin Sustrik25 May 2018 13:17 UTC

184 points

17 comments14 min readLW link 2 reviews

Is Science Slowing Down?

Scott Alexander27 Nov 2018 3:30 UTC

126 points

77 comments9 min readLW link 1 review

(slatestarcodex.com)

Research: Rescuers during the Holocaust

Martin Sustrik30 Apr 2018 6:15 UTC

96 points

14 comments9 min readLW link 1 review

An Untrollable Mathematician

abramdemski23 Jan 2018 18:46 UTC

23 points

1 comment3 min readLW link

Why did everything take so long?

KatjaGrace29 Dec 2017 1:00 UTC

34 points

17 comments1 min readLW link

(meteuphoric.wordpress.com)

Is Clickbait Destroying Our General Intelligence?

Eliezer Yudkowsky16 Nov 2018 23:06 UTC

190 points

65 comments5 min readLW link 2 reviews

[Question] What makes people intellectually active?

abramdemski29 Dec 2018 22:29 UTC

120 points

72 comments1 min readLW link

Open question: are minimal circuits daemon-free?

paulfchristiano5 May 2018 22:40 UTC

83 points

70 comments2 min readLW link 1 review

Beyond Astronomical Waste

Wei Dai7 Jun 2018 21:04 UTC

132 points

41 comments3 min readLW link

Historical mathematicians exhibit a birth order effect too

Eli Tyre21 Aug 2018 1:52 UTC

141 points

19 comments6 min readLW link 2 reviews

Birth order effect found in Nobel Laureates in Physics

Bucky4 Sep 2018 12:17 UTC

61 points

25 comments5 min readLW link 1 review

Arguments about fast takeoff

paulfchristiano25 Feb 2018 4:53 UTC

92 points

66 comments2 min readLW link 1 review

(sideways-view.com)

Specification gaming examples in AI

Vika3 Apr 2018 12:30 UTC

45 points

9 comments1 min readLW link 2 reviews

The Rocket Alignment Problem

Eliezer Yudkowsky4 Oct 2018 0:38 UTC

217 points

41 comments15 min readLW link 2 reviews

Embedded Agents

abramdemski and Scott Garrabrant

29 Oct 2018 19:53 UTC

222 points

41 comments1 min readLW link 2 reviews

Paul’s research agenda FAQ

zhukeepa1 Jul 2018 6:25 UTC

126 points

74 comments19 min readLW link 1 review

Challenges to Christiano’s capability amplification proposal

Eliezer Yudkowsky19 May 2018 18:18 UTC

124 points

54 comments23 min readLW link 1 review

Robustness to Scale

Scott Garrabrant21 Feb 2018 22:55 UTC

129 points

23 comments2 min readLW link 1 review

Coherence arguments do not entail goal-directed behavior

Rohin Shah3 Dec 2018 3:26 UTC

129 points

69 comments7 min readLW link 3 reviews

Rule Thinkers In, Not Out

Scott Alexander27 Feb 2019 2:40 UTC

225 points

67 comments4 min readLW link 4 reviews

(slatestarcodex.com)

Gears vs Behavior

johnswentworth19 Sep 2019 6:50 UTC

112 points

13 comments7 min readLW link 1 review

Book Review: The Secret Of Our Success

Scott Alexander5 Jun 2019 6:50 UTC

158 points

19 comments25 min readLW link 2 reviews

(slatestarcodex.com)

Reason isn’t magic

Benquo18 Jun 2019 4:04 UTC

154 points

19 comments2 min readLW link 3 reviews

(benjaminrosshoffman.com)

“Other people are wrong” vs “I am right”

Buck22 Feb 2019 20:01 UTC

259 points

20 comments9 min readLW link 2 reviews

In My Culture

[DEACTIVATED] Duncan Sabien7 Mar 2019 7:22 UTC

66 points

59 comments1 min readLW link 2 reviews

(medium.com)

Chris Olah’s views on AGI safety

evhub1 Nov 2019 20:13 UTC

207 points

38 comments12 min readLW link 2 reviews

Understanding “Deep Double Descent”

evhub6 Dec 2019 0:00 UTC

149 points

51 comments5 min readLW link 4 reviews

How to Ignore Your Emotions (while also thinking you’re awesome at emotions)

Hazard31 Jul 2019 13:34 UTC

356 points

74 comments4 min readLW link 4 reviews

Paper-Reading for Gears

johnswentworth4 Dec 2019 21:02 UTC

163 points

6 comments4 min readLW link 1 review

Book summary: Unlocking the Emotional Brain

Kaj_Sotala8 Oct 2019 19:11 UTC

319 points

48 comments21 min readLW link 3 reviews

Noticing Frame Differences

Raemon30 Sep 2019 1:24 UTC

212 points

39 comments9 min readLW link 2 reviews

Propagating Facts into Aesthetics

Raemon19 Dec 2019 4:09 UTC

120 points

37 comments11 min readLW link 1 review

Do you fear the rock or the hard place?

Ruby20 Jul 2019 22:01 UTC

73 points

10 comments5 min readLW link 3 reviews

Mental Mountains

Scott Alexander27 Nov 2019 5:30 UTC

146 points

14 comments15 min readLW link 1 review

(slatestarcodex.com)

Steelmanning Divination

Vaniver5 Jun 2019 22:53 UTC

200 points

48 comments6 min readLW link 2 reviews

Book Review: Design Principles of Biological Circuits

johnswentworth5 Nov 2019 6:49 UTC

209 points

24 comments12 min readLW link 1 review

Reframing Superintelligence: Comprehensive AI Services as General Intelligence

Rohin Shah8 Jan 2019 7:12 UTC

121 points

77 comments5 min readLW link 2 reviews

(www.fhi.ox.ac.uk)

Building up to an Internal Family Systems model

Kaj_Sotala26 Jan 2019 12:25 UTC

267 points

86 comments28 min readLW link 2 reviews

Being the (Pareto) Best in the World

johnswentworth24 Jun 2019 18:36 UTC

413 points

57 comments3 min readLW link 3 reviews

The Schelling Choice is “Rabbit”, not “Stag”

Raemon8 Jun 2019 0:24 UTC

160 points

52 comments12 min readLW link 3 reviews

Literature Review: Distributed Teams

Elizabeth16 Apr 2019 1:19 UTC

106 points

37 comments6 min readLW link 1 review

Gears-Level Models are Capital Investments

johnswentworth22 Nov 2019 22:41 UTC

172 points

28 comments7 min readLW link 1 review

Evolution of Modularity

johnswentworth14 Nov 2019 6:49 UTC

176 points

12 comments2 min readLW link 1 review

You Get About Five Words

Raemon12 Mar 2019 20:30 UTC

214 points

80 comments1 min readLW link 6 reviews

Coherent decisions imply consistent utilities

Eliezer Yudkowsky12 May 2019 21:33 UTC

149 points

81 comments26 min readLW link 3 reviews

Alignment Research Field Guide

abramdemski8 Mar 2019 19:57 UTC

266 points

9 comments17 min readLW link 2 reviews

Forum participation as a research strategy

Wei Dai30 Jul 2019 18:09 UTC

151 points

45 comments3 min readLW link 1 review

The Credit Assignment Problem

abramdemski8 Nov 2019 2:50 UTC

98 points

40 comments17 min readLW link 1 review

Asymmetric Justice

Zvi25 Apr 2019 16:00 UTC

231 points

104 comments5 min readLW link 2 reviews

(thezvi.wordpress.com)

Unconscious Economics

jacobjacob27 Feb 2019 12:58 UTC

137 points

30 comments4 min readLW link 3 reviews

Power Buys You Distance From The Crime

Elizabeth2 Aug 2019 20:50 UTC

203 points

75 comments7 min readLW link 1 review

(acesounderglass.com)

Seeking Power is Often Convergently Instrumental in MDPs

TurnTrout and Logan Riggs

5 Dec 2019 2:33 UTC

162 points

39 comments17 min readLW link 2 reviews

(arxiv.org)

Yes Requires the Possibility of No

Scott Garrabrant17 May 2019 22:39 UTC

270 points

55 comments2 min readLW link 2 reviews

Mistakes with Conservation of Expected Evidence

abramdemski8 Jun 2019 23:07 UTC

222 points

25 comments12 min readLW link 1 review

Heads I Win, Tails?—Never Heard of Her; Or, Selective Reporting and the Tragedy of the Green Rationalists

Zack_M_Davis24 Sep 2019 4:12 UTC

299 points

40 comments8 min readLW link 2 reviews

No comments.