Best of LessWrong

Tag. Last edit: 30 Apr 2024 3:09 UTC by habryka

The Best of LessWrong tag is applied to all posts which were voted highly enough in the annual LessWrong review to make it onto the Best of LessWrong page.

ARC’s first technical report: Eliciting Latent Knowledge

paulfchristiano, Mark Xu and Ajeya Cotra

14 Dec 2021 20:09 UTC

225 points

90 comments1 min readLW link 3 reviews

(docs.google.com)

Saving Time

Scott Garrabrant18 May 2021 20:11 UTC

156 points

20 comments4 min readLW link 1 review

Lars Doucet’s Georgism series on Astral Codex Ten

Sune4 Dec 2021 19:43 UTC

13 points

2 comments1 min readLW link 1 review

(astralcodexten.substack.com)

Worst-case thinking in AI alignment

Buck23 Dec 2021 1:29 UTC

162 points

18 comments6 min readLW link 2 reviews

The Point of Trade

Eliezer Yudkowsky22 Jun 2021 17:56 UTC

171 points

76 comments4 min readLW link 1 review

Specializing in Problems We Don’t Understand

johnswentworth10 Apr 2021 22:40 UTC

159 points

29 comments8 min readLW link 1 review

You are probably underestimating how good self-love can be

Charlie Rogers-Smith14 Nov 2021 0:41 UTC

145 points

19 comments12 min readLW link 1 review

Cup-Stacking Skills (or, Reflexive Involuntary Mental Motions)

[DEACTIVATED] Duncan Sabien11 Oct 2021 7:16 UTC

117 points

36 comments7 min readLW link 2 reviews

“PR” is corrosive; “reputation” is not.

AnnaSalamon14 Feb 2021 3:32 UTC

307 points

93 comments2 min readLW link 3 reviews

Nuclear war is unlikely to cause human extinction

Jeffrey Ladish7 Nov 2020 5:42 UTC

124 points

47 comments11 min readLW link 3 reviews

Can crimes be discussed literally?

Benquo22 Mar 2020 20:17 UTC

102 points

38 comments2 min readLW link 3 reviews

(benjaminrosshoffman.com)

The Road to Mazedom

Zvi18 Jan 2020 14:10 UTC

94 points

25 comments7 min readLW link 2 reviews

(thezvi.wordpress.com)

Notes from “Don’t Shoot the Dog”

juliawise2 Apr 2021 16:34 UTC

244 points

11 comments12 min readLW link 1 review

Strong Evidence is Common

Mark Xu13 Mar 2021 22:04 UTC

244 points

49 comments1 min readLW link 4 reviews

(markxu.com)

The date of AI Takeover is not the day the AI takes over

Daniel Kokotajlo22 Oct 2020 10:41 UTC

145 points

32 comments2 min readLW link 1 review

Inner Alignment: Explain like I’m 12 Edition

Rafael Harth1 Aug 2020 15:24 UTC

179 points

46 comments13 min readLW link 2 reviews

Frame Control

Aella27 Nov 2021 22:59 UTC

314 points

282 comments23 min readLW link 2 reviews

What Do GDP Growth Curves Really Mean?

johnswentworth7 Oct 2021 21:58 UTC

219 points

64 comments8 min readLW link 2 reviews

Anti-Aging: State of the Art

JackH31 Dec 2020 19:07 UTC

371 points

176 comments11 min readLW link 1 review

Covid-19: My Current Model

Zvi31 May 2020 17:40 UTC

188 points

74 comments19 min readLW link 1 review

(thezvi.wordpress.com)

Shoulder Advisors 101

[DEACTIVATED] Duncan Sabien9 Oct 2021 5:30 UTC

193 points

124 comments14 min readLW link 2 reviews

Utility Maximization = Description Length Minimization

johnswentworth18 Feb 2021 18:04 UTC

208 points

44 comments5 min readLW link

A non-mystical explanation of “no-self” (three characteristics series)

Kaj_Sotala8 May 2020 10:37 UTC

105 points

65 comments20 min readLW link 1 review

What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)

Andrew_Critch31 Mar 2021 23:50 UTC

272 points

64 comments22 min readLW link 1 review

Elephant seal 2

KatjaGrace2 Feb 2021 9:40 UTC

57 points

5 comments1 min readLW link 2 reviews

(worldspiritsockpuppet.com)

The Pointers Problem: Human Values Are A Function Of Humans’ Latent Variables

johnswentworth18 Nov 2020 17:47 UTC

124 points

49 comments11 min readLW link 2 reviews

Studies On Slack

Scott Alexander13 May 2020 5:00 UTC

151 points

34 comments24 min readLW link 1 review

(slatestarcodex.com)

The Death of Behavioral Economics

habryka22 Aug 2021 22:39 UTC

154 points

24 comments1 min readLW link 2 reviews

(www.thebehavioralscientist.com)

Feature Selection

Zack_M_Davis1 Nov 2021 0:22 UTC

315 points

24 comments16 min readLW link 1 review

Draft report on AI timelines

Ajeya Cotra18 Sep 2020 23:47 UTC

214 points

56 comments1 min readLW link 1 review

Search versus design

Alex Flint16 Aug 2020 16:53 UTC

100 points

40 comments36 min readLW link 1 review

Ruling Out Everything Else

[DEACTIVATED] Duncan Sabien27 Oct 2021 21:50 UTC

190 points

51 comments21 min readLW link 2 reviews

There’s no such thing as a tree (phylogenetically)

eukaryote3 May 2021 3:47 UTC

333 points

58 comments8 min readLW link 2 reviews

(eukaryotewritesblog.com)

Simulacrum 3 As Stag-Hunt Strategy

johnswentworth26 Jan 2021 19:40 UTC

179 points

37 comments4 min readLW link 3 reviews

Lies, Damn Lies, and Fabricated Options

[DEACTIVATED] Duncan Sabien17 Oct 2021 2:47 UTC

288 points

132 comments14 min readLW link 2 reviews

Catching the Spark

LoganStrohl30 Jan 2021 23:23 UTC

111 points

21 comments36 min readLW link 1 review

Is Success the Enemy of Freedom? (Full)

alkjash26 Oct 2020 20:25 UTC

291 points

68 comments9 min readLW link 1 review

(radimentary.wordpress.com)

What cognitive biases feel like from the inside

chaosmage3 Jan 2020 14:24 UTC

249 points

32 comments4 min readLW link

Swiss Political System: More than You ever Wanted to Know (I.)

Martin Sustrik19 Jul 2020 1:11 UTC

171 points

39 comments24 min readLW link 2 reviews

My research methodology

paulfchristiano22 Mar 2021 21:20 UTC

159 points

38 comments16 min readLW link 1 review

(ai-alignment.com)

The Plan

johnswentworth10 Dec 2021 23:41 UTC

254 points

78 comments14 min readLW link 1 review

Trapped Priors As A Basic Problem Of Rationality

Scott Alexander12 Mar 2021 20:02 UTC

141 points

32 comments14 min readLW link 3 reviews

The Alignment Problem: Machine Learning and Human Values

Rohin Shah6 Oct 2020 17:41 UTC

120 points

7 comments6 min readLW link 1 review

(www.amazon.com)

Introduction to Cartesian Frames

Scott Garrabrant22 Oct 2020 13:00 UTC

153 points

32 comments22 min readLW link 1 review

Fun with +12 OOMs of Compute

Daniel Kokotajlo1 Mar 2021 13:30 UTC

224 points

86 comments12 min readLW link 2 reviews

AGI safety from first principles: Introduction

Richard_Ngo28 Sep 2020 19:53 UTC

121 points

18 comments2 min readLW link 1 review

An overview of 11 proposals for building safe advanced AI

evhub29 May 2020 20:38 UTC

205 points

36 comments38 min readLW link 2 reviews

Finite Factored Sets

Scott Garrabrant23 May 2021 20:52 UTC

146 points

95 comments24 min readLW link 1 review

Your Cheerful Price

Eliezer Yudkowsky13 Feb 2021 5:41 UTC

262 points

82 comments17 min readLW link 6 reviews

Introduction To The Infra-Bayesianism Sequence

Diffractor and Vanessa Kosoy

26 Aug 2020 20:31 UTC

109 points

62 comments14 min readLW link 2 reviews

Jean Monnet: The Guerilla Bureaucrat

Martin Sustrik20 Mar 2021 10:37 UTC

175 points

25 comments18 min readLW link 1 review

Cryonics signup guide #1: Overview

mingyuan6 Jan 2021 0:25 UTC

150 points

33 comments6 min readLW link 1 review

The Solomonoff Prior is Malign

Mark Xu14 Oct 2020 1:33 UTC

168 points

52 comments16 min readLW link 3 reviews

Simulacra Levels and their Interactions

Zvi15 Jun 2020 13:10 UTC

197 points

50 comments17 min readLW link 1 review

(thezvi.wordpress.com)

Grokking the Intentional Stance

jbkjr31 Aug 2021 15:49 UTC

43 points

22 comments20 min readLW link 1 review

The Treacherous Path to Rationality

Jacob Falkovich9 Oct 2020 15:34 UTC

204 points

115 comments11 min readLW link 1 review

The ground of optimization

Alex Flint20 Jun 2020 0:38 UTC

245 points

80 comments27 min readLW link 1 review

Reality-Revealing and Reality-Masking Puzzles

AnnaSalamon16 Jan 2020 16:15 UTC

258 points

57 comments13 min readLW link 1 review

What 2026 looks like

Daniel Kokotajlo6 Aug 2021 16:14 UTC

473 points

150 comments16 min readLW link 1 review

EfficientZero: How It Works

1a3orn26 Nov 2021 15:17 UTC

292 points

50 comments29 min readLW link 1 review

How factories were made safe

jasoncrawford12 Sep 2021 19:58 UTC

181 points

46 comments18 min readLW link 1 review

(rootsofprogress.org)

Ngo and Yudkowsky on alignment difficulty

Eliezer Yudkowsky and Richard_Ngo

15 Nov 2021 20:31 UTC

250 points

148 comments99 min readLW link 1 review

What Money Cannot Buy

johnswentworth1 Feb 2020 20:11 UTC

318 points

49 comments4 min readLW link 1 review

Leaky Delegation: You are not a Commodity

Darmani25 Jan 2021 2:04 UTC

297 points

34 comments15 min readLW link 1 review

Seven Years of Spaced Repetition Software in the Classroom

tanagrabeast4 Mar 2021 2:42 UTC

265 points

38 comments34 min readLW link 1 review

Some AI research areas and their relevance to existential safety

Andrew_Critch19 Nov 2020 3:18 UTC

204 points

37 comments50 min readLW link 2 reviews

Why haven’t we celebrated any major achievements lately?

jasoncrawford17 Aug 2020 20:34 UTC

194 points

69 comments12 min readLW link 2 reviews

(rootsofprogress.org)

Coordination as a Scarce Resource

johnswentworth25 Jan 2020 23:32 UTC

231 points

22 comments4 min readLW link 2 reviews

Transportation as a Constraint

johnswentworth6 Apr 2020 4:58 UTC

177 points

32 comments6 min readLW link 1 review

Self-Integrity and the Drowning Child

Eliezer Yudkowsky24 Oct 2021 20:57 UTC

329 points

85 comments5 min readLW link 1 review

The Rationalists of the 1950s (and before) also called themselves “Rationalists”

Owain_Evans28 Nov 2021 20:17 UTC

187 points

30 comments3 min readLW link 1 review

Split and Commit

[DEACTIVATED] Duncan Sabien21 Nov 2021 6:27 UTC

178 points

33 comments7 min readLW link 1 review

Comments on Carlsmith’s “Is power-seeking AI an existential risk?”

So8res13 Nov 2021 4:29 UTC

138 points

14 comments40 min readLW link 1 review

The First Sample Gives the Most Information

Mark Xu24 Dec 2020 20:39 UTC

132 points

16 comments1 min readLW link 1 review

(markxu.com)

My computational framework for the brain

Steven Byrnes14 Sep 2020 14:19 UTC

150 points

26 comments13 min readLW link 1 review

Most Prisoner’s Dilemmas are Stag Hunts; Most Stag Hunts are Schelling Problems

abramdemski14 Sep 2020 22:13 UTC

177 points

36 comments10 min readLW link 3 reviews

Credibility of the CDC on SARS-CoV-2

Elizabeth and jimrandomh

7 Mar 2020 2:00 UTC

226 points

119 comments6 min readLW link 1 review

How uniform is the neocortex?

zhukeepa4 May 2020 2:16 UTC

79 points

23 comments11 min readLW link 1 review

Highlights from The Autobiography of Andrew Carnegie

jasoncrawford8 Apr 2021 22:03 UTC

92 points

9 comments19 min readLW link 1 review

(rootsofprogress.org)

Why Neural Networks Generalise, and Why They Are (Kind of) Bayesian

Joar Skalse29 Dec 2020 13:33 UTC

74 points

58 comments1 min readLW link 1 review

Against GDP as a metric for timelines and takeoff speeds

Daniel Kokotajlo29 Dec 2020 17:42 UTC

140 points

19 comments14 min readLW link 1 review

microCOVID.org: A tool to estimate COVID risk from common activities

catherio29 Aug 2020 23:01 UTC

169 points

36 comments1 min readLW link 1 review

(microcovid.org)

“Can you keep this confidential? How do you know?”

Raemon21 Jul 2020 0:33 UTC

159 points

41 comments3 min readLW link 2 reviews

Seeing the Smoke

Jacob Falkovich28 Feb 2020 18:26 UTC

198 points

29 comments5 min readLW link 1 review

Interfaces as a Scarce Resource

johnswentworth5 Mar 2020 18:20 UTC

187 points

15 comments12 min readLW link 1 review

All Possible Views About Humanity’s Future Are Wild

HoldenKarnofsky3 Sep 2021 20:19 UTC

146 points

37 comments8 min readLW link 1 review

This Can’t Go On

HoldenKarnofsky18 Sep 2021 23:50 UTC

73 points

55 comments7 min readLW link 2 reviews

Taboo “Outside View”

Daniel Kokotajlo17 Jun 2021 9:36 UTC

348 points

33 comments8 min readLW link 3 reviews

Another (outer) alignment failure story

paulfchristiano7 Apr 2021 20:12 UTC

241 points

38 comments12 min readLW link 1 review

To listen well, get curious

benkuhn13 Dec 2020 0:20 UTC

352 points

37 comments4 min readLW link 1 review

(www.benkuhn.net)

Alignment By Default

johnswentworth12 Aug 2020 18:54 UTC

173 points

94 comments11 min readLW link 2 reviews

Cortés, Pizarro, and Afonso as Precedents for Takeover

Daniel Kokotajlo1 Mar 2020 3:49 UTC

168 points

78 comments11 min readLW link 1 review

Selection Theorems: A Program For Understanding Agents

johnswentworth28 Sep 2021 5:03 UTC

123 points

28 comments6 min readLW link 2 reviews

When Money Is Abundant, Knowledge Is The Real Wealth

johnswentworth3 Nov 2020 17:34 UTC

317 points

61 comments5 min readLW link 3 reviews

CFAR Participant Handbook now available to all

[DEACTIVATED] Duncan Sabien3 Jan 2020 15:43 UTC

248 points

40 comments1 min readLW link 2 reviews

An Orthodox Case Against Utility Functions

abramdemski7 Apr 2020 19:18 UTC

152 points

65 comments8 min readLW link 2 reviews

How To Write Quickly While Maintaining Epistemic Rigor

johnswentworth28 Aug 2021 17:52 UTC

429 points

38 comments4 min readLW link 3 reviews

Motive Ambiguity

Zvi15 Dec 2020 18:10 UTC

172 points

58 comments4 min readLW link 2 reviews

(thezvi.wordpress.com)

Inaccessible information

paulfchristiano3 Jun 2020 5:10 UTC

83 points

17 comments14 min readLW link 2 reviews

(ai-alignment.com)

Discontinuous progress in history: an update

KatjaGrace14 Apr 2020 0:00 UTC

186 points

25 comments31 min readLW link 1 review

(aiimpacts.org)

Frequent arguments about alignment

John Schulman23 Jun 2021 0:46 UTC

99 points

17 comments5 min readLW link

Pain is not the unit of Effort

alkjash24 Nov 2020 20:00 UTC

517 points

89 comments5 min readLW link 2 reviews

(radimentary.wordpress.com)

Radical Probabilism

abramdemski18 Aug 2020 21:14 UTC

176 points

47 comments35 min readLW link 1 review

Slack Has Positive Externalities For Groups

johnswentworth29 Jul 2021 15:03 UTC

90 points

11 comments5 min readLW link 2 reviews

Science in a High-Dimensional World

johnswentworth8 Jan 2021 17:52 UTC

285 points

53 comments7 min readLW link 1 review

The Felt Sense: What, Why and How

Kaj_Sotala5 Oct 2020 15:57 UTC

149 points

23 comments14 min readLW link 1 review

Choosing the Zero Point

orthonormal6 Apr 2020 23:44 UTC

170 points

24 comments3 min readLW link 2 reviews

Rationalism before the Sequences

Eric Raymond30 Mar 2021 14:04 UTC

581 points

81 comments11 min readLW link 2 reviews

Making Vaccine

johnswentworth3 Feb 2021 20:24 UTC

574 points

249 comments6 min readLW link 3 reviews

A Sketch of Good Communication

Ben Pace31 Mar 2018 22:48 UTC

185 points

35 comments3 min readLW link 1 review

Local Validity as a Key to Sanity and Civilization

Eliezer Yudkowsky7 Apr 2018 4:25 UTC

193 points

67 comments13 min readLW link 5 reviews

The Loudest Alarm Is Probably False

orthonormal2 Jan 2018 16:38 UTC

171 points

28 comments2 min readLW link 1 review

Varieties Of Argumentative Experience

Scott Alexander8 May 2018 8:20 UTC

93 points

11 comments18 min readLW link 2 reviews

(slatestarcodex.com)

Babble

alkjash10 Jan 2018 21:56 UTC

195 points

32 comments5 min readLW link 2 reviews

(radimentary.wordpress.com)

Naming the Nameless

sarahconstantin22 Mar 2018 0:35 UTC

119 points

43 comments13 min readLW link 3 reviews

Toolbox-thinking and Law-thinking

Eliezer Yudkowsky31 May 2018 21:28 UTC

160 points

49 comments12 min readLW link

Prune

alkjash12 Jan 2018 22:50 UTC

68 points

10 comments4 min readLW link

(radimentary.wordpress.com)

Towards a New Impact Measure

TurnTrout18 Sep 2018 17:21 UTC

100 points

159 comments33 min readLW link 2 reviews

Being a Robust Agent

Raemon18 Oct 2018 7:00 UTC

145 points

32 comments7 min readLW link 2 reviews

Noticing the Taste of Lotus

Valentine27 Apr 2018 20:05 UTC

203 points

81 comments3 min readLW link 3 reviews

The Tails Coming Apart As Metaphor For Life

Scott Alexander25 Sep 2018 19:10 UTC

155 points

38 comments7 min readLW link 4 reviews

(slatestarcodex.com)

Meta-Honesty: Firming Up Honesty Around Its Edge-Cases

Eliezer Yudkowsky29 May 2018 0:59 UTC

134 points

152 comments27 min readLW link 4 reviews

My attempt to explain Looking, insight meditation, and enlightenment in non-mysterious terms

Kaj_Sotala8 Mar 2018 7:37 UTC

224 points

131 comments17 min readLW link 2 reviews

Anti-social Punishment

Martin Sustrik27 Sep 2018 7:08 UTC

296 points

66 comments6 min readLW link 3 reviews

The Costly Coordination Mechanism of Common Knowledge

Ben Pace15 Mar 2018 20:20 UTC

194 points

31 comments19 min readLW link 2 reviews

The Intelligent Social Web

Valentine22 Feb 2018 18:55 UTC

224 points

112 comments12 min readLW link 2 reviews

Prediction Markets: When Do They Work?

Zvi26 Jul 2018 12:30 UTC

162 points

17 comments10 min readLW link

(thezvi.wordpress.com)

Spaghetti Towers

eukaryote22 Dec 2018 5:29 UTC

187 points

28 comments3 min readLW link 1 review

(eukaryotewritesblog.com)

On the Loss and Preservation of Knowledge

Samo Burja8 Mar 2018 18:40 UTC

66 points

20 comments10 min readLW link

(medium.com)

A voting theory primer for rationalists

Jameson Quinn12 Apr 2018 15:15 UTC

229 points

98 comments17 min readLW link 2 reviews

The Pavlov Strategy

sarahconstantin20 Dec 2018 16:20 UTC

247 points

13 comments4 min readLW link

(srconstantin.wordpress.com)

Inadequate Equilibria vs. Governance of the Commons

Martin Sustrik25 May 2018 13:17 UTC

182 points

17 comments14 min readLW link 2 reviews

Is Science Slowing Down?

Scott Alexander27 Nov 2018 3:30 UTC

125 points

77 comments9 min readLW link 1 review

(slatestarcodex.com)

Research: Rescuers during the Holocaust

Martin Sustrik30 Apr 2018 6:15 UTC

88 points

10 comments9 min readLW link 1 review

An Untrollable Mathematician

abramdemski23 Jan 2018 18:46 UTC

23 points

1 comment3 min readLW link

Why did everything take so long?

KatjaGrace29 Dec 2017 1:00 UTC

33 points

17 comments1 min readLW link

(meteuphoric.wordpress.com)

Is Clickbait Destroying Our General Intelligence?

Eliezer Yudkowsky16 Nov 2018 23:06 UTC

189 points

61 comments5 min readLW link 2 reviews

[Question] What makes people intellectually active?

abramdemski29 Dec 2018 22:29 UTC

116 points

71 comments1 min readLW link

Open question: are minimal circuits daemon-free?

paulfchristiano5 May 2018 22:40 UTC

83 points

70 comments2 min readLW link 1 review

Beyond Astronomical Waste

Wei Dai7 Jun 2018 21:04 UTC

125 points

41 comments3 min readLW link

Historical mathematicians exhibit a birth order effect too

Eli Tyre21 Aug 2018 1:52 UTC

141 points

19 comments6 min readLW link 2 reviews

Birth order effect found in Nobel Laureates in Physics

Bucky4 Sep 2018 12:17 UTC

61 points

25 comments5 min readLW link 1 review

Arguments about fast takeoff

paulfchristiano25 Feb 2018 4:53 UTC

89 points

65 comments2 min readLW link 1 review

(sideways-view.com)

Specification gaming examples in AI

Vika3 Apr 2018 12:30 UTC

45 points

9 comments1 min readLW link 2 reviews

The Rocket Alignment Problem

Eliezer Yudkowsky4 Oct 2018 0:38 UTC

216 points

41 comments15 min readLW link 2 reviews

Embedded Agents

abramdemski and Scott Garrabrant

29 Oct 2018 19:53 UTC

221 points

41 comments1 min readLW link 2 reviews

Paul’s research agenda FAQ

zhukeepa1 Jul 2018 6:25 UTC

126 points

74 comments19 min readLW link 1 review

Challenges to Christiano’s capability amplification proposal

Eliezer Yudkowsky19 May 2018 18:18 UTC

124 points

54 comments23 min readLW link 1 review

Robustness to Scale

Scott Garrabrant21 Feb 2018 22:55 UTC

128 points

23 comments2 min readLW link 1 review

Coherence arguments do not entail goal-directed behavior

Rohin Shah3 Dec 2018 3:26 UTC

123 points

69 comments7 min readLW link 3 reviews

Rule Thinkers In, Not Out

Scott Alexander27 Feb 2019 2:40 UTC

221 points

67 comments4 min readLW link 4 reviews

(slatestarcodex.com)

Gears vs Behavior

johnswentworth19 Sep 2019 6:50 UTC

107 points

13 comments7 min readLW link 1 review

Book Review: The Secret Of Our Success

Scott Alexander5 Jun 2019 6:50 UTC

158 points

19 comments25 min readLW link 2 reviews

(slatestarcodex.com)

Reason isn’t magic

Benquo18 Jun 2019 4:04 UTC

152 points

19 comments2 min readLW link 3 reviews

(benjaminrosshoffman.com)

“Other people are wrong” vs “I am right”

Buck22 Feb 2019 20:01 UTC

246 points

20 comments9 min readLW link 2 reviews

In My Culture

[DEACTIVATED] Duncan Sabien7 Mar 2019 7:22 UTC

66 points

59 comments1 min readLW link 2 reviews

(medium.com)

Chris Olah’s views on AGI safety

evhub1 Nov 2019 20:13 UTC

206 points

38 comments12 min readLW link 2 reviews

Understanding “Deep Double Descent”

evhub6 Dec 2019 0:00 UTC

148 points

51 comments5 min readLW link 4 reviews

How to Ignore Your Emotions (while also thinking you’re awesome at emotions)

Hazard31 Jul 2019 13:34 UTC

352 points

74 comments4 min readLW link 4 reviews

Paper-Reading for Gears

johnswentworth4 Dec 2019 21:02 UTC

159 points

6 comments4 min readLW link 1 review

Book summary: Unlocking the Emotional Brain

Kaj_Sotala8 Oct 2019 19:11 UTC

317 points

48 comments21 min readLW link 3 reviews

Noticing Frame Differences

Raemon30 Sep 2019 1:24 UTC

208 points

39 comments9 min readLW link 2 reviews

Propagating Facts into Aesthetics

Raemon19 Dec 2019 4:09 UTC

110 points

35 comments11 min readLW link 1 review

Do you fear the rock or the hard place?

Ruby20 Jul 2019 22:01 UTC

72 points

10 comments5 min readLW link 3 reviews

Mental Mountains

Scott Alexander27 Nov 2019 5:30 UTC

144 points

14 comments15 min readLW link 1 review

(slatestarcodex.com)

Steelmanning Divination

Vaniver5 Jun 2019 22:53 UTC

191 points

48 comments6 min readLW link 2 reviews

Book Review: Design Principles of Biological Circuits

johnswentworth5 Nov 2019 6:49 UTC

209 points

24 comments12 min readLW link 1 review

Reframing Superintelligence: Comprehensive AI Services as General Intelligence

Rohin Shah8 Jan 2019 7:12 UTC

121 points

77 comments5 min readLW link 2 reviews

(www.fhi.ox.ac.uk)

Building up to an Internal Family Systems model

Kaj_Sotala26 Jan 2019 12:25 UTC

264 points

86 comments28 min readLW link 2 reviews

Being the (Pareto) Best in the World

johnswentworth24 Jun 2019 18:36 UTC

404 points

57 comments3 min readLW link 3 reviews

The Schelling Choice is “Rabbit”, not “Stag”

Raemon8 Jun 2019 0:24 UTC

157 points

52 comments12 min readLW link 3 reviews

Literature Review: Distributed Teams

Elizabeth16 Apr 2019 1:19 UTC

106 points

37 comments6 min readLW link 1 review

Gears-Level Models are Capital Investments

johnswentworth22 Nov 2019 22:41 UTC

170 points

28 comments7 min readLW link 1 review

Evolution of Modularity

johnswentworth14 Nov 2019 6:49 UTC

174 points

12 comments2 min readLW link 1 review

You Get About Five Words

Raemon12 Mar 2019 20:30 UTC

199 points

77 comments1 min readLW link 6 reviews

Coherent decisions imply consistent utilities

Eliezer Yudkowsky12 May 2019 21:33 UTC

148 points

81 comments26 min readLW link 3 reviews

Alignment Research Field Guide

abramdemski8 Mar 2019 19:57 UTC

264 points

9 comments17 min readLW link 2 reviews

Forum participation as a research strategy

Wei Dai30 Jul 2019 18:09 UTC

151 points

45 comments3 min readLW link 1 review

The Credit Assignment Problem

abramdemski8 Nov 2019 2:50 UTC

98 points

40 comments17 min readLW link 1 review

Asymmetric Justice

Zvi25 Apr 2019 16:00 UTC

230 points

101 comments5 min readLW link 2 reviews

(thezvi.wordpress.com)

Unconscious Economics

jacobjacob27 Feb 2019 12:58 UTC

136 points

30 comments4 min readLW link 3 reviews

Power Buys You Distance From The Crime

Elizabeth2 Aug 2019 20:50 UTC

189 points

75 comments7 min readLW link 1 review

(acesounderglass.com)

Seeking Power is Often Convergently Instrumental in MDPs

TurnTrout and Logan Riggs

5 Dec 2019 2:33 UTC

162 points

39 comments17 min readLW link 2 reviews

(arxiv.org)

Yes Requires the Possibility of No

Scott Garrabrant17 May 2019 22:39 UTC

261 points

55 comments2 min readLW link 2 reviews

Mistakes with Conservation of Expected Evidence

abramdemski8 Jun 2019 23:07 UTC

212 points

25 comments12 min readLW link 1 review

Heads I Win, Tails?—Never Heard of Her; Or, Selective Reporting and the Tragedy of the Green Rationalists

Zack_M_Davis24 Sep 2019 4:12 UTC

299 points

40 comments8 min readLW link 2 reviews
