Best of LessWrong

TagLast edit: 9 Feb 2023 2:01 UTC by Raemon

Why did everything take so long?

KatjaGrace29 Dec 2017 1:00 UTC

33 points

17 comments1 min readLW link

(meteuphoric.wordpress.com)

The Loudest Alarm Is Probably False

orthonormal2 Jan 2018 16:38 UTC

171 points

28 comments2 min readLW link 1 review

Babble

alkjash10 Jan 2018 21:56 UTC

195 points

32 comments5 min readLW link 2 reviews

(radimentary.wordpress.com)

Prune

alkjash12 Jan 2018 22:50 UTC

68 points

10 comments4 min readLW link

(radimentary.wordpress.com)

An Untrollable Mathematician

abramdemski23 Jan 2018 18:46 UTC

23 points

1 comment3 min readLW link

Robustness to Scale

Scott Garrabrant21 Feb 2018 22:55 UTC

128 points

23 comments2 min readLW link 1 review

The Intelligent Social Web

Valentine22 Feb 2018 18:55 UTC

224 points

112 comments12 min readLW link 2 reviews

Arguments about fast takeoff

paulfchristiano25 Feb 2018 4:53 UTC

89 points

65 comments2 min readLW link 1 review

(sideways-view.com)

My attempt to explain Looking, insight meditation, and enlightenment in non-mysterious terms

Kaj_Sotala8 Mar 2018 7:37 UTC

223 points

131 comments17 min readLW link 2 reviews

On the Loss and Preservation of Knowledge

Samo Burja8 Mar 2018 18:40 UTC

66 points

20 comments10 min readLW link

(medium.com)

The Costly Coordination Mechanism of Common Knowledge

Ben Pace15 Mar 2018 20:20 UTC

194 points

31 comments19 min readLW link 2 reviews

Naming the Nameless

sarahconstantin22 Mar 2018 0:35 UTC

119 points

43 comments13 min readLW link 3 reviews

A Sketch of Good Communication

Ben Pace31 Mar 2018 22:48 UTC

185 points

35 comments3 min readLW link 1 review

Specification gaming examples in AI

Vika3 Apr 2018 12:30 UTC

45 points

9 comments1 min readLW link 2 reviews

Local Validity as a Key to Sanity and Civilization

Eliezer Yudkowsky7 Apr 2018 4:25 UTC

193 points

67 comments13 min readLW link 5 reviews

A voting theory primer for rationalists

Jameson Quinn12 Apr 2018 15:15 UTC

229 points

98 comments17 min readLW link 2 reviews

Noticing the Taste of Lotus

Valentine27 Apr 2018 20:05 UTC

203 points

81 comments3 min readLW link 3 reviews

Research: Rescuers during the Holocaust

Martin Sustrik30 Apr 2018 6:15 UTC

88 points

10 comments9 min readLW link 1 review

Open question: are minimal circuits daemon-free?

paulfchristiano5 May 2018 22:40 UTC

83 points

70 comments2 min readLW link 1 review

Varieties Of Argumentative Experience

Scott Alexander8 May 2018 8:20 UTC

93 points

11 comments18 min readLW link 2 reviews

(slatestarcodex.com)

Challenges to Christiano’s capability amplification proposal

Eliezer Yudkowsky19 May 2018 18:18 UTC

124 points

54 comments23 min readLW link 1 review

Inadequate Equilibria vs. Governance of the Commons

Martin Sustrik25 May 2018 13:17 UTC

182 points

17 comments14 min readLW link 2 reviews

Meta-Honesty: Firming Up Honesty Around Its Edge-Cases

Eliezer Yudkowsky29 May 2018 0:59 UTC

134 points

152 comments27 min readLW link 4 reviews

Toolbox-thinking and Law-thinking

Eliezer Yudkowsky31 May 2018 21:28 UTC

160 points

49 comments12 min readLW link

Beyond Astronomical Waste

Wei Dai7 Jun 2018 21:04 UTC

125 points

41 comments3 min readLW link

Paul’s research agenda FAQ

zhukeepa1 Jul 2018 6:25 UTC

126 points

74 comments19 min readLW link 1 review

Prediction Markets: When Do They Work?

Zvi26 Jul 2018 12:30 UTC

162 points

17 comments10 min readLW link

(thezvi.wordpress.com)

Historical mathematicians exhibit a birth order effect too

Eli Tyre21 Aug 2018 1:52 UTC

141 points

19 comments6 min readLW link 2 reviews

Birth order effect found in Nobel Laureates in Physics

Bucky4 Sep 2018 12:17 UTC

61 points

25 comments5 min readLW link 1 review

Towards a New Impact Measure

TurnTrout18 Sep 2018 17:21 UTC

100 points

159 comments33 min readLW link 2 reviews

The Tails Coming Apart As Metaphor For Life

Scott Alexander25 Sep 2018 19:10 UTC

155 points

38 comments7 min readLW link 4 reviews

(slatestarcodex.com)

Anti-social Punishment

Martin Sustrik27 Sep 2018 7:08 UTC

296 points

66 comments6 min readLW link 3 reviews

The Rocket Alignment Problem

Eliezer Yudkowsky4 Oct 2018 0:38 UTC

216 points

41 comments15 min readLW link 2 reviews

Being a Robust Agent

Raemon18 Oct 2018 7:00 UTC

145 points

32 comments7 min readLW link 2 reviews

Embedded Agents

abramdemski and Scott Garrabrant

29 Oct 2018 19:53 UTC

221 points

41 comments1 min readLW link 2 reviews

Is Clickbait Destroying Our General Intelligence?

Eliezer Yudkowsky16 Nov 2018 23:06 UTC

189 points

61 comments5 min readLW link 2 reviews

Is Science Slowing Down?

Scott Alexander27 Nov 2018 3:30 UTC

125 points

77 comments9 min readLW link 1 review

(slatestarcodex.com)

Coherence arguments do not entail goal-directed behavior

Rohin Shah3 Dec 2018 3:26 UTC

123 points

69 comments7 min readLW link 3 reviews

The Pavlov Strategy

sarahconstantin20 Dec 2018 16:20 UTC

247 points

13 comments4 min readLW link

(srconstantin.wordpress.com)

Spaghetti Towers

eukaryote22 Dec 2018 5:29 UTC

187 points

28 comments3 min readLW link 1 review

(eukaryotewritesblog.com)

[Question] What makes people intellectually active?

abramdemski29 Dec 2018 22:29 UTC

116 points

71 comments1 min readLW link

Reframing Superintelligence: Comprehensive AI Services as General Intelligence

Rohin Shah8 Jan 2019 7:12 UTC

121 points

77 comments5 min readLW link 2 reviews

(www.fhi.ox.ac.uk)

Building up to an Internal Family Systems model

Kaj_Sotala26 Jan 2019 12:25 UTC

264 points

86 comments28 min readLW link 2 reviews

“Other people are wrong” vs “I am right”

Buck22 Feb 2019 20:01 UTC

246 points

20 comments9 min readLW link 2 reviews

Rule Thinkers In, Not Out

Scott Alexander27 Feb 2019 2:40 UTC

221 points

67 comments4 min readLW link 4 reviews

(slatestarcodex.com)

Unconscious Economics

jacobjacob27 Feb 2019 12:58 UTC

136 points

30 comments4 min readLW link 3 reviews

In My Culture

[DEACTIVATED] Duncan Sabien7 Mar 2019 7:22 UTC

66 points

59 comments1 min readLW link 2 reviews

(medium.com)

Alignment Research Field Guide

abramdemski8 Mar 2019 19:57 UTC

264 points

9 comments17 min readLW link 2 reviews

You Get About Five Words

Raemon12 Mar 2019 20:30 UTC

199 points

76 comments1 min readLW link 6 reviews

Literature Review: Distributed Teams

Elizabeth16 Apr 2019 1:19 UTC

106 points

37 comments6 min readLW link 1 review

Asymmetric Justice

Zvi25 Apr 2019 16:00 UTC

230 points

101 comments5 min readLW link 2 reviews

(thezvi.wordpress.com)

Coherent decisions imply consistent utilities

Eliezer Yudkowsky12 May 2019 21:33 UTC

148 points

81 comments26 min readLW link 3 reviews

Yes Requires the Possibility of No

Scott Garrabrant17 May 2019 22:39 UTC

261 points

55 comments2 min readLW link 2 reviews

Book Review: The Secret Of Our Success

Scott Alexander5 Jun 2019 6:50 UTC

158 points

19 comments25 min readLW link 2 reviews

(slatestarcodex.com)

Steelmanning Divination

Vaniver5 Jun 2019 22:53 UTC

191 points

48 comments6 min readLW link 2 reviews

The Schelling Choice is “Rabbit”, not “Stag”

Raemon8 Jun 2019 0:24 UTC

157 points

52 comments12 min readLW link 3 reviews

Mistakes with Conservation of Expected Evidence

abramdemski8 Jun 2019 23:07 UTC

212 points

25 comments12 min readLW link 1 review

Reason isn’t magic

Benquo18 Jun 2019 4:04 UTC

152 points

19 comments2 min readLW link 3 reviews

(benjaminrosshoffman.com)

Being the (Pareto) Best in the World

johnswentworth24 Jun 2019 18:36 UTC

402 points

57 comments3 min readLW link 3 reviews

Do you fear the rock or the hard place?

Ruby20 Jul 2019 22:01 UTC

72 points

10 comments5 min readLW link 3 reviews

Forum participation as a research strategy

Wei Dai30 Jul 2019 18:09 UTC

151 points

45 comments3 min readLW link 1 review

How to Ignore Your Emotions (while also thinking you’re awesome at emotions)

Hazard31 Jul 2019 13:34 UTC

351 points

74 comments4 min readLW link 4 reviews

Power Buys You Distance From The Crime

Elizabeth2 Aug 2019 20:50 UTC

189 points

75 comments7 min readLW link 1 review

(acesounderglass.com)

Gears vs Behavior

johnswentworth19 Sep 2019 6:50 UTC

107 points

13 comments7 min readLW link 1 review

Heads I Win, Tails?—Never Heard of Her; Or, Selective Reporting and the Tragedy of the Green Rationalists

Zack_M_Davis24 Sep 2019 4:12 UTC

299 points

40 comments8 min readLW link 2 reviews

Noticing Frame Differences

Raemon30 Sep 2019 1:24 UTC

208 points

39 comments9 min readLW link 2 reviews

Book summary: Unlocking the Emotional Brain

Kaj_Sotala8 Oct 2019 19:11 UTC

316 points

48 comments21 min readLW link 3 reviews

Chris Olah’s views on AGI safety

evhub1 Nov 2019 20:13 UTC

206 points

38 comments12 min readLW link 2 reviews

Book Review: Design Principles of Biological Circuits

johnswentworth5 Nov 2019 6:49 UTC

209 points

24 comments12 min readLW link 1 review

The Credit Assignment Problem

abramdemski8 Nov 2019 2:50 UTC

98 points

40 comments17 min readLW link 1 review

Evolution of Modularity

johnswentworth14 Nov 2019 6:49 UTC

174 points

12 comments2 min readLW link 1 review

Gears-Level Models are Capital Investments

johnswentworth22 Nov 2019 22:41 UTC

170 points

28 comments7 min readLW link 1 review

Mental Mountains

Scott Alexander27 Nov 2019 5:30 UTC

144 points

14 comments15 min readLW link 1 review

(slatestarcodex.com)

Paper-Reading for Gears

johnswentworth4 Dec 2019 21:02 UTC

159 points

6 comments4 min readLW link 1 review

Seeking Power is Often Convergently Instrumental in MDPs

TurnTrout and Logan Riggs

5 Dec 2019 2:33 UTC

162 points

39 comments17 min readLW link 2 reviews

(arxiv.org)

Understanding “Deep Double Descent”

evhub6 Dec 2019 0:00 UTC

148 points

51 comments5 min readLW link 4 reviews

Propagating Facts into Aesthetics

Raemon19 Dec 2019 4:09 UTC

109 points

35 comments11 min readLW link 1 review

What cognitive biases feel like from the inside

chaosmage3 Jan 2020 14:24 UTC

249 points

32 comments4 min readLW link

CFAR Participant Handbook now available to all

[DEACTIVATED] Duncan Sabien3 Jan 2020 15:43 UTC

248 points

40 comments1 min readLW link 2 reviews

Reality-Revealing and Reality-Masking Puzzles

AnnaSalamon16 Jan 2020 16:15 UTC

258 points

57 comments13 min readLW link 1 review

The Road to Mazedom

Zvi18 Jan 2020 14:10 UTC

94 points

25 comments7 min readLW link 2 reviews

(thezvi.wordpress.com)

Coordination as a Scarce Resource

johnswentworth25 Jan 2020 23:32 UTC

231 points

22 comments4 min readLW link 2 reviews

What Money Cannot Buy

johnswentworth1 Feb 2020 20:11 UTC

318 points

49 comments4 min readLW link 1 review

Seeing the Smoke

Jacob Falkovich28 Feb 2020 18:26 UTC

198 points

29 comments5 min readLW link 1 review

Cortés, Pizarro, and Afonso as Precedents for Takeover

Daniel Kokotajlo1 Mar 2020 3:49 UTC

168 points

78 comments11 min readLW link 1 review

Interfaces as a Scarce Resource

johnswentworth5 Mar 2020 18:20 UTC

187 points

15 comments12 min readLW link 1 review

Credibility of the CDC on SARS-CoV-2

Elizabeth and jimrandomh

7 Mar 2020 2:00 UTC

226 points

119 comments6 min readLW link 1 review

Can crimes be discussed literally?

Benquo22 Mar 2020 20:17 UTC

102 points

38 comments2 min readLW link 3 reviews

(benjaminrosshoffman.com)

Transportation as a Constraint

johnswentworth6 Apr 2020 4:58 UTC

176 points

32 comments6 min readLW link 1 review

Choosing the Zero Point

orthonormal6 Apr 2020 23:44 UTC

170 points

24 comments3 min readLW link 2 reviews

An Orthodox Case Against Utility Functions

abramdemski7 Apr 2020 19:18 UTC

152 points

65 comments8 min readLW link 2 reviews

Discontinuous progress in history: an update

KatjaGrace14 Apr 2020 0:00 UTC

186 points

25 comments31 min readLW link 1 review

(aiimpacts.org)

How uniform is the neocortex?

zhukeepa4 May 2020 2:16 UTC

79 points

23 comments11 min readLW link 1 review

A non-mystical explanation of “no-self” (three characteristics series)

Kaj_Sotala8 May 2020 10:37 UTC

105 points

65 comments20 min readLW link 1 review

Studies On Slack

Scott Alexander13 May 2020 5:00 UTC

151 points

34 comments24 min readLW link 1 review

(slatestarcodex.com)

An overview of 11 proposals for building safe advanced AI

evhub29 May 2020 20:38 UTC

205 points

36 comments38 min readLW link 2 reviews

Covid-19: My Current Model

Zvi31 May 2020 17:40 UTC

188 points

74 comments19 min readLW link 1 review

(thezvi.wordpress.com)

Inaccessible information

paulfchristiano3 Jun 2020 5:10 UTC

83 points

17 comments14 min readLW link 2 reviews

(ai-alignment.com)

Simulacra Levels and their Interactions

Zvi15 Jun 2020 13:10 UTC

197 points

50 comments17 min readLW link 1 review

(thezvi.wordpress.com)

The ground of optimization

Alex Flint20 Jun 2020 0:38 UTC

245 points

80 comments27 min readLW link 1 review

Swiss Political System: More than You ever Wanted to Know (I.)

Martin Sustrik19 Jul 2020 1:11 UTC

171 points

39 comments24 min readLW link 2 reviews

“Can you keep this confidential? How do you know?”

Raemon21 Jul 2020 0:33 UTC

159 points

41 comments3 min readLW link 2 reviews

Inner Alignment: Explain like I’m 12 Edition

Rafael Harth1 Aug 2020 15:24 UTC

179 points

46 comments13 min readLW link 2 reviews

Alignment By Default

johnswentworth12 Aug 2020 18:54 UTC

173 points

94 comments11 min readLW link 2 reviews

Search versus design

Alex Flint16 Aug 2020 16:53 UTC

100 points

40 comments36 min readLW link 1 review

Why haven’t we celebrated any major achievements lately?

jasoncrawford17 Aug 2020 20:34 UTC

194 points

69 comments12 min readLW link 2 reviews

(rootsofprogress.org)

Radical Probabilism

abramdemski18 Aug 2020 21:14 UTC

176 points

47 comments35 min readLW link 1 review

Introduction To The Infra-Bayesianism Sequence

Diffractor and Vanessa Kosoy

26 Aug 2020 20:31 UTC

108 points

62 comments14 min readLW link 2 reviews

microCOVID.org: A tool to estimate COVID risk from common activities

catherio29 Aug 2020 23:01 UTC

169 points

36 comments1 min readLW link 1 review

(microcovid.org)

My computational framework for the brain

Steven Byrnes14 Sep 2020 14:19 UTC

150 points

26 comments13 min readLW link 1 review

Most Prisoner’s Dilemmas are Stag Hunts; Most Stag Hunts are Schelling Problems

abramdemski14 Sep 2020 22:13 UTC

177 points

36 comments10 min readLW link 3 reviews

Draft report on AI timelines

Ajeya Cotra18 Sep 2020 23:47 UTC

214 points

56 comments1 min readLW link 1 review

AGI safety from first principles: Introduction

Richard_Ngo28 Sep 2020 19:53 UTC

121 points

18 comments2 min readLW link 1 review

The Felt Sense: What, Why and How

Kaj_Sotala5 Oct 2020 15:57 UTC

149 points

23 comments14 min readLW link 1 review

The Alignment Problem: Machine Learning and Human Values

Rohin Shah6 Oct 2020 17:41 UTC

120 points

7 comments6 min readLW link 1 review

(www.amazon.com)

The Treacherous Path to Rationality

Jacob Falkovich9 Oct 2020 15:34 UTC

204 points

115 comments11 min readLW link 1 review

The Solomonoff Prior is Malign

Mark Xu14 Oct 2020 1:33 UTC

168 points

52 comments16 min readLW link 3 reviews

The date of AI Takeover is not the day the AI takes over

Daniel Kokotajlo22 Oct 2020 10:41 UTC

145 points

32 comments2 min readLW link 1 review

Introduction to Cartesian Frames

Scott Garrabrant22 Oct 2020 13:00 UTC

153 points

32 comments22 min readLW link 1 review

Is Success the Enemy of Freedom? (Full)

alkjash26 Oct 2020 20:25 UTC

291 points

68 comments9 min readLW link 1 review

(radimentary.wordpress.com)

When Money Is Abundant, Knowledge Is The Real Wealth

johnswentworth3 Nov 2020 17:34 UTC

317 points

61 comments5 min readLW link 3 reviews

Nuclear war is unlikely to cause human extinction

Jeffrey Ladish7 Nov 2020 5:42 UTC

124 points

47 comments11 min readLW link 3 reviews

The Pointers Problem: Human Values Are A Function Of Humans’ Latent Variables

johnswentworth18 Nov 2020 17:47 UTC

123 points

49 comments11 min readLW link 2 reviews

Some AI research areas and their relevance to existential safety

Andrew_Critch19 Nov 2020 3:18 UTC

204 points

37 comments50 min readLW link 2 reviews

Pain is not the unit of Effort

alkjash24 Nov 2020 20:00 UTC

517 points

89 comments5 min readLW link 2 reviews

(radimentary.wordpress.com)

To listen well, get curious

benkuhn13 Dec 2020 0:20 UTC

351 points

37 comments4 min readLW link 1 review

(www.benkuhn.net)

Motive Ambiguity

Zvi15 Dec 2020 18:10 UTC

172 points

58 comments4 min readLW link 2 reviews

(thezvi.wordpress.com)

The First Sample Gives the Most Information

Mark Xu24 Dec 2020 20:39 UTC

132 points

16 comments1 min readLW link 1 review

(markxu.com)

Why Neural Networks Generalise, and Why They Are (Kind of) Bayesian

Joar Skalse29 Dec 2020 13:33 UTC

74 points

58 comments1 min readLW link 1 review

Against GDP as a metric for timelines and takeoff speeds

Daniel Kokotajlo29 Dec 2020 17:42 UTC

140 points

19 comments14 min readLW link 1 review

Anti-Aging: State of the Art

JackH31 Dec 2020 19:07 UTC

371 points

176 comments11 min readLW link 1 review

Cryonics signup guide #1: Overview

mingyuan6 Jan 2021 0:25 UTC

150 points

33 comments6 min readLW link 1 review

Science in a High-Dimensional World

johnswentworth8 Jan 2021 17:52 UTC

285 points

53 comments7 min readLW link 1 review

Leaky Delegation: You are not a Commodity

Darmani25 Jan 2021 2:04 UTC

297 points

34 comments15 min readLW link 1 review

Simulacrum 3 As Stag-Hunt Strategy

johnswentworth26 Jan 2021 19:40 UTC

179 points

37 comments4 min readLW link 3 reviews

Catching the Spark

LoganStrohl30 Jan 2021 23:23 UTC

111 points

21 comments36 min readLW link 1 review

Elephant seal 2

KatjaGrace2 Feb 2021 9:40 UTC

57 points

5 comments1 min readLW link 2 reviews

(worldspiritsockpuppet.com)

Making Vaccine

johnswentworth3 Feb 2021 20:24 UTC

574 points

249 comments6 min readLW link 3 reviews

Your Cheerful Price

Eliezer Yudkowsky13 Feb 2021 5:41 UTC

262 points

82 comments17 min readLW link 6 reviews

“PR” is corrosive; “reputation” is not.

AnnaSalamon14 Feb 2021 3:32 UTC

307 points

93 comments2 min readLW link 3 reviews

Utility Maximization = Description Length Minimization

johnswentworth18 Feb 2021 18:04 UTC

208 points

44 comments5 min readLW link

Fun with +12 OOMs of Compute

Daniel Kokotajlo1 Mar 2021 13:30 UTC

224 points

86 comments12 min readLW link 2 reviews

Seven Years of Spaced Repetition Software in the Classroom

tanagrabeast4 Mar 2021 2:42 UTC

265 points

38 comments34 min readLW link 1 review

Trapped Priors As A Basic Problem Of Rationality

Scott Alexander12 Mar 2021 20:02 UTC

141 points

32 comments14 min readLW link 3 reviews

Strong Evidence is Common

Mark Xu13 Mar 2021 22:04 UTC

244 points

49 comments1 min readLW link 4 reviews

(markxu.com)

Jean Monnet: The Guerilla Bureaucrat

Martin Sustrik20 Mar 2021 10:37 UTC

175 points

25 comments18 min readLW link 1 review

My research methodology

paulfchristiano22 Mar 2021 21:20 UTC

159 points

38 comments16 min readLW link 1 review

(ai-alignment.com)

Rationalism before the Sequences

Eric Raymond30 Mar 2021 14:04 UTC

581 points

81 comments11 min readLW link 2 reviews

What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)

Andrew_Critch31 Mar 2021 23:50 UTC

272 points

64 comments22 min readLW link 1 review

Notes from “Don’t Shoot the Dog”

juliawise2 Apr 2021 16:34 UTC

244 points

11 comments12 min readLW link 1 review

Another (outer) alignment failure story

paulfchristiano7 Apr 2021 20:12 UTC

241 points

38 comments12 min readLW link 1 review

Highlights from The Autobiography of Andrew Carnegie

jasoncrawford8 Apr 2021 22:03 UTC

92 points

9 comments19 min readLW link 1 review

(rootsofprogress.org)

Specializing in Problems We Don’t Understand

johnswentworth10 Apr 2021 22:40 UTC

159 points

29 comments8 min readLW link 1 review

There’s no such thing as a tree (phylogenetically)

eukaryote3 May 2021 3:47 UTC

333 points

58 comments8 min readLW link 2 reviews

(eukaryotewritesblog.com)

Saving Time

Scott Garrabrant18 May 2021 20:11 UTC

156 points

20 comments4 min readLW link 1 review

Finite Factored Sets

Scott Garrabrant23 May 2021 20:52 UTC

146 points

95 comments24 min readLW link 1 review

Taboo “Outside View”

Daniel Kokotajlo17 Jun 2021 9:36 UTC

348 points

33 comments8 min readLW link 3 reviews

The Point of Trade

Eliezer Yudkowsky22 Jun 2021 17:56 UTC

171 points

76 comments4 min readLW link 1 review

Frequent arguments about alignment

John Schulman23 Jun 2021 0:46 UTC

99 points

17 comments5 min readLW link

Slack Has Positive Externalities For Groups

johnswentworth29 Jul 2021 15:03 UTC

90 points

11 comments5 min readLW link 2 reviews

What 2026 looks like

Daniel Kokotajlo6 Aug 2021 16:14 UTC

473 points

150 comments16 min readLW link 1 review

The Death of Behavioral Economics

habryka22 Aug 2021 22:39 UTC

153 points

24 comments1 min readLW link 2 reviews

(www.thebehavioralscientist.com)

How To Write Quickly While Maintaining Epistemic Rigor

johnswentworth28 Aug 2021 17:52 UTC

428 points

38 comments4 min readLW link 3 reviews

Grokking the Intentional Stance

jbkjr31 Aug 2021 15:49 UTC

43 points

22 comments20 min readLW link 1 review

All Possible Views About Humanity’s Future Are Wild

HoldenKarnofsky3 Sep 2021 20:19 UTC

146 points

37 comments8 min readLW link 1 review

How factories were made safe

jasoncrawford12 Sep 2021 19:58 UTC

181 points

46 comments18 min readLW link 1 review

(rootsofprogress.org)

This Can’t Go On

HoldenKarnofsky18 Sep 2021 23:50 UTC

73 points

55 comments7 min readLW link 2 reviews

Selection Theorems: A Program For Understanding Agents

johnswentworth28 Sep 2021 5:03 UTC

123 points

28 comments6 min readLW link 2 reviews

What Do GDP Growth Curves Really Mean?

johnswentworth7 Oct 2021 21:58 UTC

219 points

64 comments8 min readLW link 2 reviews

Shoulder Advisors 101

[DEACTIVATED] Duncan Sabien9 Oct 2021 5:30 UTC

193 points

124 comments14 min readLW link 2 reviews

Cup-Stacking Skills (or, Reflexive Involuntary Mental Motions)

[DEACTIVATED] Duncan Sabien11 Oct 2021 7:16 UTC

117 points

36 comments7 min readLW link 2 reviews

Lies, Damn Lies, and Fabricated Options

[DEACTIVATED] Duncan Sabien17 Oct 2021 2:47 UTC

288 points

132 comments14 min readLW link 2 reviews

Self-Integrity and the Drowning Child

Eliezer Yudkowsky24 Oct 2021 20:57 UTC

329 points

85 comments5 min readLW link 1 review

Ruling Out Everything Else

[DEACTIVATED] Duncan Sabien27 Oct 2021 21:50 UTC

190 points

51 comments21 min readLW link 2 reviews

Feature Selection

Zack_M_Davis1 Nov 2021 0:22 UTC

315 points

24 comments16 min readLW link 1 review

Comments on Carlsmith’s “Is power-seeking AI an existential risk?”

So8res13 Nov 2021 4:29 UTC

138 points

14 comments40 min readLW link 1 review

You are probably underestimating how good self-love can be

Charlie Rogers-Smith14 Nov 2021 0:41 UTC

145 points

19 comments12 min readLW link 1 review

Ngo and Yudkowsky on alignment difficulty

Eliezer Yudkowsky and Richard_Ngo

15 Nov 2021 20:31 UTC

250 points

148 comments99 min readLW link 1 review

Split and Commit

[DEACTIVATED] Duncan Sabien21 Nov 2021 6:27 UTC

178 points

33 comments7 min readLW link 1 review

EfficientZero: How It Works

1a3orn26 Nov 2021 15:17 UTC

292 points

50 comments29 min readLW link 1 review

Frame Control

Aella27 Nov 2021 22:59 UTC

314 points

282 comments23 min readLW link 2 reviews

The Rationalists of the 1950s (and before) also called themselves “Rationalists”

Owain_Evans28 Nov 2021 20:17 UTC

187 points

30 comments3 min readLW link 1 review

Lars Doucet’s Georgism series on Astral Codex Ten

Sune4 Dec 2021 19:43 UTC

13 points

2 comments1 min readLW link 1 review

(astralcodexten.substack.com)

The Plan

johnswentworth10 Dec 2021 23:41 UTC

254 points

78 comments14 min readLW link 1 review

ARC’s first technical report: Eliciting Latent Knowledge

paulfchristiano, Mark Xu and Ajeya Cotra

14 Dec 2021 20:09 UTC

225 points

90 comments1 min readLW link 3 reviews

(docs.google.com)

Worst-case thinking in AI alignment

Buck23 Dec 2021 1:29 UTC

162 points

18 comments6 min readLW link 2 reviews

No comments.