Luck based medicine: my resentful story of becoming a medical miracle

Elizabeth

16 Oct 2022 17:40 UTC

497 points

121 comments · 12 min read · LW link · 3 reviews

(acesounderglass.com)

Counterarguments to the basic AI x-risk case

KatjaGrace

14 Oct 2022 13:00 UTC

376 points

126 comments · 34 min read · LW link · 1 review

(aiimpacts.org)

So, geez there’s a lot of AI content these days

Raemon

6 Oct 2022 21:32 UTC

260 points

140 comments · 6 min read · LW link

Introduction to abstract entropy

Alex_Altair

20 Oct 2022 21:03 UTC

252 points

78 comments · 18 min read · LW link · 1 review

Lessons learned from talking to >100 academics about AI safety

Marius Hobbhahn

10 Oct 2022 13:16 UTC

218 points

18 comments · 12 min read · LW link · 1 review

What does it take to defend the world against out-of-control AGIs?

Steven Byrnes

25 Oct 2022 14:47 UTC

218 points

52 comments · 30 min read · LW link · 1 review

Six (and a half) intuitions for KL divergence

CallumMcDougall

12 Oct 2022 21:07 UTC

185 points

27 comments · 10 min read · LW link · 1 review

(www.perfectlynormal.co.uk)

The Social Recession: By the Numbers

antonomon

29 Oct 2022 18:45 UTC

165 points

29 comments · 8 min read · LW link

(novum.substack.com)

Decision theory does not imply that we get to have nice things

So8res

18 Oct 2022 3:04 UTC

165 points

76 comments · 26 min read · LW link · 2 reviews

Why I think there’s a one-in-six chance of an imminent global nuclear war

Max Tegmark

8 Oct 2022 6:26 UTC

164 points

169 comments · 4 min read · LW link

Age changes what you care about

Dentin

16 Oct 2022 15:36 UTC

144 points

38 comments · 2 min read · LW link

Why Weren’t Hot Air Balloons Invented Sooner?

Lost Futures

18 Oct 2022 0:41 UTC

140 points

52 comments · 6 min read · LW link

(lostfutures.substack.com)

Don’t leave your fingerprints on the future

So8res

8 Oct 2022 0:35 UTC

138 points

48 comments · 5 min read · LW link

AI Timelines via Cumulative Optimization Power: Less Long, More Short

jacob_cannell

6 Oct 2022 0:21 UTC

138 points

33 comments · 6 min read · LW link

Niceness is unnatural

So8res

13 Oct 2022 1:30 UTC

136 points

20 comments · 8 min read · LW link · 1 review

Apply to the Redwood Research Mechanistic Interpretability Experiment (REMIX), a research program in Berkeley

maxnadeau, Xander Davies, Buck and Nate Thomas

27 Oct 2022 1:32 UTC

135 points

14 comments · 12 min read · LW link

Warning Shots Probably Wouldn’t Change The Picture Much

So8res

6 Oct 2022 5:15 UTC

132 points

42 comments · 2 min read · LW link

Mnestics

Jarred Filmer

23 Oct 2022 0:30 UTC

127 points

6 comments · 4 min read · LW link

Am I secretly excited for AI getting weird?

porby

29 Oct 2022 22:16 UTC

116 points

4 comments · 4 min read · LW link

Consider your appetite for disagreements

Adam Zerner

8 Oct 2022 23:25 UTC

114 points

18 comments · 6 min read · LW link · 1 review

Actually, All Nuclear Famine Papers are Bunk

Lao Mein

12 Oct 2022 5:58 UTC

113 points

37 comments · 2 min read · LW link · 1 review

That one apocalyptic nuclear famine paper is bunk

Lao Mein

12 Oct 2022 3:33 UTC

111 points

10 comments · 1 min read · LW link

Plans Are Predictions, Not Optimization Targets

johnswentworth

20 Oct 2022 21:17 UTC

110 points

20 comments · 4 min read · LW link · 1 review

Contra shard theory, in the context of the diamond maximizer problem

So8res

13 Oct 2022 23:51 UTC

107 points

19 comments · 2 min read · LW link · 1 review

The Teacup Test

lsusr

8 Oct 2022 4:25 UTC

105 points

32 comments · 2 min read · LW link

Scaling Laws for Reward Model Overoptimization

leogao, John Schulman and Jacob_Hilton

20 Oct 2022 0:20 UTC

103 points

13 comments · 1 min read · LW link

(arxiv.org)

Alignment 201 curriculum

Richard_Ngo

12 Oct 2022 18:03 UTC

102 points

3 comments · 1 min read · LW link

(www.agisafetyfundamentals.com)

Analysis: US restricts GPU sales to China

aog

7 Oct 2022 18:38 UTC

102 points

58 comments · 5 min read · LW link

«Boundaries», Part 3a: Defining boundaries as directed Markov blankets

Andrew_Critch

30 Oct 2022 6:31 UTC

101 points

20 comments · 15 min read · LW link

Some Lessons Learned from Studying Indirect Object Identification in GPT-2 small

RowanWang, Alexandre Variengien, Arthur Conmy, Buck and jsteinhardt

28 Oct 2022 23:55 UTC

101 points

9 comments · 9 min read · LW link · 2 reviews

(arxiv.org)

A blog post is a very long and complex search query to find fascinating people and make them route interesting stuff to your inbox

Henrik Karlsson

5 Oct 2022 19:07 UTC

99 points

12 comments · 11 min read · LW link

(escapingflatland.substack.com)

How To Make Prediction Markets Useful For Alignment Work

johnswentworth

18 Oct 2022 19:01 UTC

97 points

18 comments · 2 min read · LW link

“Normal” is the equilibrium state of past optimization processes

Alex_Altair

30 Oct 2022 19:03 UTC

96 points

5 comments · 5 min read · LW link

A shot at the diamond-alignment problem

TurnTrout

6 Oct 2022 18:29 UTC

95 points

67 comments · 15 min read · LW link

Transformative VR Is Likely Coming Soon

jimrandomh

13 Oct 2022 6:25 UTC

90 points

47 comments · 2 min read · LW link

The heritability of human values: A behavior genetic critique of Shard Theory

geoffreymiller

20 Oct 2022 15:51 UTC

89 points

65 comments · 21 min read · LW link

Why Balsa Research is Worthwhile

Zvi

10 Oct 2022 13:50 UTC

87 points

12 comments · 8 min read · LW link

(thezvi.wordpress.com)

Polysemanticity and Capacity in Neural Networks

Buck, Adam Jermyn and Kshitij Sachan

7 Oct 2022 17:51 UTC

87 points

14 comments · 3 min read · LW link

I learn better when I frame learning as Vengeance for losses incurred through ignorance, and you might too

chaosmage

15 Oct 2022 12:41 UTC

85 points

9 comments · 3 min read · LW link · 1 review

Maximal Lotteries

Scott Garrabrant

17 Oct 2022 8:54 UTC

85 points

11 comments · 7 min read · LW link

Untapped Potential at 13-18

belkarx

18 Oct 2022 18:09 UTC

83 points

53 comments · 1 min read · LW link

More Recent Progress in the Theory of Neural Networks

jylin04

6 Oct 2022 16:57 UTC

82 points

6 comments · 4 min read · LW link

Paper: Discovering novel algorithms with AlphaTensor [Deepmind]

LawrenceC

5 Oct 2022 16:20 UTC

82 points

18 comments · 1 min read · LW link

(www.deepmind.com)

Voting Theory Introduction

Scott Garrabrant

17 Oct 2022 8:48 UTC

80 points

8 comments · 6 min read · LW link

The “you-can-just” alarm

Emrik

8 Oct 2022 10:43 UTC

80 points

3 comments · 1 min read · LW link

Neural Tangent Kernel Distillation

Thomas Larsen and Jeremy Gillen

5 Oct 2022 18:11 UTC

79 points

20 comments · 8 min read · LW link

Wisdom Cannot Be Unzipped

Sable

22 Oct 2022 0:28 UTC

77 points

17 comments · 7 min read · LW link · 1 review

(affablyevil.substack.com)

Response to Katja Grace’s AI x-risk counterarguments

Erik Jenner and Johannes Treutlein

19 Oct 2022 1:17 UTC

77 points

18 comments · 15 min read · LW link

Open Problem in Voting Theory

Scott Garrabrant

17 Oct 2022 20:42 UTC

75 points

17 comments · 6 min read · LW link

Maximal Lottery-Lotteries

Scott Garrabrant

17 Oct 2022 20:39 UTC

74 points

15 comments · 4 min read · LW link