GPTs are Predictors, not Imitators

Eliezer Yudkowsky · 8 Apr 2023 19:59 UTC
375 points
90 comments · 3 min read · LW link

LW Team is adjusting moderation policy

Raemon · 4 Apr 2023 20:41 UTC
296 points
182 comments · 3 min read · LW link

Hooray for stepping out of the limelight

So8res · 1 Apr 2023 2:45 UTC
281 points
24 comments · 1 min read · LW link

Notes on Teaching in Prison

jsd · 19 Apr 2023 1:53 UTC
270 points
12 comments · 12 min read · LW link

[SEE NEW EDITS] No, *You* Need to Write Clearer

NicholasKross · 29 Apr 2023 5:04 UTC
254 points
64 comments · 5 min read · LW link
(www.thinkingmuchbetter.com)

Pausing AI Developments Isn’t Enough. We Need to Shut it All Down

Eliezer Yudkowsky · 8 Apr 2023 0:36 UTC
248 points
39 comments · 12 min read · LW link

On AutoGPT

Zvi · 13 Apr 2023 12:30 UTC
248 points
47 comments · 20 min read · LW link
(thezvi.wordpress.com)

My Assessment of the Chinese AI Safety Community

Lao Mein · 25 Apr 2023 4:21 UTC
244 points
93 comments · 3 min read · LW link

My views on “doom”

paulfchristiano · 27 Apr 2023 17:50 UTC
242 points
33 comments · 2 min read · LW link
(ai-alignment.com)

Policy discussions follow strong contextualizing norms

Richard_Ngo · 1 Apr 2023 23:51 UTC
230 points
61 comments · 3 min read · LW link

Catching the Eye of Sauron

Casey B. · 7 Apr 2023 0:40 UTC
221 points
68 comments · 4 min read · LW link

Orthogonal: A new agent foundations alignment organization

Tamsin Leake · 19 Apr 2023 20:17 UTC
207 points
4 comments · 1 min read · LW link
(orxl.org)

Eliezer Yudkowsky’s Letter in Time Magazine

Zvi · 5 Apr 2023 18:00 UTC
203 points
86 comments · 14 min read · LW link
(thezvi.wordpress.com)

If interpretability research goes well, it may get dangerous

So8res · 3 Apr 2023 21:48 UTC
197 points
10 comments · 2 min read · LW link

Evolution provides no evidence for the sharp left turn

Quintin Pope · 11 Apr 2023 18:43 UTC
193 points
62 comments · 15 min read · LW link

Giant (In)scrutable Matrices: (Maybe) the Best of All Possible Worlds

1a3orn · 4 Apr 2023 17:39 UTC
191 points
35 comments · 5 min read · LW link

Transcript and Brief Response to Twitter Conversation between Yann LeCunn and Eliezer Yudkowsky

Zvi · 26 Apr 2023 13:10 UTC
187 points
50 comments · 10 min read · LW link
(thezvi.wordpress.com)

The basic reasons I expect AGI ruin

Rob Bensinger · 18 Apr 2023 3:37 UTC
187 points
72 comments · 14 min read · LW link

The ‘ petertodd’ phenomenon

mwatkins · 15 Apr 2023 0:59 UTC
180 points
50 comments · 38 min read · LW link

A report about LessWrong karma volatility from a different universe

Ben Pace · 1 Apr 2023 21:48 UTC
176 points
7 comments · 1 min read · LW link

Talking publicly about AI risk

Jan_Kulveit · 21 Apr 2023 11:28 UTC
173 points
8 comments · 6 min read · LW link

Killing Socrates

[DEACTIVATED] Duncan Sabien · 11 Apr 2023 10:28 UTC
172 points
143 comments · 8 min read · LW link

The Brain is Not Close to Thermodynamic Limits on Computation

DaemonicSigil · 24 Apr 2023 8:21 UTC
167 points
58 comments · 5 min read · LW link

[April Fools’] Definitive confirmation of shard theory

TurnTrout · 1 Apr 2023 7:27 UTC
166 points
7 comments · 2 min read · LW link

grey goo is unlikely

bhauth · 17 Apr 2023 1:59 UTC
161 points
109 comments · 9 min read · LW link
(bhauth.com)

Davidad’s Bold Plan for Alignment: An In-Depth Explanation

19 Apr 2023 16:09 UTC
154 points
29 comments · 21 min read · LW link

Agentized LLMs will change the alignment landscape

Seth Herd · 9 Apr 2023 2:29 UTC
153 points
95 comments · 3 min read · LW link

Shutting down AI is not enough. We need to destroy all technology.

Matthew Barnett · 1 Apr 2023 21:03 UTC
148 points
35 comments · 1 min read · LW link

A freshman year during the AI midgame: my approach to the next year

Buck · 14 Apr 2023 0:38 UTC
146 points
14 comments · 1 min read · LW link

Could a superintelligence deduce general relativity from a falling apple? An investigation

titotal · 23 Apr 2023 12:49 UTC
146 points
39 comments · 9 min read · LW link

AI doom from an LLM-plateau-ist perspective

Steven Byrnes · 27 Apr 2023 13:58 UTC
144 points
23 comments · 6 min read · LW link

The self-unalignment problem

14 Apr 2023 12:10 UTC
144 points
22 comments · 10 min read · LW link

Consider The Hand Axe

ymeskhout · 8 Apr 2023 1:31 UTC
142 points
16 comments · 6 min read · LW link

Request to AGI organizations: Share your views on pausing AI progress

11 Apr 2023 17:30 UTC
141 points
11 comments · 1 min read · LW link

AI x-risk, approximately ordered by embarrassment

Alex Lawsen · 12 Apr 2023 23:01 UTC
140 points
7 comments · 19 min read · LW link

Four mindset disagreements behind existential risk disagreements in ML

Rob Bensinger · 11 Apr 2023 4:53 UTC
136 points
12 comments · 1 min read · LW link

The Learning-Theoretic Agenda: Status 2023

Vanessa Kosoy · 19 Apr 2023 5:21 UTC
135 points
13 comments · 55 min read · LW link

AI Summer Harvest

Cleo Nardo · 4 Apr 2023 3:35 UTC
130 points
10 comments · 1 min read · LW link

Misgeneralization as a misnomer

So8res · 6 Apr 2023 20:43 UTC
129 points
22 comments · 4 min read · LW link

$250 prize for checking Jake Cannell’s Brain Efficiency

Alexander Gietelink Oldenziel · 26 Apr 2023 16:21 UTC
123 points
170 comments · 2 min read · LW link

But why would the AI kill us?

So8res · 17 Apr 2023 18:42 UTC
117 points
86 comments · 2 min read · LW link

Goodhart’s Law inside the human mind

Kaj_Sotala · 17 Apr 2023 13:48 UTC
116 points
13 comments · 16 min read · LW link

Tuning your Cognitive Strategies

Raemon · 27 Apr 2023 20:32 UTC
114 points
55 comments · 9 min read · LW link
(bewelltuned.com)

[New LW Feature] “Debates”

1 Apr 2023 7:00 UTC
113 points
34 comments · 1 min read · LW link

Deep learning models might be secretly (almost) linear

beren · 24 Apr 2023 18:43 UTC
110 points
28 comments · 4 min read · LW link

Contra Yudkowsky on AI Doom

jacob_cannell · 24 Apr 2023 0:20 UTC
110 points
111 comments · 9 min read · LW link

How could you possibly choose what an AI wants?

So8res · 19 Apr 2023 17:08 UTC
105 points
19 comments · 1 min read · LW link

Should we publish mechanistic interpretability research?

21 Apr 2023 16:19 UTC
105 points
40 comments · 13 min read · LW link

Financial Times: We must slow down the race to God-like AI

trevor · 13 Apr 2023 19:55 UTC
103 points
17 comments · 16 min read · LW link
(www.ft.com)

Shapley Value Attribution in Chain of Thought

leogao · 14 Apr 2023 5:56 UTC
101 points
5 comments · 4 min read · LW link