AGI Ruin: A List of Lethalities

Eliezer Yudkowsky5 Jun 2022 22:05 UTC
902 points
690 comments30 min readLW link3 reviews

Where I agree and dis­agree with Eliezer

paulfchristiano19 Jun 2022 19:15 UTC
881 points
219 comments18 min readLW link2 reviews

Eight Short Stud­ies On Excuses

Scott Alexander20 Apr 2010 23:01 UTC
808 points
251 comments10 min readLW link

Preface

Eliezer Yudkowsky11 Mar 2015 19:00 UTC
750 points
14 comments4 min readLW link

The Best Text­books on Every Subject

lukeprog16 Jan 2011 8:30 UTC
724 points
410 comments7 min readLW link

What an ac­tu­ally pes­simistic con­tain­ment strat­egy looks like

lc5 Apr 2022 0:19 UTC
670 points
138 comments6 min readLW link2 reviews

SolidGoldMag­ikarp (plus, prompt gen­er­a­tion)

5 Feb 2023 22:02 UTC
669 points
205 comments12 min readLW link

The Waluigi Effect (mega-post)

Cleo Nardo3 Mar 2023 3:22 UTC
624 points
188 comments16 min readLW link

Simulators

janus2 Sep 2022 12:45 UTC
599 points
161 comments41 min readLW link8 reviews
(generative.ink)

Ra­tion­al­ism be­fore the Sequences

Eric Raymond30 Mar 2021 14:04 UTC
582 points
81 comments11 min readLW link2 reviews

Schel­ling fences on slip­pery slopes

Scott Alexander16 Mar 2012 23:44 UTC
580 points
250 comments6 min readLW link

Mak­ing Vaccine

johnswentworth3 Feb 2021 20:24 UTC
574 points
249 comments6 min readLW link3 reviews

LessWrong’s (first) album: I Have Been A Good Bing

1 Apr 2024 7:33 UTC
548 points
172 comments11 min readLW link

Let’s think about slow­ing down AI

KatjaGrace22 Dec 2022 17:40 UTC
546 points
182 comments38 min readLW link3 reviews
(aiimpacts.org)

Hu­mans are not au­to­mat­i­cally strategic

AnnaSalamon8 Sep 2010 7:02 UTC
532 points
277 comments4 min readLW link

Pain is not the unit of Effort

alkjash24 Nov 2020 20:00 UTC
525 points
89 comments5 min readLW link2 reviews
(radimentary.wordpress.com)

Diseased think­ing: dis­solv­ing ques­tions about disease

Scott Alexander30 May 2010 21:16 UTC
519 points
357 comments9 min readLW link

The Redac­tion Machine

Ben20 Sep 2022 22:03 UTC
495 points
46 comments27 min readLW link1 review

Rea­son as memetic im­mune disorder

PhilGoetz19 Sep 2009 21:05 UTC
489 points
184 comments5 min readLW link

The Talk: a brief ex­pla­na­tion of sex­ual dimorphism

Malmesbury18 Sep 2023 16:23 UTC
484 points
72 comments16 min readLW link

Luck based medicine: my re­sent­ful story of be­com­ing a med­i­cal miracle

Elizabeth16 Oct 2022 17:40 UTC
480 points
120 comments12 min readLW link3 reviews
(acesounderglass.com)

What 2026 looks like

Daniel Kokotajlo6 Aug 2021 16:14 UTC
477 points
153 comments16 min readLW link1 review

Los­ing the root for the tree

Adam Zerner20 Sep 2022 4:53 UTC
468 points
30 comments9 min readLW link1 review

How much do you be­lieve your re­sults?

Eric Neyman6 May 2023 20:31 UTC
463 points
14 comments15 min readLW link
(ericneyman.wordpress.com)

Mak­ing Beliefs Pay Rent (in An­ti­ci­pated Ex­pe­riences)

Eliezer Yudkowsky28 Jul 2007 22:59 UTC
443 points
266 comments4 min readLW link

Counter-the­ses on Sleep

Natália21 Mar 2022 23:21 UTC
442 points
131 comments15 min readLW link1 review

It’s Prob­a­bly Not Lithium

Natália28 Jun 2022 21:24 UTC
442 points
186 comments28 min readLW link1 review

How To Write Quickly While Main­tain­ing Epistemic Rigor

johnswentworth28 Aug 2021 17:52 UTC
435 points
38 comments4 min readLW link3 reviews

100 Tips for a Bet­ter Life

Ideopunk22 Dec 2020 14:30 UTC
433 points
130 comments9 min readLW link1 review

Gen­er­al­iz­ing From One Example

Scott Alexander28 Apr 2009 22:00 UTC
432 points
422 comments6 min readLW link

Steer­ing GPT-2-XL by adding an ac­ti­va­tion vector

13 May 2023 18:42 UTC
426 points
97 comments50 min readLW link

Fo­cus on the places where you feel shocked ev­ery­one’s drop­ping the ball

So8res2 Feb 2023 0:27 UTC
421 points
61 comments4 min readLW link

Sig­nifi­cantly En­hanc­ing Adult In­tel­li­gence With Gene Edit­ing May Be Possible

12 Dec 2023 18:14 UTC
421 points
163 comments33 min readLW link

chin­chilla’s wild implications

nostalgebraist31 Jul 2022 1:18 UTC
418 points
128 comments10 min readLW link1 review

The ants and the grasshopper

Richard_Ngo4 Jun 2023 22:00 UTC
418 points
35 comments5 min readLW link
(www.narrativeark.xyz)

Dou­glas Hofs­tadter changes his mind on Deep Learn­ing & AI risk (June 2023)?

gwern3 Jul 2023 0:48 UTC
417 points
54 comments7 min readLW link
(www.youtube.com)

Bets, Bonds, and Kindergarteners

jefftk3 Jan 2021 21:20 UTC
415 points
35 comments2 min readLW link1 review
(www.jefftk.com)

Be­ing the (Pareto) Best in the World

johnswentworth24 Jun 2019 18:36 UTC
412 points
57 comments3 min readLW link3 reviews

The non­cen­tral fal­lacy—the worst ar­gu­ment in the world?

Scott Alexander27 Aug 2012 3:36 UTC
412 points
1,768 comments7 min readLW link

(My un­der­stand­ing of) What Every­one in Tech­ni­cal Align­ment is Do­ing and Why

29 Aug 2022 1:23 UTC
412 points
90 comments38 min readLW link1 review

Wel­come to LessWrong!

14 Jun 2019 19:42 UTC
410 points
50 comments2 min readLW link

What failure looks like

paulfchristiano17 Mar 2019 20:18 UTC
409 points
54 comments8 min readLW link2 reviews

It Looks Like You’re Try­ing To Take Over The World

gwern9 Mar 2022 16:35 UTC
405 points
120 comments1 min readLW link1 review
(www.gwern.net)

Trans­form­ers Rep­re­sent Belief State Geom­e­try in their Resi­d­ual Stream

Adam Shai16 Apr 2024 21:16 UTC
397 points
100 comments12 min readLW link

Bing Chat is blatantly, ag­gres­sively misaligned

evhub15 Feb 2023 5:29 UTC
396 points
170 comments2 min readLW link

Dy­ing Outside

HalFinney5 Oct 2009 2:45 UTC
393 points
91 comments2 min readLW link

I would have shit in that alley, too

Declan Molony18 Jun 2024 4:41 UTC
392 points
122 comments4 min readLW link

Things I Learned by Spend­ing Five Thou­sand Hours In Non-EA Charities

jenn1 Jun 2023 20:48 UTC
390 points
34 comments8 min readLW link
(jenn.site)

Deep­Mind al­ign­ment team opinions on AGI ruin arguments

Vika12 Aug 2022 21:06 UTC
389 points
37 comments14 min readLW link1 review

GPTs are Pre­dic­tors, not Imitators

Eliezer Yudkowsky8 Apr 2023 19:59 UTC
387 points
90 comments3 min readLW link