AGI Ruin: A List of Lethalities

Eliezer Yudkowsky5 Jun 2022 22:05 UTC
897 points
690 comments30 min readLW link3 reviews

Where I agree and dis­agree with Eliezer

paulfchristiano19 Jun 2022 19:15 UTC
874 points
219 comments18 min readLW link2 reviews

Eight Short Stud­ies On Excuses

Scott Alexander20 Apr 2010 23:01 UTC
795 points
250 comments10 min readLW link

Preface

Eliezer Yudkowsky11 Mar 2015 19:00 UTC
732 points
14 comments4 min readLW link

The Best Text­books on Every Subject

lukeprog16 Jan 2011 8:30 UTC
709 points
409 comments7 min readLW link

What an ac­tu­ally pes­simistic con­tain­ment strat­egy looks like

lc5 Apr 2022 0:19 UTC
667 points
138 comments6 min readLW link2 reviews

SolidGoldMag­ikarp (plus, prompt gen­er­a­tion)

5 Feb 2023 22:02 UTC
663 points
204 comments12 min readLW link

The Waluigi Effect (mega-post)

Cleo Nardo3 Mar 2023 3:22 UTC
618 points
188 comments16 min readLW link

Simulators

janus2 Sep 2022 12:45 UTC
594 points
161 comments41 min readLW link8 reviews
(generative.ink)

Ra­tion­al­ism be­fore the Sequences

Eric Raymond30 Mar 2021 14:04 UTC
581 points
81 comments11 min readLW link2 reviews

Mak­ing Vaccine

johnswentworth3 Feb 2021 20:24 UTC
574 points
249 comments6 min readLW link3 reviews

Schel­ling fences on slip­pery slopes

Scott Alexander16 Mar 2012 23:44 UTC
570 points
250 comments6 min readLW link

Let’s think about slow­ing down AI

KatjaGrace22 Dec 2022 17:40 UTC
543 points
183 comments38 min readLW link3 reviews
(aiimpacts.org)

Hu­mans are not au­to­mat­i­cally strategic

AnnaSalamon8 Sep 2010 7:02 UTC
521 points
277 comments4 min readLW link

LessWrong’s (first) album: I Have Been A Good Bing

1 Apr 2024 7:33 UTC
517 points
156 comments11 min readLW link

Pain is not the unit of Effort

alkjash24 Nov 2020 20:00 UTC
517 points
89 comments5 min readLW link2 reviews
(radimentary.wordpress.com)

Diseased think­ing: dis­solv­ing ques­tions about disease

Scott Alexander30 May 2010 21:16 UTC
516 points
357 comments9 min readLW link

The Redac­tion Machine

Ben20 Sep 2022 22:03 UTC
495 points
46 comments27 min readLW link1 review

Rea­son as memetic im­mune disorder

PhilGoetz19 Sep 2009 21:05 UTC
484 points
184 comments5 min readLW link

The Talk: a brief ex­pla­na­tion of sex­ual dimorphism

Malmesbury18 Sep 2023 16:23 UTC
481 points
72 comments16 min readLW link

Luck based medicine: my re­sent­ful story of be­com­ing a med­i­cal miracle

Elizabeth16 Oct 2022 17:40 UTC
480 points
119 comments12 min readLW link3 reviews
(acesounderglass.com)

What 2026 looks like

Daniel Kokotajlo6 Aug 2021 16:14 UTC
473 points
150 comments16 min readLW link1 review

Los­ing the root for the tree

Adam Zerner20 Sep 2022 4:53 UTC
466 points
30 comments9 min readLW link1 review

How much do you be­lieve your re­sults?

Eric Neyman6 May 2023 20:31 UTC
459 points
14 comments15 min readLW link
(ericneyman.wordpress.com)

It’s Prob­a­bly Not Lithium

Natália28 Jun 2022 21:24 UTC
442 points
186 comments28 min readLW link1 review

Counter-the­ses on Sleep

Natália21 Mar 2022 23:21 UTC
441 points
131 comments15 min readLW link1 review

How To Write Quickly While Main­tain­ing Epistemic Rigor

johnswentworth28 Aug 2021 17:52 UTC
430 points
38 comments4 min readLW link3 reviews

Gen­er­al­iz­ing From One Example

Scott Alexander28 Apr 2009 22:00 UTC
430 points
422 comments6 min readLW link

100 Tips for a Bet­ter Life

Ideopunk22 Dec 2020 14:30 UTC
426 points
130 comments9 min readLW link1 review

Mak­ing Beliefs Pay Rent (in An­ti­ci­pated Ex­pe­riences)

Eliezer Yudkowsky28 Jul 2007 22:59 UTC
425 points
266 comments4 min readLW link

Steer­ing GPT-2-XL by adding an ac­ti­va­tion vector

13 May 2023 18:42 UTC
418 points
97 comments50 min readLW link

The ants and the grasshopper

Richard_Ngo4 Jun 2023 22:00 UTC
416 points
35 comments5 min readLW link
(www.narrativeark.xyz)

Bets, Bonds, and Kindergarteners

jefftk3 Jan 2021 21:20 UTC
414 points
35 comments2 min readLW link1 review
(www.jefftk.com)

Fo­cus on the places where you feel shocked ev­ery­one’s drop­ping the ball

So8res2 Feb 2023 0:27 UTC
413 points
61 comments4 min readLW link

(My un­der­stand­ing of) What Every­one in Tech­ni­cal Align­ment is Do­ing and Why

29 Aug 2022 1:23 UTC
412 points
89 comments38 min readLW link1 review

chin­chilla’s wild implications

nostalgebraist31 Jul 2022 1:18 UTC
410 points
128 comments10 min readLW link1 review

Dou­glas Hofs­tadter changes his mind on Deep Learn­ing & AI risk (June 2023)?

gwern3 Jul 2023 0:48 UTC
410 points
54 comments7 min readLW link
(www.youtube.com)

The non­cen­tral fal­lacy—the worst ar­gu­ment in the world?

Scott Alexander27 Aug 2012 3:36 UTC
408 points
1,768 comments7 min readLW link

Sig­nifi­cantly En­hanc­ing Adult In­tel­li­gence With Gene Edit­ing May Be Possible

12 Dec 2023 18:14 UTC
404 points
162 comments33 min readLW link

Be­ing the (Pareto) Best in the World

johnswentworth24 Jun 2019 18:36 UTC
404 points
57 comments3 min readLW link3 reviews

It Looks Like You’re Try­ing To Take Over The World

gwern9 Mar 2022 16:35 UTC
404 points
120 comments1 min readLW link1 review
(www.gwern.net)

What failure looks like

paulfchristiano17 Mar 2019 20:18 UTC
401 points
54 comments8 min readLW link2 reviews

Wel­come to LessWrong!

14 Jun 2019 19:42 UTC
401 points
48 comments2 min readLW link

Bing Chat is blatantly, ag­gres­sively misaligned

evhub15 Feb 2023 5:29 UTC
396 points
170 comments2 min readLW link

Things I Learned by Spend­ing Five Thou­sand Hours In Non-EA Charities

jenn1 Jun 2023 20:48 UTC
387 points
34 comments8 min readLW link
(jenn.site)

Reflec­tions on six months of fatherhood

jasoncrawford31 Jan 2022 5:28 UTC
385 points
24 comments4 min readLW link1 review
(jasoncrawford.org)

Dy­ing Outside

HalFinney5 Oct 2009 2:45 UTC
385 points
91 comments2 min readLW link

Deep­Mind al­ign­ment team opinions on AGI ruin arguments

Vika12 Aug 2022 21:06 UTC
376 points
37 comments14 min readLW link1 review

GPTs are Pre­dic­tors, not Imitators

Eliezer Yudkowsky8 Apr 2023 19:59 UTC
375 points
90 comments3 min readLW link

Ugh fields

Roko12 Apr 2010 17:06 UTC
374 points
80 comments3 min readLW link