What an actually pessimistic containment strategy looks like

lc · 5 Apr 2022 0:19 UTC
667 points
138 comments · 6 min read · LW link · 2 reviews

Lies Told To Children

Eliezer Yudkowsky · 14 Apr 2022 11:25 UTC
370 points
94 comments · 7 min read · LW link · 1 review

Accounting For College Costs

johnswentworth · 1 Apr 2022 17:28 UTC
357 points
41 comments · 7 min read · LW link

MIRI announces new “Death With Dignity” strategy

Eliezer Yudkowsky · 2 Apr 2022 0:43 UTC
341 points
543 comments · 18 min read · LW link · 1 review

Optimality is the tiger, and agents are its teeth

Veedrac · 2 Apr 2022 0:46 UTC
301 points
42 comments · 16 min read · LW link · 1 review

Don’t die with dignity; instead play to your outs

Jeffrey Ladish · 6 Apr 2022 7:53 UTC
270 points
59 comments · 5 min read · LW link

New Scaling Laws for Large Language Models

1a3orn · 1 Apr 2022 20:41 UTC
243 points
22 comments · 5 min read · LW link

A Quick Guide to Confronting Doom

Ruby · 13 Apr 2022 19:30 UTC
240 points
33 comments · 2 min read · LW link

Editing Advice for LessWrong Users

JustisMills · 11 Apr 2022 16:32 UTC
231 points
14 comments · 6 min read · LW link · 1 review

Replacing Karma with Good Heart Tokens (Worth $1!)

1 Apr 2022 9:31 UTC
224 points
173 comments · 4 min read · LW link

Moses and the Class Struggle

lsusr · 1 Apr 2022 11:55 UTC
214 points
26 comments · 5 min read · LW link

Call For Distillers

johnswentworth · 4 Apr 2022 18:25 UTC
205 points
43 comments · 3 min read · LW link · 1 review

A concrete bet offer to those with short AGI timelines

9 Apr 2022 21:41 UTC
198 points
116 comments · 5 min read · LW link

We Are Conjecture, A New Alignment Research Startup

Connor Leahy · 8 Apr 2022 11:40 UTC
197 points
25 comments · 4 min read · LW link

dalle2 comments

nostalgebraist · 26 Apr 2022 5:30 UTC
183 points
14 comments · 13 min read · LW link
(nostalgebraist.tumblr.com)

Playing with DALL·E 2

Dave Orr · 7 Apr 2022 18:49 UTC
165 points
118 comments · 6 min read · LW link

Everything I Need To Know About Takeoff Speeds I Learned From Air Conditioner Ratings On Amazon

johnswentworth · 15 Apr 2022 19:05 UTC
159 points
128 comments · 5 min read · LW link

Emotionally Confronting a Probably-Doomed World: Against Motivation Via Dignity Points

TurnTrout · 10 Apr 2022 18:45 UTC
151 points
7 comments · 9 min read · LW link

Slack gives you space to notice/reflect on subtle things

Raemon · 24 Apr 2022 2:30 UTC
147 points
18 comments · 1 min read · LW link

Refine: An Incubator for Conceptual Alignment Research Bets

adamShimi · 15 Apr 2022 8:57 UTC
144 points
13 comments · 4 min read · LW link

Takeoff speeds have a huge effect on what it means to work on AI x-risk

Buck · 13 Apr 2022 17:38 UTC
139 points
27 comments · 2 min read · LW link · 2 reviews

“Pivotal Act” Intentions: Negative Consequences and Fallacious Arguments

Andrew_Critch · 19 Apr 2022 20:25 UTC
138 points
55 comments · 7 min read · LW link · 1 review

Supervise Process, not Outcomes

5 Apr 2022 22:18 UTC
134 points
9 comments · 10 min read · LW link

Only Asking Real Questions

jefftk · 14 Apr 2022 15:50 UTC
128 points
45 comments · 2 min read · LW link
(www.jefftk.com)

High schoolers can apply to the Atlas Fellowship: $50k scholarship + summer program

sydney · 3 Apr 2022 0:53 UTC
122 points
18 comments · 2 min read · LW link

Greyed Out Options

ozymandias · 4 Apr 2022 20:43 UTC
122 points
12 comments · 5 min read · LW link · 1 review

Moloch and the sandpile catastrophe

Eric Raymond · 2 Apr 2022 15:35 UTC
120 points
25 comments · 3 min read · LW link

Convincing All Capability Researchers

Logan Riggs · 8 Apr 2022 17:40 UTC
120 points
70 comments · 3 min read · LW link

Are smart people’s personal experiences biased against general intelligence?

tailcalled · 21 Apr 2022 19:25 UTC
114 points
43 comments · 3 min read · LW link

Preregistration: Air Conditioner Test

johnswentworth · 21 Apr 2022 19:48 UTC
112 points
59 comments · 9 min read · LW link

Explaining the Twitter Postrat Scene

Jacob Falkovich · 5 Apr 2022 22:23 UTC
112 points
27 comments · 5 min read · LW link

[RETRACTED] It’s time for EA leadership to pull the short-timelines fire alarm.

Not Relevant · 8 Apr 2022 16:07 UTC
109 points
163 comments · 4 min read · LW link

A broad basin of attraction around human values?

Wei Dai · 12 Apr 2022 5:15 UTC
109 points
17 comments · 2 min read · LW link

Google’s new 540 billion parameter language model

Matthew Barnett · 4 Apr 2022 17:49 UTC
108 points
81 comments · 1 min read · LW link
(storage.googleapis.com)

Ideal governance (for companies, countries and more)

HoldenKarnofsky · 5 Apr 2022 18:30 UTC
108 points
63 comments · 14 min read · LW link
(www.cold-takes.com)

Book review: Very Important People

Richard_Ngo · 2 Apr 2022 19:00 UTC
107 points
18 comments · 3 min read · LW link
(thinkingcomplete.blogspot.com)

Testing PaLM prompts on GPT3

Yitz · 6 Apr 2022 5:21 UTC
103 points
14 comments · 8 min read · LW link

Intuitions about solving hard problems

Richard_Ngo · 25 Apr 2022 15:29 UTC
102 points
23 comments · 6 min read · LW link

Giving calibrated time estimates can have social costs

Alex_Altair · 3 Apr 2022 21:23 UTC
99 points
16 comments · 5 min read · LW link

DeepMind: The Podcast—Excerpts on AGI

WilliamKiely · 7 Apr 2022 22:09 UTC
99 points
11 comments · 5 min read · LW link

Clem’s Memo

abstractapplic · 16 Apr 2022 11:59 UTC
98 points
8 comments · 3 min read · LW link

Good Heart Week: Extending the Experiment

Ben Pace · 2 Apr 2022 7:13 UTC
97 points
92 comments · 3 min read · LW link

Ukraine Post #9: Again

Zvi · 5 Apr 2022 19:40 UTC
97 points
37 comments · 16 min read · LW link
(thezvi.wordpress.com)

What Would A Fight Between Humanity And AGI Look Like?

johnswentworth · 5 Apr 2022 20:03 UTC
97 points
20 comments · 3 min read · LW link

Productive Mistakes, Not Perfect Answers

adamShimi · 7 Apr 2022 16:41 UTC
97 points
11 comments · 6 min read · LW link

The case for Doing Something Else (if Alignment is doomed)

Rafael Harth · 5 Apr 2022 17:52 UTC
93 points
14 comments · 2 min read · LW link

Anti-Corruption Market

lsusr · 1 Apr 2022 12:57 UTC
93 points
23 comments · 2 min read · LW link

[Question] Convince me that humanity is as doomed by AGI as Yudkowsky et al., seems to believe

Yitz · 10 Apr 2022 21:02 UTC
92 points
141 comments · 2 min read · LW link

Code Generation as an AI risk setting

Not Relevant · 17 Apr 2022 22:27 UTC
91 points
16 comments · 2 min read · LW link

Working Out in VR Really Works

Yonatan Cale · 3 Apr 2022 18:42 UTC
90 points
28 comments · 3 min read · LW link