Deferring

owencb12 May 2022 23:56 UTC
18 points
2 comments11 min readLW link

RLHF

Ansh Radhakrishnan12 May 2022 21:18 UTC
18 points
5 comments5 min readLW link

[Question] What to do when start­ing a busi­ness in an im­mi­nent-AGI world?

ryan_b12 May 2022 21:07 UTC
25 points
7 comments1 min readLW link

In­ter­pretabil­ity’s Align­ment-Solv­ing Po­ten­tial: Anal­y­sis of 7 Scenarios

Evan R. Murphy12 May 2022 20:01 UTC
58 points
0 comments59 min readLW link

In­tro­duc­tion to the se­quence: In­ter­pretabil­ity Re­search for the Most Im­por­tant Century

Evan R. Murphy12 May 2022 19:59 UTC
16 points
0 comments8 min readLW link

A ten­ta­tive di­alogue with a Friendly-boxed-su­per-AGI on brain uploads

Ramiro P.12 May 2022 19:40 UTC
1 point
12 comments4 min readLW link

The Last Paperclip

Logan Zoellner12 May 2022 19:25 UTC
63 points
15 comments18 min readLW link

Deep­mind’s Gato: Gen­er­al­ist Agent

Daniel Kokotajlo12 May 2022 16:01 UTC
165 points
62 comments1 min readLW link

“A Gen­er­al­ist Agent”: New Deep­Mind Publication

1a3orn12 May 2022 15:30 UTC
79 points
43 comments1 min readLW link

Covid 5/​12/​22: Other Priorities

Zvi12 May 2022 13:30 UTC
31 points
4 comments15 min readLW link
(thezvi.wordpress.com)

[Question] How would pub­lic me­dia out­lets need to be gov­erned to cover all poli­ti­cal views?

ChristianKl12 May 2022 12:55 UTC
13 points
14 comments1 min readLW link

[Question] What’s keep­ing con­cerned ca­pa­bil­ities gain re­searchers from leav­ing the field?

sovran12 May 2022 12:16 UTC
19 points
4 comments1 min readLW link

Pos­i­tive out­comes un­der an un­al­igned AGI takeover

Yitz12 May 2022 7:45 UTC
19 points
10 comments3 min readLW link

[Question] What are your recom­men­da­tions for tech­ni­cal AI al­ign­ment pod­casts?

Evan_Gaensbauer11 May 2022 21:52 UTC
5 points
4 comments1 min readLW link

Grace­fully cor­rect­ing un­cal­ibrated shame

AF202211 May 2022 19:51 UTC
−31 points
34 comments4 min readLW link

[In­tro to brain-like-AGI safety] 14. Con­trol­led AGI

Steven Byrnes11 May 2022 13:17 UTC
45 points
25 comments20 min readLW link

Pro­jec­tLawful.com: Eliezer’s lat­est story, past 1M words

Eliezer Yudkowsky11 May 2022 6:18 UTC
238 points
112 comments1 min readLW link4 reviews

An In­side View of AI Alignment

Ansh Radhakrishnan11 May 2022 2:16 UTC
32 points
2 comments2 min readLW link

Fight­ing in var­i­ous places for a re­ally long time

KatjaGrace11 May 2022 1:50 UTC
36 points
12 comments4 min readLW link
(worldspiritsockpuppet.com)

Stuff I might do if I had covid

KatjaGrace11 May 2022 0:00 UTC
39 points
9 comments1 min readLW link
(worldspiritsockpuppet.com)

Crises Don’t Need Your Software

GabrielExists10 May 2022 21:06 UTC
59 points
18 comments6 min readLW link

Ceiling Fan Air Filter

jefftk10 May 2022 14:20 UTC
18 points
9 comments1 min readLW link
(www.jefftk.com)

The limits of AI safety via debate

Marius Hobbhahn10 May 2022 13:33 UTC
36 points
8 comments10 min readLW link

Ex­am­in­ing Arm­strong’s cat­e­gory of gen­er­al­ized models

Morgan_Rogers10 May 2022 9:07 UTC
14 points
0 comments7 min readLW link

Dath Ilani Rule of Law

David Udell10 May 2022 6:17 UTC
24 points
25 comments4 min readLW link

AI safety should be made more ac­cessible us­ing non text-based media

Massimog10 May 2022 3:14 UTC
2 points
4 comments4 min readLW link

LessWrong Now Has Dark Mode

jimrandomh10 May 2022 1:21 UTC
140 points
31 comments1 min readLW link

Con­di­tions for math­e­mat­i­cal equiv­alence of Stochas­tic Gra­di­ent Des­cent and Nat­u­ral Selection

Oliver Sourbut9 May 2022 21:38 UTC
70 points
19 comments8 min readLW link1 review
(www.oliversourbut.net)

AI Align­ment YouTube Playlists

9 May 2022 21:33 UTC
31 points
4 comments1 min readLW link

When is AI safety re­search harm­ful?

NathanBarnard9 May 2022 18:19 UTC
2 points
0 comments8 min readLW link

A Bird’s Eye View of the ML Field [Prag­matic AI Safety #2]

9 May 2022 17:18 UTC
165 points
8 comments35 min readLW link

In­tro­duc­tion to Prag­matic AI Safety [Prag­matic AI Safety #1]

9 May 2022 17:06 UTC
80 points
3 comments6 min readLW link

Jobs: Help scale up LM al­ign­ment re­search at NYU

Sam Bowman9 May 2022 14:12 UTC
60 points
1 comment1 min readLW link

Micro­phone on Elec­tric Mandolin

jefftk9 May 2022 14:00 UTC
16 points
0 comments1 min readLW link
(www.jefftk.com)

[Question] Thought ex­per­i­ment: Imag­ine you were as­signed to help a ran­dom per­son in your com­mu­nity be­come as peace­ful and joyful as the most peace­ful and joyful per­son you’d ever met. What would you try?

nonzerosum9 May 2022 13:53 UTC
5 points
5 comments1 min readLW link

[Question] Willing to be your mu­sic men­tor in ex­change for video edit­ing mentorship

monkymind9 May 2022 11:57 UTC
8 points
0 comments1 min readLW link

Up­dat­ing Utility Functions

9 May 2022 9:44 UTC
42 points
6 comments8 min readLW link

Tran­scripts of in­ter­views with AI researchers

Vael Gates9 May 2022 5:57 UTC
170 points
9 comments2 min readLW link

[Scrib­ble] Bad Rea­sons Be­hind Differ­ent Sys­tems and a Story with No Good Moral

Rana Dexsin9 May 2022 5:21 UTC
9 points
0 comments5 min readLW link

[Question] What is the best day to cel­e­brate Smal­lpox Erad­i­ca­tion Day?

Orborde9 May 2022 4:02 UTC
7 points
6 comments1 min readLW link

A rea­son be­hind bad sys­tems, and moral im­pli­ca­tions of see­ing this reason

Edward Pascal9 May 2022 3:16 UTC
4 points
12 comments2 min readLW link

An Alter­na­tive In­ter­pre­ta­tion of Physics

dadadarren9 May 2022 0:52 UTC
19 points
10 comments5 min readLW link
(www.sleepingbeautyproblem.com)

Ion Im­plan­ta­tion: The­ory, Equip­ment, Pro­cess, Alternatives

nomagicpill8 May 2022 22:30 UTC
6 points
0 comments16 min readLW link
(210ethan.github.io)

[Question] Long COVID risk: How to main­tain an up to date risk as­sess­ment so we can go back to nor­mal life?

Sameerishere8 May 2022 19:56 UTC
19 points
34 comments1 min readLW link

De­mon­strat­ing MWI by in­terfer­ing hu­man simulations

Yair Halberstadt8 May 2022 17:28 UTC
12 points
25 comments2 min readLW link

Notes from a con­ver­sa­tion with Ing. Agr. Adri­ana Balzarini

Pablo Repetto8 May 2022 15:56 UTC
5 points
0 comments2 min readLW link
(pabloernesto.github.io)

Ele­men­tary In­fra-Bayesianism

Jan8 May 2022 12:23 UTC
41 points
3 comments7 min readLW link
(universalprior.substack.com)

Cam­bridge LW Meetup: Books That Change

8 May 2022 5:23 UTC
5 points
0 comments1 min readLW link

Video and Tran­script of Pre­sen­ta­tion on Ex­is­ten­tial Risk from Power-Seek­ing AI

Joe Carlsmith8 May 2022 3:50 UTC
20 points
1 comment29 min readLW link

[Question] Al­gorith­mic for­mal­iza­tion of FDT?

Shmi8 May 2022 1:36 UTC
12 points
8 comments1 min readLW link