Lying to Save Humanity

cebsuvx 14 Nov 2022 23:04 UTC
−1 points
4 comments 1 min read LW link

Moral contagion heuristic

Mvolz 14 Nov 2022 21:17 UTC
14 points
3 comments 2 min read LW link

Will we run out of ML data? Evidence from projecting dataset size trends

Pablo Villalobos 14 Nov 2022 16:42 UTC
75 points
12 comments 2 min read LW link
(epochai.org)

I (with the help of a few more people) am planning to create an introduction to AI Safety that a smart teenager can understand. What am I missing?

Tapatakt 14 Nov 2022 16:12 UTC
3 points
5 comments 1 min read LW link

Two New Newcomb Variants

eva_ 14 Nov 2022 14:01 UTC
26 points
22 comments 3 min read LW link

Improving Emergency Vehicle Utilization

jefftk 14 Nov 2022 14:00 UTC
15 points
10 comments 1 min read LW link
(www.jefftk.com)

X-risk Mitigation Does Actually Require Longtermism

DragonGod 14 Nov 2022 12:54 UTC
6 points
1 comment 1 min read LW link

[Question] Why don’t we have self driving cars yet?

Linda Linsefors 14 Nov 2022 12:19 UTC
22 points
16 comments 1 min read LW link

Eigenvalues for Distance from The Buddhist Precepts And The Ten Commandments

benjamin.j.campbell 14 Nov 2022 5:50 UTC
−3 points
2 comments 1 min read LW link

AI Safety Microgrant Round

Chris_Leong 14 Nov 2022 4:25 UTC
22 points
1 comment 1 min read LW link

Estimating the probability that FTX Future Fund grant money gets clawed back

spencerg 14 Nov 2022 3:33 UTC
28 points
6 comments 1 min read LW link

Rational overconfidence in the tens of billions: recent example

banev 13 Nov 2022 22:48 UTC
−20 points
3 comments 2 min read LW link

In Defence of Temporal Discounting in Longtermist Ethics

DragonGod 13 Nov 2022 21:54 UTC
23 points
4 comments 1 min read LW link

Announcing Nonlinear Emergency Funding

KatWoods 13 Nov 2022 19:02 UTC
54 points
0 comments 1 min read LW link

The Alignment Community Is Culturally Broken

sudo 13 Nov 2022 18:53 UTC
136 points
68 comments 2 min read LW link

The Futility of Status and Signalling

Ape in the coat 13 Nov 2022 17:14 UTC
19 points
4 comments 3 min read LW link

A short critique of Vanessa Kosoy’s PreDCA

Martín Soto 13 Nov 2022 16:00 UTC
27 points
8 comments 4 min read LW link

What’s the Alternative to Independence?

jefftk 13 Nov 2022 15:30 UTC
50 points
3 comments 1 min read LW link
(www.jefftk.com)

Decision making under model ambiguity, moral uncertainty, and other agents with free will?

Jobst Heitzig 13 Nov 2022 12:50 UTC
4 points
0 comments 1 min read LW link
(forum.effectivealtruism.org)

The sky is not blue (pardon the obviousness)

banev 13 Nov 2022 10:49 UTC
−13 points
6 comments 1 min read LW link

Characterizing Intrinsic Compositionality in Transformers with Tree Projections

Ulisse Mini 13 Nov 2022 9:46 UTC
12 points
2 comments 1 min read LW link
(arxiv.org)

Noting an unsubstantiated communal belief about the FTX disaster

Yitz 13 Nov 2022 5:37 UTC
50 points
52 comments 1 min read LW link

Solstice 2022 Roundup

dspeyer 12 Nov 2022 21:26 UTC
34 points
12 comments 1 min read LW link

Women and Effective Altruism

P. G. Keerthana Gopalakrishnan 12 Nov 2022 20:57 UTC
−29 points
15 comments 2 min read LW link
(keerthanapg.com)

A Poem for S.B.F.

AnthonyRepetto 12 Nov 2022 20:41 UTC
−30 points
21 comments 1 min read LW link

Musings on the appropriate targets for standards

tailcalled 12 Nov 2022 20:19 UTC
11 points
13 comments 1 min read LW link

Ways to buy time

12 Nov 2022 19:31 UTC
34 points
23 comments 12 min read LW link

[Question] How do newcomers delve deeper into the community?

Lord Dreadwar 12 Nov 2022 19:00 UTC
7 points
2 comments 1 min read LW link

fully aligned singleton as a solution to everything

Tamsin Leake 12 Nov 2022 18:19 UTC
6 points
2 comments 2 min read LW link
(carado.moe)

User-Controlled Algorithmic Feeds

jefftk 12 Nov 2022 15:20 UTC
35 points
7 comments 2 min read LW link
(www.jefftk.com)

Vanessa Kosoy’s PreDCA, distilled

Martín Soto 12 Nov 2022 11:38 UTC
17 points
19 comments 5 min read LW link

Poster Session on AI Safety

Neil Crawford 12 Nov 2022 3:50 UTC
7 points
6 comments 1 min read LW link

Is AI Gain-of-Function research a thing?

MadHatter 12 Nov 2022 2:33 UTC
9 points
2 comments 2 min read LW link

Why don’t organizations have a CREAMO?

shminux 12 Nov 2022 2:19 UTC
0 points
8 comments 1 min read LW link

“Rudeness”, a useful coordination mechanic

Raemon 11 Nov 2022 22:27 UTC
49 points
20 comments 2 min read LW link

Internalizing the damage of bad-acting partners creates incentives for due diligence

tailcalled 11 Nov 2022 20:57 UTC
17 points
7 comments 1 min read LW link

Speculation on Current Opportunities for Unusually High Impact in Global Health

johnswentworth 11 Nov 2022 20:47 UTC
114 points
31 comments 4 min read LW link

[Question] Is acausal extortion possible?

sisyphus 11 Nov 2022 19:48 UTC
−20 points
36 comments 3 min read LW link

Catharsis in Bb

jefftk 11 Nov 2022 17:40 UTC
6 points
0 comments 1 min read LW link
(www.jefftk.com)

Instrumental convergence is what makes general intelligence possible

tailcalled 11 Nov 2022 16:38 UTC
97 points
11 comments 4 min read LW link

Weekly Roundup #5

Zvi 11 Nov 2022 16:20 UTC
33 points
0 comments 6 min read LW link
(thezvi.wordpress.com)

Charging for the Dharma

jchan 11 Nov 2022 14:02 UTC
32 points
18 comments 5 min read LW link

[Question] EA (& AI Safety) has overestimated its projected funding — which decisions must be revised?

Cleo Nardo 11 Nov 2022 13:50 UTC
22 points
7 comments 1 min read LW link
(forum.effectivealtruism.org)

Where the logical fallacy is not (Generalization From Fictional Evidence)

banev 11 Nov 2022 10:41 UTC
−12 points
14 comments 1 min read LW link

Why I’m Working On Model Agnostic Interpretability

Jessica Rumbelow 11 Nov 2022 9:24 UTC
26 points
9 comments 2 min read LW link

How likely are malign priors over objectives? [aborted WIP]

David Johnston 11 Nov 2022 5:36 UTC
−1 points
0 comments 8 min read LW link

Do Timeless Decision Theorists reject all blackmail from other Timeless Decision Theorists?

myren 11 Nov 2022 0:38 UTC
7 points
8 comments 3 min read LW link

We must be very clear: fraud in the service of effective altruism is unacceptable

evhub 10 Nov 2022 23:31 UTC
42 points
56 comments 1 min read LW link

[simulation] 4chan user claiming to be the attorney hired by Google’s sentient chatbot LaMDA shares wild details of encounter

janus 10 Nov 2022 21:39 UTC
19 points
1 comment 13 min read LW link
(generative.ink)

divine carrot

Alok Singh 10 Nov 2022 20:50 UTC
18 points
2 comments 1 min read LW link
(alok.github.io)