Latacora might be of interest to some AI Safety organizations

NunoSempere
Nov 25, 2021, 11:57 PM
14 points
10 comments
1 min read
LW link
(www.latacora.com)

Christiano, Cotra, and Yudkowsky on AI progress

Nov 25, 2021, 4:45 PM
119 points
95 comments
66 min read
LW link

Covid 11/25: Another Thanksgiving

Zvi
Nov 25, 2021, 1:40 PM
73 points
9 comments
21 min read
LW link
(thezvi.wordpress.com)

Coordinating the Unequal Treaties

lsusr
Nov 25, 2021, 10:47 AM
34 points
4 comments
2 min read
LW link

First Strike and Second Strike

lsusr
Nov 25, 2021, 9:23 AM
28 points
5 comments
1 min read
LW link

You are way more fallible than you think

Shmi
Nov 25, 2021, 5:52 AM
4 points
14 comments
2 min read
LW link

[Linkpost] Danger of motivatiogenesis in interdisciplinary work

particlemania
Nov 25, 2021, 12:13 AM
9 points
0 comments
1 min read
LW link

Meetup for The Roots of Progress in San Diego, Dec 1

jasoncrawford
Nov 24, 2021, 10:50 PM
7 points
0 comments
1 min read
LW link
(rootsofprogress.org)

Base Rates and Reference Classes

jsteinhardt
Nov 24, 2021, 10:30 PM
20 points
7 comments
5 min read
LW link
(bounded-regret.ghost.io)

Why do you need the story?

George3d6
Nov 24, 2021, 8:26 PM
52 points
11 comments
5 min read
LW link
(cerebralab.com)

[AN #169]: Collaborating with humans without human data

Rohin Shah
Nov 24, 2021, 6:30 PM
33 points
0 comments
8 min read
LW link
(mailchi.mp)

Paxlovid Remains Illegal: 11/24 Update

Zvi
Nov 24, 2021, 1:40 PM
54 points
21 comments
7 min read
LW link
(thezvi.wordpress.com)

HIRING: Inform and shape a new project on AI safety at Partnership on AI

Madhulika Srikumar
Nov 24, 2021, 8:27 AM
6 points
0 comments
1 min read
LW link

[Question] How much Bayesian evidence from rapid antigen and PCR tests?

mingyuan
Nov 24, 2021, 6:54 AM
8 points
4 comments
1 min read
LW link

French long COVID study: Belief vs Infection

Bucky
Nov 23, 2021, 11:14 PM
40 points
11 comments
5 min read
LW link

[Question] Cornell Meetup

Lionel Levine
Nov 23, 2021, 9:28 PM
6 points
4 comments
1 min read
LW link

AI Tracker: monitoring current and near-future risks from superscale models

Nov 23, 2021, 7:16 PM
67 points
13 comments
3 min read
LW link
(aitracker.org)

Laplace’s rule of succession

Ege Erdil
Nov 23, 2021, 3:48 PM
52 points
2 comments
7 min read
LW link

AI Safety Needs Great Engineers

Andy Jones
Nov 23, 2021, 3:40 PM
90 points
43 comments
4 min read
LW link

Slightly advanced decision theory 102: Four reasons not to be a (naive) utility maximizer

Jan
Nov 23, 2021, 11:02 AM
10 points
1 comment
15 min read
LW link
(universalprior.substack.com)

Use Tools For What They’re For

DirectedEvolution
Nov 23, 2021, 8:26 AM
28 points
14 comments
8 min read
LW link

[linkpost] Acquisition of Chess Knowledge in AlphaZero

Quintin Pope
Nov 23, 2021, 7:55 AM
8 points
1 comment
1 min read
LW link

[linkpost] Why Going to the Doctor Sucks (WaitButWhy)

mike_hawke
Nov 23, 2021, 3:02 AM
5 points
11 comments
1 min read
LW link
(waitbutwhy.com)

Integrating Three Models of (Human) Cognition

jbkjr
Nov 23, 2021, 1:06 AM
40 points
4 comments
32 min read
LW link

Potential Alignment mental tool: Keeping track of the types

Donald Hobson
Nov 22, 2021, 8:05 PM
29 points
1 comment
2 min read
LW link

Yudkowsky and Christiano discuss “Takeoff Speeds”

Eliezer Yudkowsky
Nov 22, 2021, 7:35 PM
210 points
176 comments
60 min read
LW link
1 review

Morally underdefined situations can be deadly

Stuart_Armstrong
Nov 22, 2021, 2:48 PM
17 points
8 comments
2 min read
LW link

A Bayesian Aggregation Paradox

Jsevillamol
Nov 22, 2021, 10:39 AM
87 points
23 comments
7 min read
LW link

[Question] Do factored sets elucidate anything about how to update everyday beliefs?

TekhneMakre
Nov 22, 2021, 6:51 AM
5 points
1 comment
1 min read
LW link

Even if you’re right, you’re wrong

DanielFilan
Nov 22, 2021, 5:40 AM
17 points
5 comments
1 min read
LW link
(danielfilan.com)

The Meta-Puzzle

DanielFilan
Nov 22, 2021, 5:30 AM
23 points
27 comments
3 min read
LW link
(danielfilan.com)

Some real examples of gradient hacking

Oliver Sourbut
Nov 22, 2021, 12:11 AM
15 points
8 comments
2 min read
LW link

“The Wisdom of the Lazy Teacher”

Richard_Kennaway
Nov 21, 2021, 9:11 PM
16 points
5 comments
1 min read
LW link

Vitalik: Cryptoeconomics and X-Risk Researchers Should Listen to Each Other More

Emerson Spartz
Nov 21, 2021, 6:53 PM
47 points
9 comments
5 min read
LW link

Giving Up On T-Mobile

jefftk
Nov 21, 2021, 4:50 PM
13 points
6 comments
2 min read
LW link
(www.jefftk.com)

From language to ethics by automated reasoning

Michele Campolo
Nov 21, 2021, 3:16 PM
4 points
4 comments
6 min read
LW link

Split and Commit

Duncan Sabien (Inactive)
Nov 21, 2021, 6:27 AM
191 points
34 comments
7 min read
LW link
1 review

What’s the weirdest way to win this game?

Adam Scherlis
Nov 21, 2021, 5:18 AM
9 points
5 comments
1 min read
LW link
(adam.scherlis.com)

Eat the cute animals instead

Andrew Vlahos
Nov 21, 2021, 1:06 AM
−4 points
2 comments
1 min read
LW link

Chris Voss negotiation MasterClass: review

VipulNaik
Nov 20, 2021, 10:39 PM
70 points
15 comments
33 min read
LW link

ACX Montreal Meetup Dec 4 2021

E
Nov 20, 2021, 5:49 PM
8 points
0 comments
1 min read
LW link

The Maker of MIND

Tomás B.
Nov 20, 2021, 4:28 PM
112 points
19 comments
11 min read
LW link

South Bay ACX/​LW Meetup—CHANGED LOCATION

IS
Nov 20, 2021, 2:42 PM
11 points
0 comments
1 min read
LW link

The Emperor’s New Clothes: a story of motivated stupidity

David Hugh-Jones
20 Nov 2021 13:24 UTC
10 points
5 comments
3 min read
LW link
(wyclif.substack.com)

[Book Review] “Sorceror’s Apprentice” by Tahir Shah

lsusr
20 Nov 2021 11:29 UTC
92 points
11 comments
7 min read
LW link

Competence/Confidence

Duncan Sabien (Inactive)
20 Nov 2021 8:59 UTC
60 points
19 comments
1 min read
LW link

Awesome-github Post-Scarcity List

lorepieri
20 Nov 2021 8:47 UTC
3 points
6 comments
1 min read
LW link

A Certain Formalization of Corrigibility Is VNM-Incoherent

TurnTrout
20 Nov 2021 0:30 UTC
68 points
24 comments
8 min read
LW link

More detailed proposal for measuring alignment of current models

Beth Barnes
20 Nov 2021 0:03 UTC
31 points
0 comments
8 min read
LW link

Ambitious Altruistic Software Engineering Efforts: Opportunities and Benefits

ozziegooen
19 Nov 2021 17:55 UTC
42 points
1 comment
9 min read
LW link
(forum.effectivealtruism.org)