Distri­bu­tion Shifts and The Im­por­tance of AI Safety

Leon Lang29 Sep 2022 22:38 UTC
17 points
2 comments12 min readLW link

Clar­ify­ing the Agent-Like Struc­ture Problem

johnswentworth29 Sep 2022 21:28 UTC
58 points
15 comments6 min readLW link

Where I cur­rently dis­agree with Ryan Green­blatt’s ver­sion of the ELK approach

So8res29 Sep 2022 21:18 UTC
65 points
7 comments5 min readLW link

It mat­ters when the first sharp left turn happens

Adam Jermyn29 Sep 2022 20:12 UTC
44 points
9 comments4 min readLW link

Covid 9/​29/​22: The Jones Act Waver

Zvi29 Sep 2022 18:20 UTC
47 points
10 comments24 min readLW link
(thezvi.wordpress.com)

High-Im­pact Psy­chol­ogy (HIPsy): Pilot­ing a Global Network

Inga G.29 Sep 2022 18:16 UTC
8 points
0 comments1 min readLW link

Unit Test Everything

DirectedEvolution29 Sep 2022 18:12 UTC
30 points
0 comments8 min readLW link

Builder/​Breaker for Deconfusion

abramdemski29 Sep 2022 17:36 UTC
72 points
9 comments9 min readLW link

[Question] Re­sources to find/​reg­ister the ra­tio­nal­ists that spe­cial­ize in a given topic?

tailcalled29 Sep 2022 17:20 UTC
25 points
8 comments1 min readLW link

Make-A-Video by Meta AI

P.29 Sep 2022 17:07 UTC
9 points
4 comments1 min readLW link
(makeavideo.studio)

FDT is not di­rectly com­pa­rable to CDT and EDT

Sylvester Kollin29 Sep 2022 14:42 UTC
36 points
8 comments21 min readLW link

[Link] “Im­proper Nouns” by siderea

Kenny29 Sep 2022 13:28 UTC
17 points
3 comments1 min readLW link
(siderea.dreamwidth.org)

Open ap­pli­ca­tion to be­come an AI safety pro­ject mentor

Charbel-Raphaël29 Sep 2022 11:27 UTC
10 points
0 comments1 min readLW link
(docs.google.com)

[Question] Should rea­son­ably healthy peo­ple still take Paxlovid?

Sameerishere29 Sep 2022 3:41 UTC
15 points
2 comments1 min readLW link

Reflec­tion on a Con­sult­ing Workshop

myutin29 Sep 2022 3:04 UTC
12 points
1 comment3 min readLW link

Bet­ter Con­struc­tion Cost Es­ti­mates?

jefftk29 Sep 2022 2:30 UTC
12 points
4 comments2 min readLW link
(www.jefftk.com)

Petrov Day Ret­ro­spec­tive: 2022

Ruby28 Sep 2022 22:16 UTC
107 points
41 comments4 min readLW link

Es­ti­mat­ing the Cur­rent and Fu­ture Num­ber of AI Safety Researchers

Stephen McAleese28 Sep 2022 21:11 UTC
46 points
14 comments9 min readLW link
(forum.effectivealtruism.org)

Progress links and tweets, 2022-09-28

jasoncrawford28 Sep 2022 20:26 UTC
13 points
1 comment1 min readLW link
(rootsofprogress.org)

EA & LW Fo­rums Weekly Sum­mary (19 − 25 Sep 22′)

Zoe Williams28 Sep 2022 20:18 UTC
16 points
2 comments19 min readLW link

LOVE in a sim­box is all you need

jacob_cannell28 Sep 2022 18:25 UTC
63 points
72 comments44 min readLW link1 review

A Library and Tu­to­rial for Fac­tored Cog­ni­tion with Lan­guage Models

28 Sep 2022 18:15 UTC
47 points
0 comments1 min readLW link

Re­ward IS the Op­ti­miza­tion Target

Carn28 Sep 2022 17:59 UTC
−2 points
3 comments5 min readLW link

AI Safety Endgame Stories

Ivan Vendrov28 Sep 2022 16:58 UTC
31 points
11 comments11 min readLW link

Will Values and Com­pe­ti­tion De­cou­ple?

interstice28 Sep 2022 16:27 UTC
15 points
11 comments17 min readLW link

Ge­or­gism in Space

harsimony28 Sep 2022 16:05 UTC
41 points
12 comments4 min readLW link
(harsimony.wordpress.com)

QAPR 3: in­ter­pretabil­ity-guided train­ing of neu­ral nets

Quintin Pope28 Sep 2022 16:02 UTC
58 points
2 comments10 min readLW link

Strange Loops—Self-Refer­ence from Num­ber The­ory to AI

ojorgensen28 Sep 2022 14:10 UTC
15 points
6 comments18 min readLW link

Why I think strong gen­eral AI is com­ing soon

porby28 Sep 2022 5:40 UTC
325 points
139 comments34 min readLW link1 review

About Q Home

Q Home28 Sep 2022 4:56 UTC
11 points
4 comments1 min readLW link

[Linkpost] “In­ten­sity and fre­quency of ex­treme novel epi­demics” by Mar­i­ani et al. (2021)

Fer32dwt34r3dfsz28 Sep 2022 3:31 UTC
10 points
0 comments1 min readLW link

Threat-Re­sis­tant Bar­gain­ing Me­ga­post: In­tro­duc­ing the ROSE Value

Diffractor28 Sep 2022 1:20 UTC
143 points
19 comments53 min readLW link2 reviews

7 traps that (we think) new al­ign­ment re­searchers of­ten fall into

27 Sep 2022 23:13 UTC
174 points
10 comments4 min readLW link

Failure modes in a shard the­ory al­ign­ment plan

Thomas Kwa27 Sep 2022 22:34 UTC
26 points
2 comments7 min readLW link

[Question] Is a PhD nec­es­sary to con­tribute mean­ingfully to a field?

TrudosKudos27 Sep 2022 21:27 UTC
4 points
7 comments1 min readLW link

Why we’re not found­ing a hu­man-data-for-al­ign­ment org

27 Sep 2022 20:14 UTC
88 points
5 comments29 min readLW link
(forum.effectivealtruism.org)

A Poorly Planned Loft Bed

jefftk27 Sep 2022 17:50 UTC
9 points
2 comments1 min readLW link
(www.jefftk.com)

Wise Crowd & Demo­cratic Spirit

Hristo Zaykov27 Sep 2022 17:45 UTC
1 point
0 comments2 min readLW link
(www.hristo.blog)

Soft skills for meetups

mingyuan27 Sep 2022 17:26 UTC
48 points
3 comments5 min readLW link

[Question] En­rich­ing Youtube con­tent recommendations

Martín Soto27 Sep 2022 16:54 UTC
8 points
4 comments1 min readLW link

ex­is­ten­tial self-determination

Tamsin Leake27 Sep 2022 16:08 UTC
14 points
2 comments2 min readLW link
(carado.moe)

The Onion Test for Per­sonal and In­sti­tu­tional Honesty

27 Sep 2022 15:26 UTC
156 points
31 comments3 min readLW link3 reviews

Book re­view: “The Heart of the Brain: The Hy­potha­la­mus and Its Hor­mones”

Steven Byrnes27 Sep 2022 13:20 UTC
65 points
3 comments18 min readLW link

My Thoughts on the ML Safety Course

zeshen27 Sep 2022 13:15 UTC
50 points
3 comments17 min readLW link

Sum­mary of ML Safety Course

zeshen27 Sep 2022 13:05 UTC
7 points
0 comments6 min readLW link

Prob­a­bil­is­tic rea­son­ing for de­scrip­tion and experience

Q Home27 Sep 2022 10:57 UTC
0 points
0 comments26 min readLW link

A Prince, a Pau­per, Power, Panama

Alok Singh27 Sep 2022 7:10 UTC
10 points
0 comments1 min readLW link
(alok.github.io)

Dou­ble As­teroid Redi­rec­tion Test succeeds

sanxiyn27 Sep 2022 6:37 UTC
19 points
5 comments1 min readLW link
(twitter.com)

[Question] How would I know if a PhD is the right ca­reer path?

Bob Guran27 Sep 2022 5:49 UTC
4 points
4 comments1 min readLW link

Re­view of Ex­am­ine.com’s vi­tamin write-ups

26 Sep 2022 23:40 UTC
59 points
1 comment5 min readLW link
(acesounderglass.com)