The harms you don’t see

ViktoriaMalyasova16 Oct 2022 23:45 UTC
63 points
54 comments10 min readLW link

Max­i­mal lot­ter­ies for value learning

ViktoriaMalyasova16 Oct 2022 23:44 UTC
17 points
1 comment5 min readLW link

Pop­u­lar Per­sonal Fi­nan­cial Ad­vice ver­sus the Pro­fes­sors (James Choi, NBER)

BrownHairedEevee16 Oct 2022 22:21 UTC
17 points
5 comments2 min readLW link
(spinup-000d1a-wp-offload-media.s3.amazonaws.com)

Life, Death, and Fi­nance in the Cos­mic Mul­ti­verse

peterb16 Oct 2022 18:57 UTC
2 points
1 comment1 min readLW link

[Question] Sig­nifi­cance of the Lan­guage of Thought Hy­poth­e­sis?

DrFlaggstaff16 Oct 2022 18:09 UTC
1 point
3 comments1 min readLW link

Luck based medicine: my re­sent­ful story of be­com­ing a med­i­cal miracle

Elizabeth16 Oct 2022 17:40 UTC
480 points
119 comments12 min readLW link3 reviews
(acesounderglass.com)

Age changes what you care about

Dentin16 Oct 2022 15:36 UTC
140 points
36 comments2 min readLW link

Hal­i­fax, NS – Monthly Ra­tion­al­ist, EA, and ACX Meetup Kick-Off

Ideopunk16 Oct 2022 13:17 UTC
10 points
0 comments1 min readLW link

Cruxes in Katja Grace’s Counterarguments

azsantosk16 Oct 2022 8:44 UTC
16 points
0 comments7 min readLW link

Build­ing the Loft Beds

jefftk16 Oct 2022 1:10 UTC
10 points
4 comments1 min readLW link
(www.jefftk.com)

[Question] Best re­source to go from “typ­i­cal smart tech-savvy per­son” to “per­son who gets AGI risk ur­gency”?

Liron15 Oct 2022 22:26 UTC
16 points
8 comments1 min readLW link

Bounded dis­trust or Bounded trust?

M. Y. Zuo15 Oct 2022 16:41 UTC
2 points
12 comments3 min readLW link

I learn bet­ter when I frame learn­ing as Vengeance for losses in­curred through ig­no­rance, and you might too

chaosmage15 Oct 2022 12:41 UTC
79 points
9 comments3 min readLW link1 review

James Nor­ris from Upgrad­able on “What is Beyond Liv­ing a Prin­ci­pled Life”—OpenPrin­ci­ples Speaker Session

ti_guo15 Oct 2022 3:27 UTC
2 points
0 comments1 min readLW link

Quick Mock Brownie

jefftk15 Oct 2022 3:00 UTC
8 points
0 comments1 min readLW link
(www.jefftk.com)

A com­mon failure for foxes

Rob Bensinger14 Oct 2022 22:50 UTC
47 points
7 comments2 min readLW link

“AGI soon, but Nar­row works Bet­ter”

AnthonyRepetto14 Oct 2022 21:35 UTC
1 point
9 comments2 min readLW link

[Job]: AI Stan­dards Devel­op­ment Re­search Assistant

Tony Barrett14 Oct 2022 20:27 UTC
2 points
0 comments2 min readLW link

Me­tac­u­lus Launches the ‘Fore­cast­ing Our World In Data’ Pro­ject to Probe the Long-Term Future

ChristianWilliams14 Oct 2022 17:00 UTC
15 points
0 comments1 min readLW link

In­stru­men­tal con­ver­gence: scale and phys­i­cal interactions

14 Oct 2022 15:50 UTC
15 points
0 comments17 min readLW link
(www.gladstone.ai)

Coun­ter­ar­gu­ments to the ba­sic AI x-risk case

KatjaGrace14 Oct 2022 13:00 UTC
369 points
124 comments34 min readLW link1 review
(aiimpacts.org)

Another prob­lem with AI con­fine­ment: or­di­nary CPUs can work as ra­dio transmitters

RomanS14 Oct 2022 8:28 UTC
35 points
1 comment1 min readLW link
(news.softpedia.com)

[Question] How much of China’s Zero COVID policy is ac­tu­ally about COVID?

jmh14 Oct 2022 7:23 UTC
9 points
3 comments1 min readLW link

Max­i­mally Sim­ple Belt San­der Stand

jefftk14 Oct 2022 0:30 UTC
9 points
0 comments1 min readLW link
(www.jefftk.com)

Con­tra shard the­ory, in the con­text of the di­a­mond max­i­mizer problem

So8res13 Oct 2022 23:51 UTC
101 points
19 comments2 min readLW link1 review

Greed Is the Root of This Evil

Thane Ruthenis13 Oct 2022 20:40 UTC
18 points
7 comments8 min readLW link

Ve­hi­cle Pla­toon­ing—a real world ex­am­i­na­tion of the difficul­ties in coordination

M. Y. Zuo13 Oct 2022 19:33 UTC
24 points
6 comments2 min readLW link

The Vi­talik Bu­terin Fel­low­ship in AI Ex­is­ten­tial Safety is open for ap­pli­ca­tions!

Cynthia Chen13 Oct 2022 18:32 UTC
21 points
0 comments1 min readLW link

Feelings

Oren Montano13 Oct 2022 17:48 UTC
9 points
0 comments9 min readLW link

Against the nor­ma­tive re­al­ist’s wager

Joe Carlsmith13 Oct 2022 16:35 UTC
16 points
9 comments23 min readLW link

Weekly Non-Covid News #1 (10/​13/​22)

Zvi13 Oct 2022 15:40 UTC
52 points
16 comments16 min readLW link
(thezvi.wordpress.com)

Misal­ign­ment-by-de­fault in multi-agent systems

13 Oct 2022 15:38 UTC
19 points
8 comments20 min readLW link
(www.gladstone.ai)

A stub­born un­be­liever fi­nally gets the depth of the AI al­ign­ment problem

aelwood13 Oct 2022 15:16 UTC
17 points
8 comments3 min readLW link
(pursuingreality.substack.com)

Covid 10/​13/​22: Just the Facts

Zvi13 Oct 2022 14:40 UTC
28 points
7 comments10 min readLW link
(thezvi.wordpress.com)

When should you defer to ex­per­tise? A use­ful heuris­tic (Cross­post from EA fo­rum)

Noosphere8913 Oct 2022 14:14 UTC
9 points
3 comments2 min readLW link
(forum.effectivealtruism.org)

Cat­a­logu­ing Pri­ors in The­ory and Practice

Paul Bricman13 Oct 2022 12:36 UTC
13 points
8 comments7 min readLW link

Trans­for­ma­tive VR Is Likely Com­ing Soon

jimrandomh13 Oct 2022 6:25 UTC
92 points
46 comments2 min readLW link

Cam­bridge LW Meetup: See the Invisible

Tony Wang13 Oct 2022 5:44 UTC
1 point
0 comments1 min readLW link

Glos­sary Dance Game

jefftk13 Oct 2022 2:20 UTC
10 points
1 comment2 min readLW link
(www.jefftk.com)

Nice­ness is unnatural

So8res13 Oct 2022 1:30 UTC
121 points
20 comments8 min readLW link1 review

A strange twist on the road to AGI

cveres12 Oct 2022 23:27 UTC
−8 points
0 comments1 min readLW link

Help out Red­wood Re­search’s in­ter­pretabil­ity team by find­ing heuris­tics im­ple­mented by GPT-2 small

12 Oct 2022 21:25 UTC
50 points
11 comments4 min readLW link

Towards a com­pre­hen­sive study of po­ten­tial psy­cholog­i­cal causes of the or­di­nary range of vari­a­tion of af­fec­tive gen­der iden­tity in males

tailcalled12 Oct 2022 21:10 UTC
52 points
4 comments37 min readLW link

Six (and a half) in­tu­itions for KL divergence

CallumMcDougall12 Oct 2022 21:07 UTC
154 points
25 comments10 min readLW link1 review
(www.perfectlynormal.co.uk)

[MLSN #6]: Trans­parency sur­vey, prov­able ro­bust­ness, ML mod­els that pre­dict the future

Dan H12 Oct 2022 20:56 UTC
27 points
0 comments6 min readLW link

[Question] Pre­vi­ous Work on Re­cre­at­ing Neu­ral Net­work In­put from In­ter­me­di­ate Layer Activations

bglass12 Oct 2022 19:28 UTC
1 point
3 comments1 min readLW link

Be more effec­tive by learn­ing im­por­tant prac­ti­cal knowl­edge us­ing flashcards

Stenemo12 Oct 2022 18:05 UTC
5 points
2 comments1 min readLW link

Ar­ti­cle Re­view: Google’s AlphaTensor

Robert_AIZI12 Oct 2022 18:04 UTC
8 points
4 comments10 min readLW link

Align­ment 201 curriculum

Richard_Ngo12 Oct 2022 18:03 UTC
102 points
3 comments1 min readLW link
(www.agisafetyfundamentals.com)

Progress links and tweets, 2022-10-12

jasoncrawford12 Oct 2022 16:59 UTC
8 points
0 comments1 min readLW link
(rootsofprogress.org)