Robin Han­son asks “Why Not Wait On AI Risk?”

Gunnar_Zarncke26 Jun 2022 23:32 UTC
22 points
4 comments1 min readLW link
(www.overcomingbias.com)

Sex Fairy Lore

pchvykov26 Jun 2022 20:42 UTC
−25 points
10 comments6 min readLW link

King David’s %: Estab­lish­ing a new sym­bol for Bayesian prob­a­bil­ity.

Paul Logan26 Jun 2022 19:47 UTC
−11 points
1 comment5 min readLW link
(laulpogan.substack.com)

Train­ing Trace Pri­ors and Speed Priors

Adam Jermyn26 Jun 2022 18:07 UTC
17 points
0 comments3 min readLW link

My cur­rent take on In­ter­nal Fam­ily Sys­tems “parts”

Kaj_Sotala26 Jun 2022 17:40 UTC
98 points
11 comments3 min readLW link
(kajsotala.fi)

A Quick On­tol­ogy of Agreement

ravedon26 Jun 2022 17:39 UTC
5 points
2 comments2 min readLW link

Seven ways to be­come un­stop­pably agentic

Evie Cottrell26 Jun 2022 17:39 UTC
66 points
16 comments8 min readLW link

For­mal­iz­ing Deception

JamesH26 Jun 2022 17:39 UTC
14 points
2 comments5 min readLW link

Dust The­ory vs Ruliad

svemirski26 Jun 2022 16:08 UTC
3 points
0 comments1 min readLW link

My cog­ni­tive in­er­tia cycle

MSRayne26 Jun 2022 15:49 UTC
7 points
4 comments4 min readLW link

How do poor coun­tries get rich: some the­o­ries

NathanBarnard26 Jun 2022 10:41 UTC
8 points
2 comments10 min readLW link

Child Contracting

jefftk26 Jun 2022 2:30 UTC
48 points
2 comments1 min readLW link
(www.jefftk.com)

Con­di­tion­ing Gen­er­a­tive Models

Adam Jermyn25 Jun 2022 22:15 UTC
24 points
18 comments10 min readLW link

Unforgivable

Novalis25 Jun 2022 20:57 UTC
−9 points
12 comments5 min readLW link
(novalis.blog)

SunPJ in Alenia

FlorianH25 Jun 2022 19:39 UTC
9 points
19 comments8 min readLW link
(plausiblestuff.com)

[Question] Should any hu­man en­slave an AGI sys­tem?

AlignmentMirror25 Jun 2022 19:35 UTC
−13 points
44 comments1 min readLW link

Fun­da­men­tal Uncer­tainty: Chap­ter 3 - Why don’t we agree on what’s right?

Gordon Seidoh Worley25 Jun 2022 17:50 UTC
27 points
22 comments14 min readLW link

[Question] How “should” coun­ter­fac­tual pre­dic­tion mar­kets work?

eapi25 Jun 2022 17:44 UTC
9 points
6 comments1 min readLW link

Con­ver­sa­tion with Eliezer: What do you want the sys­tem to do?

Orpheus1625 Jun 2022 17:36 UTC
114 points
38 comments2 min readLW link

AI-Writ­ten Cri­tiques Help Hu­mans No­tice Flaws

paulfchristiano25 Jun 2022 17:22 UTC
137 points
5 comments3 min readLW link
(openai.com)

Some re­flec­tions on the LW com­mu­nity af­ter sev­eral months of ac­tive engagement

M. Y. Zuo25 Jun 2022 17:04 UTC
72 points
40 comments4 min readLW link

On The Spec­trum, On The Guest List: (vii) The Marquee

party girl25 Jun 2022 16:54 UTC
5 points
0 comments19 min readLW link
(onthespectrumontheguestlist.substack.com)

Iden­ti­fi­ca­tion of Nat­u­ral Modularity

Stephen Fowler25 Jun 2022 15:05 UTC
15 points
3 comments7 min readLW link

[LQ] Some Thoughts on Mes­sag­ing Around AI Risk

DragonGod25 Jun 2022 13:53 UTC
5 points
3 comments6 min readLW link

Quick Sum­maries of Two Papers on Kant and Game Theory

Erich_Grunewald25 Jun 2022 10:25 UTC
8 points
2 comments4 min readLW link
(www.erichgrunewald.com)

[Question] Do you con­sider your cur­rent, non-su­per­hu­man self al­igned with “hu­man­ity” already?

Rana Dexsin25 Jun 2022 4:15 UTC
13 points
19 comments1 min readLW link

LW/​ACX/​EA Seat­tle sum­mer meetup

nsokolsky24 Jun 2022 23:30 UTC
4 points
2 comments1 min readLW link

Depen­den­cies for AGI pessimism

Yitz24 Jun 2022 22:25 UTC
7 points
4 comments1 min readLW link

[Link] Child­care : what the sci­ence says

Gunnar_Zarncke24 Jun 2022 21:45 UTC
46 points
4 comments1 min readLW link
(criticalscience.medium.com)

What if the best path for a per­son who wants to work on AGI al­ign­ment is to join Face­book or Google?

dbasch24 Jun 2022 21:23 UTC
2 points
3 comments1 min readLW link

[Link] Ad­ver­sar­i­ally trained neu­ral rep­re­sen­ta­tions may already be as ro­bust as cor­re­spond­ing biolog­i­cal neu­ral representations

Gunnar_Zarncke24 Jun 2022 20:51 UTC
35 points
9 comments1 min readLW link

Up­dated Defer­ence is not a strong ar­gu­ment against the util­ity un­cer­tainty ap­proach to alignment

Ivan Vendrov24 Jun 2022 19:32 UTC
26 points
8 comments4 min readLW link

Cracks in the Wall, Part I: The Conscious

silo24 Jun 2022 18:29 UTC
−3 points
28 comments12 min readLW link
(stephenfoster.substack.com)

[Question] Do al­ign­ment con­cerns ex­tend to pow­er­ful non-AI agents?

Ozyrus24 Jun 2022 18:26 UTC
21 points
13 comments1 min readLW link

Raphaël Millière on Gen­er­al­iza­tion and Scal­ing Maximalism

Michaël Trazzi24 Jun 2022 18:18 UTC
21 points
2 comments4 min readLW link
(theinsideview.ai)

Worked Ex­am­ples of Shap­ley Values

lalaithion24 Jun 2022 17:13 UTC
75 points
11 comments8 min readLW link

Fea­ture re­quest: vot­ing but­tons at the bot­tom?

Oliver Sourbut24 Jun 2022 14:41 UTC
71 points
12 comments1 min readLW link

In­tel­li­gence in Com­mit­ment Races

David Udell24 Jun 2022 14:30 UTC
28 points
8 comments5 min readLW link

Linkpost: Robin Han­son—Why Not Wait On AI Risk?

Yair Halberstadt24 Jun 2022 14:23 UTC
41 points
14 comments1 min readLW link
(www.overcomingbias.com)

[Question] “Science Cathe­drals”

Alex Vermillion24 Jun 2022 3:30 UTC
22 points
9 comments1 min readLW link

LessWrong Has Agree/​Disagree Vot­ing On All New Com­ment Threads

Ben Pace24 Jun 2022 0:43 UTC
155 points
219 comments2 min readLW link1 review

Book re­view: The Pas­sen­ger by Lisa Lutz

KatjaGrace23 Jun 2022 23:10 UTC
12 points
1 comment1 min readLW link
(worldspiritsockpuppet.com)

20 Cri­tiques of AI Safety That I Found on Twitter

dkirmani23 Jun 2022 19:23 UTC
21 points
16 comments1 min readLW link

The Limits of Automation

milkandcigarettes23 Jun 2022 18:03 UTC
5 points
1 comment5 min readLW link
(milkandcigarettes.com)

[Question] Is CIRL a promis­ing agenda?

Chris_Leong23 Jun 2022 17:12 UTC
28 points
16 comments1 min readLW link

[Link] OpenAI: Learn­ing to Play Minecraft with Video PreTrain­ing (VPT)

Aryeh Englander23 Jun 2022 16:29 UTC
53 points
3 comments1 min readLW link

Half-baked AI Safety ideas thread

Aryeh Englander23 Jun 2022 16:11 UTC
64 points
63 comments1 min readLW link

Non­profit Boards are Weird

HoldenKarnofsky23 Jun 2022 14:40 UTC
158 points
26 comments20 min readLW link1 review
(www.cold-takes.com)

Covid 6/​23/​22: Un­der Five Alive

Zvi23 Jun 2022 14:00 UTC
26 points
9 comments10 min readLW link
(thezvi.wordpress.com)

How do states re­spond to changes in nu­clear risk

NathanBarnard23 Jun 2022 12:42 UTC
8 points
2 comments5 min readLW link