LW/​ACX/​EA Seat­tle sum­mer meetup

nsokolsky24 Jun 2022 23:30 UTC
4 points
2 comments1 min readLW link

Depen­den­cies for AGI pessimism

Yitz24 Jun 2022 22:25 UTC
7 points
4 comments1 min readLW link

[Link] Child­care : what the sci­ence says

Gunnar_Zarncke24 Jun 2022 21:45 UTC
46 points
4 comments1 min readLW link
(criticalscience.medium.com)

What if the best path for a per­son who wants to work on AGI al­ign­ment is to join Face­book or Google?

dbasch24 Jun 2022 21:23 UTC
2 points
3 comments1 min readLW link

[Link] Ad­ver­sar­i­ally trained neu­ral rep­re­sen­ta­tions may already be as ro­bust as cor­re­spond­ing biolog­i­cal neu­ral representations

Gunnar_Zarncke24 Jun 2022 20:51 UTC
35 points
9 comments1 min readLW link

Up­dated Defer­ence is not a strong ar­gu­ment against the util­ity un­cer­tainty ap­proach to alignment

Ivan Vendrov24 Jun 2022 19:32 UTC
26 points
8 comments4 min readLW link

Cracks in the Wall, Part I: The Conscious

silo24 Jun 2022 18:29 UTC
−3 points
28 comments12 min readLW link
(stephenfoster.substack.com)

[Question] Do al­ign­ment con­cerns ex­tend to pow­er­ful non-AI agents?

Ozyrus24 Jun 2022 18:26 UTC
21 points
13 comments1 min readLW link

Raphaël Millière on Gen­er­al­iza­tion and Scal­ing Maximalism

Michaël Trazzi24 Jun 2022 18:18 UTC
21 points
2 comments4 min readLW link
(theinsideview.ai)

Worked Ex­am­ples of Shap­ley Values

lalaithion24 Jun 2022 17:13 UTC
75 points
11 comments8 min readLW link

Fea­ture re­quest: vot­ing but­tons at the bot­tom?

Oliver Sourbut24 Jun 2022 14:41 UTC
71 points
12 comments1 min readLW link

In­tel­li­gence in Com­mit­ment Races

David Udell24 Jun 2022 14:30 UTC
28 points
8 comments5 min readLW link

Linkpost: Robin Han­son—Why Not Wait On AI Risk?

Yair Halberstadt24 Jun 2022 14:23 UTC
41 points
14 comments1 min readLW link
(www.overcomingbias.com)

[Question] “Science Cathe­drals”

Alex Vermillion24 Jun 2022 3:30 UTC
22 points
9 comments1 min readLW link

LessWrong Has Agree/​Disagree Vot­ing On All New Com­ment Threads

Ben Pace24 Jun 2022 0:43 UTC
154 points
219 comments2 min readLW link1 review

Book re­view: The Pas­sen­ger by Lisa Lutz

KatjaGrace23 Jun 2022 23:10 UTC
12 points
1 comment1 min readLW link
(worldspiritsockpuppet.com)

20 Cri­tiques of AI Safety That I Found on Twitter

dkirmani23 Jun 2022 19:23 UTC
21 points
16 comments1 min readLW link

The Limits of Automation

milkandcigarettes23 Jun 2022 18:03 UTC
5 points
1 comment5 min readLW link
(milkandcigarettes.com)

[Question] Is CIRL a promis­ing agenda?

Chris_Leong23 Jun 2022 17:12 UTC
28 points
16 comments1 min readLW link

[Link] OpenAI: Learn­ing to Play Minecraft with Video PreTrain­ing (VPT)

Aryeh Englander23 Jun 2022 16:29 UTC
53 points
3 comments1 min readLW link

Half-baked AI Safety ideas thread

Aryeh Englander23 Jun 2022 16:11 UTC
64 points
63 comments1 min readLW link

Non­profit Boards are Weird

HoldenKarnofsky23 Jun 2022 14:40 UTC
158 points
26 comments20 min readLW link1 review
(www.cold-takes.com)

Covid 6/​23/​22: Un­der Five Alive

Zvi23 Jun 2022 14:00 UTC
26 points
9 comments10 min readLW link
(thezvi.wordpress.com)

How do states re­spond to changes in nu­clear risk

NathanBarnard23 Jun 2022 12:42 UTC
8 points
2 comments5 min readLW link

[Question] What’s the con­tin­gency plan if we get AGI to­mor­row?

Yitz23 Jun 2022 3:10 UTC
61 points
23 comments1 min readLW link

[Question] What are the best “policy” ap­proaches in wor­lds where al­ign­ment is difficult?

LHA23 Jun 2022 1:53 UTC
1 point
0 comments1 min readLW link

AI Train­ing Should Allow Opt-Out

alyssavance23 Jun 2022 1:33 UTC
76 points
13 comments6 min readLW link

Loose thoughts on AGI risk

Yitz23 Jun 2022 1:02 UTC
7 points
3 comments1 min readLW link

Air Con­di­tioner Test Re­sults & Discussion

johnswentworth22 Jun 2022 22:26 UTC
82 points
42 comments6 min readLW link

An­nounc­ing the LessWrong Cu­rated Podcast

22 Jun 2022 22:16 UTC
137 points
27 comments1 min readLW link

Google’s new text-to-image model—Parti, a demon­stra­tion of scal­ing benefits

Kayden22 Jun 2022 20:00 UTC
32 points
4 comments1 min readLW link

Build­ing an Epistemic Sta­tus Tracker

rcu22 Jun 2022 18:57 UTC
7 points
8 comments1 min readLW link

Con­fu­sion about neu­ro­science/​cog­ni­tive sci­ence as a dan­ger for AI Alignment

Samuel Nellessen22 Jun 2022 17:59 UTC
3 points
1 comment3 min readLW link
(snellessen.com)

[Question] How do I use caf­feine op­ti­mally?

randomstring22 Jun 2022 17:59 UTC
18 points
31 comments1 min readLW link

Make learn­ing a reality

Dalton Mabery22 Jun 2022 15:58 UTC
13 points
2 comments1 min readLW link

Reflec­tion Mechanisms as an Align­ment tar­get: A survey

22 Jun 2022 15:05 UTC
32 points
1 comment14 min readLW link

House Phone

jefftk22 Jun 2022 14:20 UTC
15 points
2 comments1 min readLW link
(www.jefftk.com)

How to Vi­su­al­ize Bayesianism

David Udell22 Jun 2022 13:57 UTC
9 points
2 comments3 min readLW link

[Question] Are there spaces for ex­tremely short-form ra­tio­nal­ity con­tent?

Aleksi Liimatainen22 Jun 2022 10:39 UTC
5 points
1 comment1 min readLW link

Sols­tice Movie Re­view: Sum­mer Wars

SebastianG 22 Jun 2022 1:09 UTC
22 points
6 comments1 min readLW link

Se­cu­rity Mind­set: Les­sons from 20+ years of Soft­ware Se­cu­rity Failures Rele­vant to AGI Alignment

elspood21 Jun 2022 23:55 UTC
369 points
42 comments7 min readLW link1 review

A Quick List of Some Prob­lems in AI Align­ment As A Field

Nicholas Kross21 Jun 2022 23:23 UTC
75 points
12 comments6 min readLW link
(www.thinkingmuchbetter.com)

[Question] What is the differ­ence be­tween AI mis­al­ign­ment and bad pro­gram­ming?

puzzleGuzzle21 Jun 2022 21:52 UTC
6 points
2 comments1 min readLW link

What I mean by the phrase “get­ting in­ti­mate with re­al­ity”

Luise21 Jun 2022 19:42 UTC
7 points
0 comments2 min readLW link
(forum.effectivealtruism.org)

What I mean by the phrase “tak­ing ideas se­ri­ously”

Luise21 Jun 2022 19:42 UTC
5 points
2 comments1 min readLW link
(forum.effectivealtruism.org)

Hy­dropho­bic Glasses Coat­ing Review

jefftk21 Jun 2022 18:00 UTC
16 points
6 comments1 min readLW link
(www.jefftk.com)

Progress links and tweets, 2022-06-20

jasoncrawford21 Jun 2022 17:12 UTC
12 points
2 comments1 min readLW link
(rootsofprogress.org)

De­bat­ing Whether AI is Con­scious Is A Dis­trac­tion from Real Problems

sidhe_they21 Jun 2022 16:56 UTC
2 points
10 comments1 min readLW link
(techpolicy.press)

Miti­gat­ing the dam­age from un­al­igned ASI by co­op­er­at­ing with aliens that don’t ex­ist yet

MSRayne21 Jun 2022 16:12 UTC
−8 points
7 comments6 min readLW link

The in­or­di­nately slow spread of good AGI con­ver­sa­tions in ML

Rob Bensinger21 Jun 2022 16:09 UTC
173 points
62 comments8 min readLW link