Book review: The Passenger by Lisa Lutz

KatjaGrace 23 Jun 2022 23:10 UTC
12 points
1 comment · 1 min read · LW link
(worldspiritsockpuppet.com)

20 Critiques of AI Safety That I Found on Twitter

dkirmani 23 Jun 2022 19:23 UTC
21 points
16 comments · 1 min read · LW link

The Limits of Automation

milkandcigarettes 23 Jun 2022 18:03 UTC
5 points
1 comment · 5 min read · LW link
(milkandcigarettes.com)

[Question] Is CIRL a promising agenda?

Chris_Leong 23 Jun 2022 17:12 UTC
28 points
16 comments · 1 min read · LW link

[Link] OpenAI: Learning to Play Minecraft with Video PreTraining (VPT)

Aryeh Englander 23 Jun 2022 16:29 UTC
53 points
3 comments · 1 min read · LW link

Half-baked AI Safety ideas thread

Aryeh Englander 23 Jun 2022 16:11 UTC
64 points
61 comments · 1 min read · LW link

Nonprofit Boards are Weird

HoldenKarnofsky 23 Jun 2022 14:40 UTC
154 points
26 comments · 20 min read · LW link · 1 review
(www.cold-takes.com)

Covid 6/23/22: Under Five Alive

Zvi 23 Jun 2022 14:00 UTC
26 points
9 comments · 10 min read · LW link
(thezvi.wordpress.com)

How do states respond to changes in nuclear risk

NathanBarnard 23 Jun 2022 12:42 UTC
8 points
2 comments · 5 min read · LW link

[Question] What’s the contingency plan if we get AGI tomorrow?

Yitz 23 Jun 2022 3:10 UTC
61 points
23 comments · 1 min read · LW link

[Question] What are the best “policy” approaches in worlds where alignment is difficult?

LHA 23 Jun 2022 1:53 UTC
1 point
0 comments · 1 min read · LW link

AI Training Should Allow Opt-Out

alyssavance 23 Jun 2022 1:33 UTC
76 points
13 comments · 6 min read · LW link

Loose thoughts on AGI risk

Yitz 23 Jun 2022 1:02 UTC
7 points
3 comments · 1 min read · LW link

Air Conditioner Test Results & Discussion

johnswentworth 22 Jun 2022 22:26 UTC
82 points
42 comments · 6 min read · LW link

Announcing the LessWrong Curated Podcast

22 Jun 2022 22:16 UTC
137 points
27 comments · 1 min read · LW link

Google’s new text-to-image model—Parti, a demonstration of scaling benefits

Kayden 22 Jun 2022 20:00 UTC
32 points
4 comments · 1 min read · LW link

Building an Epistemic Status Tracker

rcu 22 Jun 2022 18:57 UTC
7 points
6 comments · 1 min read · LW link

Confusion about neuroscience/cognitive science as a danger for AI Alignment

Samuel Nellessen 22 Jun 2022 17:59 UTC
2 points
1 comment · 3 min read · LW link
(snellessen.com)

[Question] How do I use caffeine optimally?

randomstring 22 Jun 2022 17:59 UTC
18 points
31 comments · 1 min read · LW link

Make learning a reality

Dalton Mabery 22 Jun 2022 15:58 UTC
13 points
2 comments · 1 min read · LW link

Reflection Mechanisms as an Alignment target: A survey

22 Jun 2022 15:05 UTC
32 points
1 comment · 14 min read · LW link

House Phone

jefftk 22 Jun 2022 14:20 UTC
15 points
2 comments · 1 min read · LW link
(www.jefftk.com)

How to Visualize Bayesianism

David Udell 22 Jun 2022 13:57 UTC
9 points
2 comments · 3 min read · LW link

[Question] Are there spaces for extremely short-form rationality content?

Aleksi Liimatainen 22 Jun 2022 10:39 UTC
4 points
1 comment · 1 min read · LW link

Solstice Movie Review: Summer Wars

JohnBuridan 22 Jun 2022 1:09 UTC
22 points
6 comments · 1 min read · LW link

Security Mindset: Lessons from 20+ years of Software Security Failures Relevant to AGI Alignment

elspood 21 Jun 2022 23:55 UTC
361 points
42 comments · 7 min read · LW link · 1 review

A Quick List of Some Problems in AI Alignment As A Field

NicholasKross 21 Jun 2022 23:23 UTC
75 points
12 comments · 6 min read · LW link
(www.thinkingmuchbetter.com)

[Question] What is the difference between AI misalignment and bad programming?

puzzleGuzzle 21 Jun 2022 21:52 UTC
6 points
2 comments · 1 min read · LW link

What I mean by the phrase “getting intimate with reality”

Luise 21 Jun 2022 19:42 UTC
6 points
0 comments · 2 min read · LW link
(forum.effectivealtruism.org)

What I mean by the phrase “taking ideas seriously”

Luise 21 Jun 2022 19:42 UTC
5 points
2 comments · 1 min read · LW link
(forum.effectivealtruism.org)

Hydrophobic Glasses Coating Review

jefftk 21 Jun 2022 18:00 UTC
16 points
6 comments · 1 min read · LW link
(www.jefftk.com)

Progress links and tweets, 2022-06-20

jasoncrawford 21 Jun 2022 17:12 UTC
12 points
2 comments · 1 min read · LW link
(rootsofprogress.org)

Debating Whether AI is Conscious Is A Distraction from Real Problems

sidhe_they 21 Jun 2022 16:56 UTC
2 points
10 comments · 1 min read · LW link
(techpolicy.press)

Mitigating the damage from unaligned ASI by cooperating with aliens that don’t exist yet

MSRayne 21 Jun 2022 16:12 UTC
−8 points
7 comments · 6 min read · LW link

The inordinately slow spread of good AGI conversations in ML

Rob Bensinger 21 Jun 2022 16:09 UTC
173 points
62 comments · 8 min read · LW link

Getting from an unaligned AGI to an aligned AGI?

Tor Økland Barstad 21 Jun 2022 12:36 UTC
13 points
7 comments · 9 min read · LW link

Common but neglected risk factors that may let you get Paxlovid

DirectedEvolution 21 Jun 2022 7:34 UTC
29 points
8 comments · 4 min read · LW link

Dagger of Detect Evil

lsusr 21 Jun 2022 6:23 UTC
38 points
20 comments · 3 min read · LW link

[Question] How easy/fast is it for a AGI to hack computers/a human brain?

Noosphere89 21 Jun 2022 0:34 UTC
0 points
1 comment · 1 min read · LW link

[Question] What is the most probable AI?

Zeruel017 20 Jun 2022 23:26 UTC
−2 points
0 comments · 3 min read · LW link

Evaluating a Corsi-Rosenthal Filter Cube

jefftk 20 Jun 2022 19:40 UTC
13 points
3 comments · 1 min read · LW link
(www.jefftk.com)

Survey re AIS/LTism office in NYC

RyanCarey 20 Jun 2022 19:21 UTC
7 points
0 comments · 1 min read · LW link

Is This Thing Sentient, Y/N?

Thane Ruthenis 20 Jun 2022 18:37 UTC
4 points
9 comments · 7 min read · LW link

Steam

abramdemski 20 Jun 2022 17:38 UTC
134 points
13 comments · 5 min read · LW link · 1 review

Parable: The Bomb that doesn’t Explode

Lone Pine 20 Jun 2022 16:41 UTC
14 points
5 comments · 2 min read · LW link

On corrigibility and its basin

Donald Hobson 20 Jun 2022 16:33 UTC
16 points
3 comments · 2 min read · LW link

Announcing the DWATV Discord

Zvi 20 Jun 2022 15:50 UTC
20 points
9 comments · 1 min read · LW link
(thezvi.wordpress.com)

Key Papers in Language Model Safety

aogara 20 Jun 2022 15:00 UTC
39 points
1 comment · 22 min read · LW link

Relationship Advice Repository

Ruby 20 Jun 2022 14:39 UTC
102 points
36 comments · 39 min read · LW link

Adaptation Executors and the Telos Margin

Plinthist 20 Jun 2022 13:06 UTC
2 points
8 comments · 5 min read · LW link