All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 232425 26 27 28 29 30

Book review: The Passenger by Lisa Lutz

KatjaGrace23 Jun 2022 23:10 UTC

12 points

1 comment1 min readLW link

(worldspiritsockpuppet.com)

20 Critiques of AI Safety That I Found on Twitter

dkirmani23 Jun 2022 19:23 UTC

21 points

16 comments1 min readLW link

The Limits of Automation

milkandcigarettes23 Jun 2022 18:03 UTC

5 points

1 comment5 min readLW link

(milkandcigarettes.com)

[Question] Is CIRL a promising agenda?

Chris_Leong23 Jun 2022 17:12 UTC

28 points

17 comments1 min readLW link

[Link] OpenAI: Learning to Play Minecraft with Video PreTraining (VPT)

Aryeh Englander23 Jun 2022 16:29 UTC

53 points

3 comments1 min readLW link

Half-baked AI Safety ideas thread

Aryeh Englander23 Jun 2022 16:11 UTC

65 points

63 comments1 min readLW link

Nonprofit Boards are Weird

HoldenKarnofsky23 Jun 2022 14:40 UTC

167 points

26 comments20 min readLW link 1 review

(www.cold-takes.com)

Covid 6/23/22: Under Five Alive

Zvi23 Jun 2022 14:00 UTC

26 points

9 comments10 min readLW link

(thezvi.wordpress.com)

How do states respond to changes in nuclear risk

NathanBarnard23 Jun 2022 12:42 UTC

8 points

2 comments5 min readLW link

[Question] What’s the contingency plan if we get AGI tomorrow?

Yitz23 Jun 2022 3:10 UTC

63 points

23 comments1 min readLW link

[Question] What are the best “policy” approaches in worlds where alignment is difficult?

LHA23 Jun 2022 1:53 UTC

1 point

0 comments1 min readLW link

AI Training Should Allow Opt-Out

alyssavance23 Jun 2022 1:33 UTC

76 points

13 comments6 min readLW link

Loose thoughts on AGI risk

Yitz23 Jun 2022 1:02 UTC

7 points

3 comments1 min readLW link

Air Conditioner Test Results & Discussion

johnswentworth22 Jun 2022 22:26 UTC

82 points

42 comments6 min readLW link

Announcing the LessWrong Curated Podcast

Ben Pace and Solenoid_Entity

22 Jun 2022 22:16 UTC

138 points

27 comments1 min readLW link

Google’s new text-to-image model—Parti, a demonstration of scaling benefits

Kayden22 Jun 2022 20:00 UTC

32 points

4 comments1 min readLW link

Building an Epistemic Status Tracker

rcu22 Jun 2022 18:57 UTC

7 points

8 comments1 min readLW link

Confusion about neuroscience/cognitive science as a danger for AI Alignment

Samuel Nellessen22 Jun 2022 17:59 UTC

3 points

1 comment3 min readLW link

(snellessen.com)

[Question] How do I use caffeine optimally?

randomstring22 Jun 2022 17:59 UTC

18 points

31 comments1 min readLW link

Make learning a reality

Dalton Mabery22 Jun 2022 15:58 UTC

13 points

2 comments1 min readLW link

Reflection Mechanisms as an Alignment target: A survey

Marius Hobbhahn, elandgre and Beth Barnes

22 Jun 2022 15:05 UTC

32 points

1 comment14 min readLW link

House Phone

jefftk22 Jun 2022 14:20 UTC

15 points

2 comments1 min readLW link

(www.jefftk.com)

How to Visualize Bayesianism

David Udell22 Jun 2022 13:57 UTC

9 points

2 comments3 min readLW link

[Question] Are there spaces for extremely short-form rationality content?

Aleksi Liimatainen22 Jun 2022 10:39 UTC

5 points

1 comment1 min readLW link

Solstice Movie Review: Summer Wars

SebastianG 22 Jun 2022 1:09 UTC

22 points

6 comments1 min readLW link

Security Mindset: Lessons from 20+ years of Software Security Failures Relevant to AGI Alignment

elspood21 Jun 2022 23:55 UTC

370 points

42 comments7 min readLW link 1 review

A Quick List of Some Problems in AI Alignment As A Field

Nicholas Kross21 Jun 2022 23:23 UTC

75 points

12 comments6 min readLW link

(www.thinkingmuchbetter.com)

[Question] What is the difference between AI misalignment and bad programming?

puzzleGuzzle21 Jun 2022 21:52 UTC

6 points

2 comments1 min readLW link

What I mean by the phrase “getting intimate with reality”

Luise Woehlke21 Jun 2022 19:42 UTC

7 points

0 comments2 min readLW link

(forum.effectivealtruism.org)

What I mean by the phrase “taking ideas seriously”

Luise Woehlke21 Jun 2022 19:42 UTC

5 points

2 comments1 min readLW link

(forum.effectivealtruism.org)

Hydrophobic Glasses Coating Review

jefftk21 Jun 2022 18:00 UTC

16 points

6 comments1 min readLW link

(www.jefftk.com)

Progress links and tweets, 2022-06-20

jasoncrawford21 Jun 2022 17:12 UTC

12 points

2 comments1 min readLW link

(rootsofprogress.org)

Debating Whether AI is Conscious Is A Distraction from Real Problems

sidhe_they21 Jun 2022 16:56 UTC

2 points

10 comments1 min readLW link

(techpolicy.press)

Mitigating the damage from unaligned ASI by cooperating with aliens that don’t exist yet

MSRayne21 Jun 2022 16:12 UTC

−8 points

7 comments6 min readLW link

The inordinately slow spread of good AGI conversations in ML

Rob Bensinger21 Jun 2022 16:09 UTC

173 points

62 comments8 min readLW link

Getting from an unaligned AGI to an aligned AGI?

Tor Økland Barstad21 Jun 2022 12:36 UTC

13 points

7 comments9 min readLW link

Common but neglected risk factors that may let you get Paxlovid

DirectedEvolution21 Jun 2022 7:34 UTC

29 points

8 comments4 min readLW link

Dagger of Detect Evil

lsusr21 Jun 2022 6:23 UTC

50 points

23 comments3 min readLW link

[Question] How easy/fast is it for a AGI to hack computers/a human brain?

Noosphere8921 Jun 2022 0:34 UTC

0 points

1 comment1 min readLW link

[Question] What is the most probable AI?

Zeruel01720 Jun 2022 23:26 UTC

−2 points

0 comments3 min readLW link

Evaluating a Corsi-Rosenthal Filter Cube

jefftk20 Jun 2022 19:40 UTC

13 points

4 comments1 min readLW link

(www.jefftk.com)

Survey re AIS/LTism office in NYC

RyanCarey20 Jun 2022 19:21 UTC

7 points

0 comments1 min readLW link

Is This Thing Sentient, Y/N?

Thane Ruthenis20 Jun 2022 18:37 UTC

4 points

10 comments7 min readLW link

Steam

abramdemski20 Jun 2022 17:38 UTC

156 points

13 comments5 min readLW link 1 review

Parable: The Bomb that doesn’t Explode

Lone Pine20 Jun 2022 16:41 UTC

14 points

5 comments2 min readLW link

On corrigibility and its basin

Donald Hobson20 Jun 2022 16:33 UTC

18 points

3 comments2 min readLW link

Announcing the DWATV Discord

Zvi20 Jun 2022 15:50 UTC

20 points

9 comments1 min readLW link

(thezvi.wordpress.com)

Key Papers in Language Model Safety

aog20 Jun 2022 15:00 UTC

40 points

1 comment22 min readLW link

Relationship Advice Repository

Ruby20 Jun 2022 14:39 UTC

110 points

36 comments38 min readLW link

Adaptation Executors and the Telos Margin

Plinthist20 Jun 2022 13:06 UTC

2 points

8 comments5 min readLW link