Deferring

owencb 12 May 2022 23:56 UTC

18 points

2 comments 11 min read LW link

RLHF

Ansh Radhakrishnan 12 May 2022 21:18 UTC

18 points

5 comments 5 min read LW link

[Question] What to do when starting a business in an imminent-AGI world?

ryan_b 12 May 2022 21:07 UTC

25 points

7 comments 1 min read LW link

Interpretability’s Alignment-Solving Potential: Analysis of 7 Scenarios

Evan R. Murphy 12 May 2022 20:01 UTC

58 points

0 comments 59 min read LW link

Introduction to the sequence: Interpretability Research for the Most Important Century

Evan R. Murphy 12 May 2022 19:59 UTC

16 points

0 comments 8 min read LW link

A tentative dialogue with a Friendly-boxed-super-AGI on brain uploads

Ramiro P. 12 May 2022 19:40 UTC

1 point

12 comments 4 min read LW link

The Last Paperclip

Logan Zoellner 12 May 2022 19:25 UTC

65 points

15 comments 18 min read LW link

Deepmind’s Gato: Generalist Agent

Daniel Kokotajlo 12 May 2022 16:01 UTC

166 points

62 comments 1 min read LW link

“A Generalist Agent”: New DeepMind Publication

1a3orn 12 May 2022 15:30 UTC

79 points

43 comments 1 min read LW link

Covid 5/12/22: Other Priorities

Zvi 12 May 2022 13:30 UTC

31 points

4 comments 15 min read LW link

(thezvi.wordpress.com)

[Question] How would public media outlets need to be governed to cover all political views?

ChristianKl 12 May 2022 12:55 UTC

13 points

14 comments 1 min read LW link

[Question] What’s keeping concerned capabilities gain researchers from leaving the field?

sovran 12 May 2022 12:16 UTC

19 points

4 comments 1 min read LW link

Positive outcomes under an unaligned AGI takeover

Yitz 12 May 2022 7:45 UTC

19 points

10 comments 3 min read LW link

[Question] What are your recommendations for technical AI alignment podcasts?

Evan_Gaensbauer 11 May 2022 21:52 UTC

5 points

4 comments 1 min read LW link

Gracefully correcting uncalibrated shame

AF2022 11 May 2022 19:51 UTC

−31 points

34 comments 4 min read LW link

[Intro to brain-like-AGI safety] 14. Controlled AGI

Steven Byrnes 11 May 2022 13:17 UTC

48 points

25 comments 20 min read LW link

ProjectLawful.com: Eliezer’s latest story, past 1M words

Eliezer Yudkowsky 11 May 2022 6:18 UTC

239 points

112 comments 1 min read LW link 4 reviews

An Inside View of AI Alignment

Ansh Radhakrishnan 11 May 2022 2:16 UTC

32 points

2 comments 2 min read LW link

Fighting in various places for a really long time

KatjaGrace 11 May 2022 1:50 UTC

36 points

12 comments 4 min read LW link

(worldspiritsockpuppet.com)

Stuff I might do if I had covid

KatjaGrace 11 May 2022 0:00 UTC

39 points

9 comments 1 min read LW link

(worldspiritsockpuppet.com)

Crises Don’t Need Your Software

GabrielExists 10 May 2022 21:06 UTC

59 points

18 comments 6 min read LW link

Ceiling Fan Air Filter

jefftk 10 May 2022 14:20 UTC

18 points

9 comments 1 min read LW link

(www.jefftk.com)

The limits of AI safety via debate

Marius Hobbhahn 10 May 2022 13:33 UTC

36 points

8 comments 10 min read LW link

Examining Armstrong’s category of generalized models

Morgan_Rogers 10 May 2022 9:07 UTC

14 points

0 comments 7 min read LW link

Dath Ilani Rule of Law

David Udell 10 May 2022 6:17 UTC

26 points

25 comments 4 min read LW link

AI safety should be made more accessible using non text-based media

Massimog 10 May 2022 3:14 UTC

2 points

4 comments 4 min read LW link

LessWrong Now Has Dark Mode

jimrandomh 10 May 2022 1:21 UTC

144 points

31 comments 1 min read LW link

Conditions for mathematical equivalence of Stochastic Gradient Descent and Natural Selection

Oliver Sourbut 9 May 2022 21:38 UTC

73 points

19 comments 8 min read LW link 1 review

(www.oliversourbut.net)

AI Alignment YouTube Playlists

jacquesthibs and remember

9 May 2022 21:33 UTC

31 points

4 comments 1 min read LW link

When is AI safety research harmful?

NathanBarnard 9 May 2022 18:19 UTC

2 points

0 comments 8 min read LW link

A Bird’s Eye View of the ML Field [Pragmatic AI Safety #2]

Dan H and TW123

9 May 2022 17:18 UTC

165 points

8 comments 35 min read LW link

Introduction to Pragmatic AI Safety [Pragmatic AI Safety #1]

Dan H and TW123

9 May 2022 17:06 UTC

80 points

3 comments 6 min read LW link

Jobs: Help scale up LM alignment research at NYU

Sam Bowman 9 May 2022 14:12 UTC

60 points

1 comment 1 min read LW link

Microphone on Electric Mandolin

jefftk 9 May 2022 14:00 UTC

16 points

0 comments 1 min read LW link

(www.jefftk.com)

[Question] Thought experiment: Imagine you were assigned to help a random person in your community become as peaceful and joyful as the most peaceful and joyful person you’d ever met. What would you try?

nonzerosum 9 May 2022 13:53 UTC

5 points

5 comments 1 min read LW link

[Question] Willing to be your music mentor in exchange for video editing mentorship

monkymind 9 May 2022 11:57 UTC

8 points

0 comments 1 min read LW link

Updating Utility Functions

JustinShovelain and Joar Skalse

9 May 2022 9:44 UTC

42 points

6 comments 8 min read LW link

[Scribble] Bad Reasons Behind Different Systems and a Story with No Good Moral

Rana Dexsin 9 May 2022 5:21 UTC

9 points

0 comments 5 min read LW link

[Question] What is the best day to celebrate Smallpox Eradication Day?

Orborde 9 May 2022 4:02 UTC

7 points

6 comments 1 min read LW link

A reason behind bad systems, and moral implications of seeing this reason

Edward Pascal 9 May 2022 3:16 UTC

4 points

12 comments 2 min read LW link

An Alternative Interpretation of Physics

dadadarren 9 May 2022 0:52 UTC

19 points

10 comments 5 min read LW link

(www.sleepingbeautyproblem.com)

Ion Implantation: Theory, Equipment, Process, Alternatives

nomagicpill 8 May 2022 22:30 UTC

6 points

0 comments 16 min read LW link

(210ethan.github.io)

[Question] Long COVID risk: How to maintain an up to date risk assessment so we can go back to normal life?

Sameerishere 8 May 2022 19:56 UTC

19 points

34 comments 1 min read LW link

Demonstrating MWI by interfering human simulations

Yair Halberstadt 8 May 2022 17:28 UTC

12 points

25 comments 2 min read LW link

Notes from a conversation with Ing. Agr. Adriana Balzarini

Pablo Repetto 8 May 2022 15:56 UTC

5 points

0 comments 2 min read LW link

(pabloernesto.github.io)

Elementary Infra-Bayesianism

Jan 8 May 2022 12:23 UTC

41 points

3 comments 7 min read LW link

(universalprior.substack.com)

Cambridge LW Meetup: Books That Change

Darmani and Tony Wang

8 May 2022 5:23 UTC

5 points

0 comments 1 min read LW link

Video and Transcript of Presentation on Existential Risk from Power-Seeking AI

Joe Carlsmith 8 May 2022 3:50 UTC

20 points

1 comment 29 min read LW link

[Question] Algorithmic formalization of FDT?

Shmi 8 May 2022 1:36 UTC

12 points

8 comments 1 min read LW link

Experience on Meloxicam

jefftk 8 May 2022 0:30 UTC

9 points

5 comments 1 min read LW link

(www.jefftk.com)