All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb Mar AprMayJun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 678 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

[Question] What constraints does deep learning place on alignment plans?

Garrett BakerMay 3, 2023, 8:40 PM

9 points

0 comments1 min readLW link

AGI rising: why we are in a new era of acute risk and increasing public awareness, and what to do now

Greg CMay 3, 2023, 8:26 PM

25 points

12 comments LW link

Formalizing the “AI x-risk is unlikely because it is ridiculous” argument

Christopher KingMay 3, 2023, 6:56 PM

48 points

17 comments3 min readLW link

[Question] List of notable people who believe in AI X-risk?

vlad.proexMay 3, 2023, 6:46 PM

14 points

4 comments1 min readLW link

[Question] LessWrong exporting?

axiomAdministratorMay 3, 2023, 6:34 PM

0 points

3 comments1 min readLW link

Progress links and tweets, 2023-05-03

jasoncrawfordMay 3, 2023, 4:23 PM

13 points

0 comments2 min readLW link

(rootsofprogress.org)

Personhood is a Religious Belief

jan SijanMay 3, 2023, 4:16 PM

−41 points

28 comments6 min readLW link

Slowing AI: Crunch time

Zach Stein-PerlmanMay 3, 2023, 3:00 PM

11 points

1 comment2 min readLW link

Finding Neurons in a Haystack: Case Studies with Sparse Probing

wesg and Neel Nanda

May 3, 2023, 1:30 PM

33 points

6 comments2 min readLW link 1 review

(arxiv.org)

Monthly Roundup #6: May 2023

ZviMay 3, 2023, 12:50 PM

31 points

12 comments24 min readLW link

(thezvi.wordpress.com)

[Question] How much do personal biases in risk assessment affect assessment of AI risks?

Gordon Seidoh WorleyMay 3, 2023, 6:12 AM

10 points

8 comments1 min readLW link

Communication strategies for autism, with examples

stoneflyMay 3, 2023, 5:25 AM

16 points

2 comments7 min readLW link

Understand how other people think: a theory of worldviews.

spencergMay 3, 2023, 3:57 AM

2 points

8 comments LW link

“Copilot” type AI integration could lead to training data needed for AGI

anithiteMay 3, 2023, 12:57 AM

8 points

0 comments2 min readLW link

Averting Catastrophe: Decision Theory for COVID-19, Climate Change, and Potential Disasters of All Kinds

JakubKMay 2, 2023, 10:50 PM

10 points

0 comments LW link

A Case for the Least Forgiving Take On Alignment

Thane RuthenisMay 2, 2023, 9:34 PM

100 points

85 comments22 min readLW link

Are Emergent Abilities of Large Language Models a Mirage? [linkpost]

Matthew BarnettMay 2, 2023, 9:01 PM

53 points

19 comments1 min readLW link

(arxiv.org)

Does descaling a kettle help? Theory and practice

philhMay 2, 2023, 8:20 PM

35 points

25 comments8 min readLW link

(reasonableapproximation.net)

Avoiding xrisk from AI doesn’t mean focusing on AI xrisk

Stuart_ArmstrongMay 2, 2023, 7:27 PM

67 points

7 comments3 min readLW link

AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks

ozhang, Dan H and Orpheus16

May 2, 2023, 6:41 PM

32 points

0 comments5 min readLW link

(newsletter.safe.ai)

My best system yet: text-based project management

jtMay 2, 2023, 5:44 PM

6 points

8 comments5 min readLW link

[Question] What’s the state of AI safety in Japan?

ChristianKlMay 2, 2023, 5:06 PM

5 points

1 comment1 min readLW link

Five Worlds of AI (by Scott Aaronson and Boaz Barak)

mishkaMay 2, 2023, 1:23 PM

22 points

6 comments1 min readLW link 1 review

(scottaaronson.blog)

Systems that cannot be unsafe cannot be safe

DavidmanheimMay 2, 2023, 8:53 AM

62 points

27 comments2 min readLW link

AGI safety career advice

Richard_NgoMay 2, 2023, 7:36 AM

132 points

24 comments13 min readLW link

An Impossibility Proof Relevant to the Shutdown Problem and Corrigibility

AudereMay 2, 2023, 6:52 AM

66 points

13 comments9 min readLW link

Some Thoughts on Virtue Ethics for AIs

peligrietzerMay 2, 2023, 5:46 AM

83 points

8 comments4 min readLW link

Technological unemployment as another test for rationalist winning

RomanHaukssonMay 2, 2023, 4:16 AM

14 points

5 comments1 min readLW link

The Moral Copernican Principle

LegionnaireMay 2, 2023, 3:25 AM

5 points

7 comments2 min readLW link

Open & Welcome Thread—May 2023

RubyMay 2, 2023, 2:58 AM

22 points

41 comments1 min readLW link

Summaries of top forum posts (24th − 30th April 2023)

Zoe WilliamsMay 2, 2023, 2:30 AM

12 points

1 comment LW link

AXRP Episode 21 - Interpretability for Engineers with Stephen Casper

DanielFilanMay 2, 2023, 12:50 AM

12 points

1 comment66 min readLW link

Getting Your Eyes On

LoganStrohlMay 2, 2023, 12:33 AM

65 points

11 comments14 min readLW link

What 2025 looks like

RubyMay 1, 2023, 10:53 PM

75 points

17 comments15 min readLW link

[Question] Natural Selection vs Gradient Descent

CuriousApe11May 1, 2023, 10:16 PM

4 points

3 comments1 min readLW link

A[I] Zombie Apocalypse Is Already Upon Us

NickHarrisMay 1, 2023, 10:02 PM

−6 points

4 comments2 min readLW link

Geoff Hinton Quits Google

Adam ShaiMay 1, 2023, 9:03 PM

98 points

14 comments1 min readLW link

The Apprentice Thread 2

hathMay 1, 2023, 8:09 PM

50 points

19 comments1 min readLW link

Budapest, Hungary – ACX Meetups Everywhere Spring 2023

Richard Horvath, Timothy Underwood and marta_k

May 1, 2023, 5:36 PM

4 points

0 comments1 min readLW link

In favor of steelmanning

jpMay 1, 2023, 5:12 PM

36 points

6 comments LW link

Shah (DeepMind) and Leahy (Conjecture) Discuss Alignment Cruxes

OliviaJ, Rohin Shah, Connor Leahy and Andrea_Miotti

May 1, 2023, 4:47 PM

96 points

10 comments30 min readLW link

Distinguishing misuse is difficult and uncomfortable

lemonhopeMay 1, 2023, 4:23 PM

17 points

3 comments1 min readLW link

[Question] Does agency necessarily imply self-preservation instinct?

Mislav JurićMay 1, 2023, 4:06 PM

5 points

8 comments1 min readLW link

What Boston Can Teach Us About What a Woman Is

ymeskhout1 May 2023 15:34 UTC

18 points

45 comments12 min readLW link

The Rocket Alignment Problem, Part 2

Zvi1 May 2023 14:30 UTC

40 points

20 comments9 min readLW link

(thezvi.wordpress.com)

Socialist Democratic-Republic GAME: 12 Amendments to the Constitutions of the Free World

monkymind1 May 2023 13:13 UTC

−34 points

0 comments1 min readLW link

[Question] Where is all this evidence of UFOs?

Logan Zoellner1 May 2023 12:13 UTC

29 points

42 comments1 min readLW link

LessWrong Community Weekend 2023 [Applications now closed]

Henry Prowbell1 May 2023 9:31 UTC

43 points

0 comments6 min readLW link

LessWrong Community Weekend 2023 [Applications now closed]

Henry Prowbell1 May 2023 9:08 UTC

89 points

0 comments6 min readLW link

[Question] In AI Risk what is the base model of the AI?

jmh1 May 2023 3:25 UTC

3 points

1 comment1 min readLW link