Page 2
- [Question] How much do personal biases in risk assessment affect assessment of AI risks? by Gordon Seidoh Worley (May 3, 2023, 6:12 AM) · 10 points · 8 comments · 1 min read · LW link
- Communication strategies for autism, with examples by stonefly (May 3, 2023, 5:25 AM) · 16 points · 2 comments · 7 min read · LW link
- Understand how other people think: a theory of worldviews. by spencerg (May 3, 2023, 3:57 AM) · 2 points · 8 comments · LW link
- “Copilot” type AI integration could lead to training data needed for AGI by anithite (May 3, 2023, 12:57 AM) · 8 points · 0 comments · 2 min read · LW link
- Averting Catastrophe: Decision Theory for COVID-19, Climate Change, and Potential Disasters of All Kinds by JakubK (May 2, 2023, 10:50 PM) · 10 points · 0 comments · LW link
- A Case for the Least Forgiving Take On Alignment by Thane Ruthenis (May 2, 2023, 9:34 PM) · 100 points · 85 comments · 22 min read · LW link
- Are Emergent Abilities of Large Language Models a Mirage? [linkpost] by Matthew Barnett (May 2, 2023, 9:01 PM) · 53 points · 19 comments · 1 min read · LW link (arxiv.org)
- Does descaling a kettle help? Theory and practice by philh (May 2, 2023, 8:20 PM) · 35 points · 25 comments · 8 min read · LW link (reasonableapproximation.net)
- Avoiding xrisk from AI doesn’t mean focusing on AI xrisk by Stuart_Armstrong (May 2, 2023, 7:27 PM) · 67 points · 7 comments · 3 min read · LW link
- AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks by ozhang, Dan H and Orpheus16 (May 2, 2023, 6:41 PM) · 32 points · 0 comments · 5 min read · LW link (newsletter.safe.ai)
- My best system yet: text-based project management by jt (May 2, 2023, 5:44 PM) · 6 points · 8 comments · 5 min read · LW link
- [Question] What’s the state of AI safety in Japan? by ChristianKl (May 2, 2023, 5:06 PM) · 5 points · 1 comment · 1 min read · LW link
- Five Worlds of AI (by Scott Aaronson and Boaz Barak) by mishka (May 2, 2023, 1:23 PM) · 22 points · 6 comments · 1 min read · LW link · 1 review (scottaaronson.blog)
- Systems that cannot be unsafe cannot be safe by Davidmanheim (May 2, 2023, 8:53 AM) · 62 points · 27 comments · 2 min read · LW link
- AGI safety career advice by Richard_Ngo (May 2, 2023, 7:36 AM) · 132 points · 24 comments · 13 min read · LW link
- An Impossibility Proof Relevant to the Shutdown Problem and Corrigibility by Audere (May 2, 2023, 6:52 AM) · 66 points · 13 comments · 9 min read · LW link
- Some Thoughts on Virtue Ethics for AIs by peligrietzer (May 2, 2023, 5:46 AM) · 83 points · 8 comments · 4 min read · LW link
- Technological unemployment as another test for rationalist winning by RomanHauksson (May 2, 2023, 4:16 AM) · 14 points · 5 comments · 1 min read · LW link
- The Moral Copernican Principle by Legionnaire (May 2, 2023, 3:25 AM) · 5 points · 7 comments · 2 min read · LW link
- Open & Welcome Thread—May 2023 by Ruby (May 2, 2023, 2:58 AM) · 22 points · 41 comments · 1 min read · LW link
- Summaries of top forum posts (24th − 30th April 2023) by Zoe Williams (May 2, 2023, 2:30 AM) · 12 points · 1 comment · LW link
- AXRP Episode 21 - Interpretability for Engineers with Stephen Casper by DanielFilan (May 2, 2023, 12:50 AM) · 12 points · 1 comment · 66 min read · LW link
- Getting Your Eyes On by LoganStrohl (May 2, 2023, 12:33 AM) · 65 points · 11 comments · 14 min read · LW link
- What 2025 looks like by Ruby (May 1, 2023, 10:53 PM) · 75 points · 17 comments · 15 min read · LW link
- [Question] Natural Selection vs Gradient Descent by CuriousApe11 (May 1, 2023, 10:16 PM) · 4 points · 3 comments · 1 min read · LW link
- A[I] Zombie Apocalypse Is Already Upon Us by NickHarris (May 1, 2023, 10:02 PM) · −6 points · 4 comments · 2 min read · LW link
- Geoff Hinton Quits Google by Adam Shai (May 1, 2023, 9:03 PM) · 98 points · 14 comments · 1 min read · LW link
- The Apprentice Thread 2 by hath (May 1, 2023, 8:09 PM) · 50 points · 19 comments · 1 min read · LW link
- Budapest, Hungary – ACX Meetups Everywhere Spring 2023 by Richard Horvath, Timothy Underwood and marta_k (May 1, 2023, 5:36 PM) · 4 points · 0 comments · 1 min read · LW link
- In favor of steelmanning by jp (May 1, 2023, 5:12 PM) · 36 points · 6 comments · LW link
- Shah (DeepMind) and Leahy (Conjecture) Discuss Alignment Cruxes by OliviaJ, Rohin Shah, Connor Leahy and Andrea_Miotti (May 1, 2023, 4:47 PM) · 96 points · 10 comments · 30 min read · LW link
- Distinguishing misuse is difficult and uncomfortable by lemonhope (May 1, 2023, 4:23 PM) · 17 points · 3 comments · 1 min read · LW link
- [Question] Does agency necessarily imply self-preservation instinct? by Mislav Jurić (May 1, 2023, 4:06 PM) · 5 points · 8 comments · 1 min read · LW link
- What Boston Can Teach Us About What a Woman Is by ymeskhout (May 1, 2023, 3:34 PM) · 18 points · 45 comments · 12 min read · LW link
- The Rocket Alignment Problem, Part 2 by Zvi (May 1, 2023, 2:30 PM) · 40 points · 20 comments · 9 min read · LW link (thezvi.wordpress.com)
- Socialist Democratic-Republic GAME: 12 Amendments to the Constitutions of the Free World by monkymind (May 1, 2023, 1:13 PM) · −34 points · 0 comments · 1 min read · LW link
- [Question] Where is all this evidence of UFOs? by Logan Zoellner (May 1, 2023, 12:13 PM) · 29 points · 42 comments · 1 min read · LW link
- LessWrong Community Weekend 2023 [Applications now closed] by Henry Prowbell (May 1, 2023, 9:31 AM) · 43 points · 0 comments · 6 min read · LW link
- LessWrong Community Weekend 2023 [Applications now closed] by Henry Prowbell (May 1, 2023, 9:08 AM) · 89 points · 0 comments · 6 min read · LW link
- [Question] In AI Risk what is the base model of the AI? by jmh (May 1, 2023, 3:25 AM) · 3 points · 1 comment · 1 min read · LW link
- Hell is Game Theory Folk Theorems by jessicata (May 1, 2023, 3:16 AM) · 81 points · 102 comments · 5 min read · LW link · 1 review (unstableontology.com)
- Safety standards: a framework for AI regulation by joshc (May 1, 2023, 12:56 AM) · 19 points · 0 comments · 8 min read · LW link
- neuron spike computational capacity by bhauth (May 1, 2023, 12:28 AM) · 17 points · 0 comments · 2 min read · LW link
- Cult of Error by bayesyatina (Apr 30, 2023, 11:33 PM UTC) · 5 points · 2 comments · 3 min read · LW link
- How can one rationally have very high or very low probabilities of extinction in a pre-paradigmatic field? by Shmi (Apr 30, 2023, 9:53 PM UTC) · 42 points · 15 comments · 1 min read · LW link
- A small update to the Sparse Coding interim research report by Lee Sharkey, Dan Braun and beren (Apr 30, 2023, 7:54 PM UTC) · 61 points · 5 comments · 1 min read · LW link
- Discussion about AI Safety funding (FB transcript) by Orpheus16 (Apr 30, 2023, 7:05 PM UTC) · 75 points · 8 comments · LW link
- Support me in a Week-Long Picketing Campaign Near OpenAI’s HQ: Seeking Support and Ideas from the LessWrong Community by Percy (Apr 30, 2023, 5:48 PM UTC) · −21 points · 15 comments · 1 min read · LW link
- money ≠ value by stonefly (Apr 30, 2023, 5:47 PM UTC) · 2 points · 3 comments · 3 min read · LW link
- Vaccine Policies Need Updating by jefftk (Apr 30, 2023, 5:20 PM UTC) · 11 points · 0 comments · 1 min read · LW link (www.jefftk.com)