All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All 123 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Outreach success: Intro to AI risk that has been successful

Michael Tontchev1 Jun 2023 23:12 UTC

83 points

8 comments74 min readLW link

(medium.com)

Open Source LLMs Can Now Actively Lie

Josh Levy1 Jun 2023 22:03 UTC

6 points

0 comments3 min readLW link

Safe AI and moral AI

William D'Alessandro1 Jun 2023 21:36 UTC

−3 points

0 comments10 min readLW link

AI #14: A Very Good Sentence

Zvi1 Jun 2023 21:30 UTC

118 points

30 comments65 min readLW link

(thezvi.wordpress.com)

Four levels of understanding decision theory

Max H1 Jun 2023 20:55 UTC

12 points

11 comments4 min readLW link

Things I Learned by Spending Five Thousand Hours In Non-EA Charities

jenn1 Jun 2023 20:48 UTC

438 points

35 comments8 min readLW link 1 review

(jenn.site)

self-improvement-executors are not goal-maximizers

bhauth1 Jun 2023 20:46 UTC

14 points

0 comments1 min readLW link

Experimental Fat Loss

johnlawrenceaspden1 Jun 2023 20:26 UTC

23 points

5 comments1 min readLW link

Yudkowsky vs Hanson on FOOM: Whose Predictions Were Better?

1a3orn1 Jun 2023 19:36 UTC

144 points

76 comments24 min readLW link 2 reviews

Progress links and tweets, 2023-06-01

jasoncrawford1 Jun 2023 19:03 UTC

10 points

3 comments1 min readLW link

(rootsofprogress.org)

[Question] When does an AI become intelligent enough to become self-aware and power-seeking?

FinalFormal21 Jun 2023 18:09 UTC

1 point

1 comment1 min readLW link

Uncertainty about the future does not imply that AGI will go well

Lauro Langosco1 Jun 2023 17:38 UTC

62 points

11 comments7 min readLW link

[Question] What are the arguments for/against FOOM?

FinalFormal21 Jun 2023 17:23 UTC

8 points

0 comments1 min readLW link

Change my mind: Veganism entails trade-offs, and health is one of the axes

Elizabeth1 Jun 2023 17:10 UTC

161 points

85 comments19 min readLW link 2 reviews

(acesounderglass.com)

The unspoken but ridiculous assumption of AI doom: the hidden doom assumption

Christopher King1 Jun 2023 17:01 UTC

−9 points

1 comment3 min readLW link

Don’t waste your time meditating on meditation retreats!

EternallyBlissful1 Jun 2023 16:56 UTC

23 points

7 comments11 min readLW link

[Request]: Use “Epilogenics” instead of “Eugenics” in most circumstances

GeneSmith1 Jun 2023 15:36 UTC

58 points

51 comments1 min readLW link

Book Club: Thomas Schelling’s “The Strategy of Conflict”

Optimization Process1 Jun 2023 15:29 UTC

6 points

1 comment1 min readLW link

Probably tell your friends when they make big mistakes

Chi Nguyen1 Jun 2023 14:30 UTC

15 points

1 comment7 min readLW link

Yes, avoiding extinction from AI is an urgent priority: a response to Seth Lazar, Jeremy Howard, and Arvind Narayanan.

Soroush Pour1 Jun 2023 13:38 UTC

17 points

0 comments5 min readLW link

(www.soroushjp.com)

Work dumber not smarter

lemonhope1 Jun 2023 12:40 UTC

101 points

17 comments3 min readLW link

Short Remark on the (subjective) mathematical ‘naturalness’ of the Nanda—Lieberum addition modulo 113 algorithm

carboniferous_umbraculum 1 Jun 2023 11:31 UTC

104 points

12 comments2 min readLW link

How will they feed us

meijer19731 Jun 2023 8:49 UTC

4 points

3 comments5 min readLW link

“LLMs Don’t Have a Coherent Model of the World”—What it Means, Why it Matters

Davidmanheim1 Jun 2023 7:46 UTC

32 points

2 comments7 min readLW link

General intelligence: what is it, what makes it hard, and will we have it soon?

homeopathicsyzygy1 Jun 2023 6:46 UTC

2 points

0 comments21 min readLW link

Maximal Sentience: A Sentience Spectrum and Test Foundation

Snowyiu1 Jun 2023 6:45 UTC

1 point

2 comments4 min readLW link

Re: The Crux List

Logan Zoellner1 Jun 2023 4:48 UTC

11 points

0 comments2 min readLW link

An explanation of decision theories

metachirality1 Jun 2023 3:42 UTC

20 points

4 comments5 min readLW link

Dancing to Positional Calling

jefftk1 Jun 2023 2:40 UTC

11 points

2 comments2 min readLW link

(www.jefftk.com)

Intrinsic vs. Extrinsic Alignment

Alfonso Pérez Escudero1 Jun 2023 1:06 UTC

1 point

1 comment3 min readLW link

Limiting factors to predict AI take-off speed

Alfonso Pérez Escudero31 May 2023 23:19 UTC

1 point

0 comments6 min readLW link

Unpredictability and the Increasing Difficulty of AI Alignment for Increasingly Intelligent AI

Max_He-Ho31 May 2023 22:25 UTC

5 points

2 comments20 min readLW link

Shutdown-Seeking AI

Simon Goldstein31 May 2023 22:19 UTC

50 points

32 comments15 min readLW link

Full Automation is Unlikely and Unnecessary for Explosive Growth

aog31 May 2023 21:55 UTC

28 points

3 comments5 min readLW link

LessWrong Community Weekend 2023 Updates: Keynote Speaker Malcolm Ocean, Remaining Tickets and More

Henry Prowbell31 May 2023 21:53 UTC

23 points

0 comments2 min readLW link

The Divine Move Paradox & Thinking as a Species

Christopher James Hart31 May 2023 21:38 UTC

9 points

8 comments3 min readLW link

Intent-aligned AI systems deplete human agency: the need for agency foundations research in AI safety

catubc31 May 2023 21:18 UTC

26 points

4 comments11 min readLW link

[Question] How much overlap is there between the utility function of GPT-n and GPT-(n+1), assuming both are near AGI?

Phosphorous31 May 2023 20:28 UTC

2 points

0 comments2 min readLW link

My AI-risk cartoon

pre31 May 2023 19:46 UTC

6 points

0 comments1 min readLW link

Evaluation Evidence Reconstructions of Mock Crimes Submission 3

Alan E Dunne31 May 2023 19:03 UTC

−1 points

0 comments3 min readLW link

Improving Mathematical Reasoning with-Process Supervision

p.b.31 May 2023 19:00 UTC

14 points

3 comments1 min readLW link

(openai.com)

The Crux List

Zvi31 May 2023 18:30 UTC

72 points

19 comments33 min readLW link

(thezvi.wordpress.com)

Stages of Survival

Zvi31 May 2023 18:30 UTC

44 points

0 comments17 min readLW link

(thezvi.wordpress.com)

Types and Degrees of Alignment

Zvi31 May 2023 18:30 UTC

36 points

10 comments8 min readLW link

(thezvi.wordpress.com)

To Predict What Happens, Ask What Happens

Zvi31 May 2023 18:30 UTC

81 points

0 comments9 min readLW link

(thezvi.wordpress.com)

A push towards interactive transformer decoding

R0bk31 May 2023 17:56 UTC

3 points

0 comments2 min readLW link

(github.com)

Neuroevolution, Social Intelligence, and Logic

vinnik.dmitry0731 May 2023 17:54 UTC

1 point

0 comments10 min readLW link

Contrast Pairs Drive the Empirical Performance of Contrast Consistent Search (CCS)

Scott Emmons31 May 2023 17:09 UTC

97 points

1 comment6 min readLW link 1 review

Cosmopolitan values don’t come free

So8res31 May 2023 15:58 UTC

138 points

87 comments1 min readLW link

[Question] Arguments Against Fossil Future?

Sable31 May 2023 13:41 UTC

13 points

29 comments1 min readLW link