All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Limiting factors to predict AI take-off speed

Alfonso Pérez Escudero31 May 2023 23:19 UTC

1 point

0 comments6 min readLW link

Unpredictability and the Increasing Difficulty of AI Alignment for Increasingly Intelligent AI

Max_He-Ho31 May 2023 22:25 UTC

5 points

2 comments20 min readLW link

Shutdown-Seeking AI

Simon Goldstein31 May 2023 22:19 UTC

50 points

32 comments15 min readLW link

Full Automation is Unlikely and Unnecessary for Explosive Growth

aog31 May 2023 21:55 UTC

28 points

3 comments5 min readLW link

LessWrong Community Weekend 2023 Updates: Keynote Speaker Malcolm Ocean, Remaining Tickets and More

Henry Prowbell31 May 2023 21:53 UTC

23 points

0 comments2 min readLW link

The Divine Move Paradox & Thinking as a Species

Christopher James Hart31 May 2023 21:38 UTC

9 points

8 comments3 min readLW link

Intent-aligned AI systems deplete human agency: the need for agency foundations research in AI safety

catubc31 May 2023 21:18 UTC

26 points

4 comments11 min readLW link

[Question] How much overlap is there between the utility function of GPT-n and GPT-(n+1), assuming both are near AGI?

Phosphorous31 May 2023 20:28 UTC

2 points

0 comments2 min readLW link

My AI-risk cartoon

pre31 May 2023 19:46 UTC

6 points

0 comments1 min readLW link

Evaluation Evidence Reconstructions of Mock Crimes Submission 3

Alan E Dunne31 May 2023 19:03 UTC

−1 points

0 comments3 min readLW link

Improving Mathematical Reasoning with-Process Supervision

p.b.31 May 2023 19:00 UTC

14 points

3 comments1 min readLW link

(openai.com)

The Crux List

Zvi31 May 2023 18:30 UTC

72 points

19 comments33 min readLW link

(thezvi.wordpress.com)

Stages of Survival

Zvi31 May 2023 18:30 UTC

44 points

0 comments17 min readLW link

(thezvi.wordpress.com)

Types and Degrees of Alignment

Zvi31 May 2023 18:30 UTC

36 points

10 comments8 min readLW link

(thezvi.wordpress.com)

To Predict What Happens, Ask What Happens

Zvi31 May 2023 18:30 UTC

81 points

0 comments9 min readLW link

(thezvi.wordpress.com)

A push towards interactive transformer decoding

R0bk31 May 2023 17:56 UTC

3 points

0 comments2 min readLW link

(github.com)

Neuroevolution, Social Intelligence, and Logic

vinnik.dmitry0731 May 2023 17:54 UTC

1 point

0 comments10 min readLW link

Contrast Pairs Drive the Empirical Performance of Contrast Consistent Search (CCS)

Scott Emmons31 May 2023 17:09 UTC

97 points

1 comment6 min readLW link 1 review

Cosmopolitan values don’t come free

So8res31 May 2023 15:58 UTC

138 points

87 comments1 min readLW link

[Question] Arguments Against Fossil Future?

Sable31 May 2023 13:41 UTC

13 points

29 comments1 min readLW link

On Objective Ethics, and a bit about boats

EndlessBlue31 May 2023 11:40 UTC

−7 points

3 comments2 min readLW link

Against Conflating Expertise: Distinguishing AI Development from AI Implication Analysis

Ratios31 May 2023 9:50 UTC

13 points

4 comments1 min readLW link

A rough model for P(AI doom)

Michael Tontchev31 May 2023 8:58 UTC

0 points

1 comment2 min readLW link

[Question] What’s the consensus on porn?

FinalFormal231 May 2023 3:15 UTC

5 points

19 comments1 min readLW link

Product Endorsement: Food for sleep interruptions

Elizabeth31 May 2023 1:50 UTC

45 points

7 comments1 min readLW link

(acesounderglass.com)

Optimal Clothing

Gordon Seidoh Worley31 May 2023 1:00 UTC

31 points

8 comments6 min readLW link

Humans, chimpanzees and other animals

gjm30 May 2023 23:53 UTC

21 points

18 comments1 min readLW link

The case for removing alignment and ML research from the training dataset

beren30 May 2023 20:54 UTC

50 points

8 comments5 min readLW link

Why Job Displacement Predictions are Wrong: Explanations of Cognitive Automation

Moritz Wallawitsch30 May 2023 20:43 UTC

−5 points

0 comments8 min readLW link

PaLM-2 & GPT-4 in “Extrapolating GPT-N performance”

Lukas Finnveden30 May 2023 18:33 UTC

57 points

6 comments6 min readLW link

Why I don’t think that the probability that AGI kills everyone is roughly 1 (but rather around 0.995).

Bastumannen30 May 2023 17:54 UTC

−6 points

0 comments2 min readLW link

AI X-risk is a possible solution to the Fermi Paradox

magic9mushroom30 May 2023 17:42 UTC

5 points

22 comments2 min readLW link 2 reviews

LIMA: Less Is More for Alignment

Ulisse Mini30 May 2023 17:10 UTC

16 points

6 comments1 min readLW link

(arxiv.org)

Boomerang—protocol to dissolve some commitment races

Filip Sondej30 May 2023 16:21 UTC

37 points

10 comments8 min readLW link

Announcing Apollo Research

Marius Hobbhahn, beren, Lee Sharkey, Lucius Bushnaq, Dan Braun, Mikita Balesni and Jérémy Scheurer

30 May 2023 16:17 UTC

217 points

11 comments8 min readLW link

Advice for new alignment people: Info Max

Jonas Hallgren30 May 2023 15:42 UTC

23 points

4 comments5 min readLW link

[Question] Who is liable for AI?

jmh30 May 2023 13:54 UTC

14 points

4 comments1 min readLW link

AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI

Dan H and Orpheus16

30 May 2023 11:52 UTC

20 points

0 comments6 min readLW link

(newsletter.safe.ai)

The bullseye framework: My case against AI doom

titotal30 May 2023 11:52 UTC

89 points

35 comments17 min readLW link

Statement on AI Extinction—Signed by AGI Labs, Top Academics, and Many Other Notable Figures

Dan H30 May 2023 9:05 UTC

382 points

78 comments1 min readLW link 1 review

(www.safe.ai)

Theoretical Limitations of Autoregressive Models

Gabriel Wu30 May 2023 2:37 UTC

20 points

1 comment10 min readLW link

(gabrieldwu.github.io)

A book review for “Animal Weapons” and cross-applying the lessons to x-risk

Habeeb Abdulfatah30 May 2023 0:58 UTC

−6 points

1 comment1 min readLW link

(www.super-linear.org)

Without a trajectory change, the development of AGI is likely to go badly

Max H29 May 2023 23:42 UTC

16 points

2 comments13 min readLW link

Winners-take-how-much?

YonatanK29 May 2023 21:56 UTC

3 points

2 comments3 min readLW link

Reply to a fertility doctor concerning polygenic embryo screening

GeneSmith29 May 2023 21:50 UTC

59 points

6 comments8 min readLW link

Sentience matters

So8res29 May 2023 21:25 UTC

144 points

96 comments2 min readLW link

Wikipedia as an introduction to the alignment problem

SoerenMind29 May 2023 18:43 UTC

83 points

10 comments1 min readLW link

(en.wikipedia.org)

[Question] What are some of the best introductions/breakdowns of AI existential risk for those unfamiliar?

Isaac King29 May 2023 17:04 UTC

17 points

2 comments1 min readLW link

Creating Flashcards with LLMs

Diogo Cruz29 May 2023 16:55 UTC

15 points

3 comments9 min readLW link

On the Impossibility of Intelligent Paperclip Maximizers

Michael Simkin29 May 2023 16:55 UTC

−21 points

5 comments4 min readLW link