All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb Mar AprMayJun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 3031

Humans, chimpanzees and other animals

gjmMay 30, 2023, 11:53 PM

21 points

15 votes

Overall karma indicates overall quality.

18 comments1 min readLW link

The case for removing alignment and ML research from the training dataset

berenMay 30, 2023, 8:54 PM

50 points

23 votes

Overall karma indicates overall quality.

8 comments5 min readLW link

Why Job Displacement Predictions are Wrong: Explanations of Cognitive Automation

Moritz WallawitschMay 30, 2023, 8:43 PM

−5 points

3 votes

Overall karma indicates overall quality.

0 comments8 min readLW link

PaLM-2 & GPT-4 in “Extrapolating GPT-N performance”

Lukas FinnvedenMay 30, 2023, 6:33 PM

57 points

27 votes

Overall karma indicates overall quality.

6 comments6 min readLW link

Why I don’t think that the probability that AGI kills everyone is roughly 1 (but rather around 0.995).

BastumannenMay 30, 2023, 5:54 PM

−6 points

5 votes

Overall karma indicates overall quality.

0 comments2 min readLW link

AI X-risk is a possible solution to the Fermi Paradox

magic9mushroomMay 30, 2023, 5:42 PM

5 points

15 votes

Overall karma indicates overall quality.

22 comments2 min readLW link 2 reviews

LIMA: Less Is More for Alignment

Ulisse MiniMay 30, 2023, 5:10 PM

16 points

3 votes

Overall karma indicates overall quality.

6 comments1 min readLW link

(arxiv.org)

Boomerang—protocol to dissolve some commitment races

Filip SondejMay 30, 2023, 4:21 PM

37 points

17 votes

Overall karma indicates overall quality.

10 comments8 min readLW link

Announcing Apollo Research

Marius Hobbhahn, beren, Lee Sharkey, Lucius Bushnaq, Dan Braun, Mikita Balesni and Jérémy Scheurer

May 30, 2023, 4:17 PM

217 points

94 votes

Overall karma indicates overall quality.

11 comments8 min readLW link

Advice for new alignment people: Info Max

Jonas HallgrenMay 30, 2023, 3:42 PM

23 points

14 votes

Overall karma indicates overall quality.

4 comments5 min readLW link

[Question] Who is liable for AI?

jmhMay 30, 2023, 1:54 PM

14 points

3 votes

Overall karma indicates overall quality.

4 comments1 min readLW link

AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI

Dan H and Orpheus16

May 30, 2023, 11:52 AM

20 points

8 votes

Overall karma indicates overall quality.

0 comments6 min readLW link

(newsletter.safe.ai)

The bullseye framework: My case against AI doom

titotalMay 30, 2023, 11:52 AM

89 points

65 votes

Overall karma indicates overall quality.

35 comments17 min readLW link

Statement on AI Extinction—Signed by AGI Labs, Top Academics, and Many Other Notable Figures

Dan HMay 30, 2023, 9:05 AM

382 points

165 votes

Overall karma indicates overall quality.

78 comments1 min readLW link 1 review

(www.safe.ai)

Theoretical Limitations of Autoregressive Models

Gabriel WuMay 30, 2023, 2:37 AM

20 points

11 votes

Overall karma indicates overall quality.

1 comment10 min readLW link

(gabrieldwu.github.io)

A book review for “Animal Weapons” and cross-applying the lessons to x-risk

Habeeb AbdulfatahMay 30, 2023, 12:58 AM

−6 points

4 votes

Overall karma indicates overall quality.

1 comment1 min readLW link

(www.super-linear.org)

Without a trajectory change, the development of AGI is likely to go badly

Max HMay 29, 2023, 11:42 PM

21 points

7 votes

Overall karma indicates overall quality.

2 comments13 min readLW link

Winners-take-how-much?

YonatanKMay 29, 2023, 9:56 PM

3 points

5 votes

Overall karma indicates overall quality.

2 comments3 min readLW link

Reply to a fertility doctor concerning polygenic embryo screening

GeneSmithMay 29, 2023, 9:50 PM

59 points

29 votes

Overall karma indicates overall quality.

6 comments8 min readLW link

Sentience matters

So8resMay 29, 2023, 9:25 PM

144 points

90 votes

Overall karma indicates overall quality.

96 comments2 min readLW link

Wikipedia as an introduction to the alignment problem

SoerenMindMay 29, 2023, 6:43 PM

83 points

46 votes

Overall karma indicates overall quality.

10 comments1 min readLW link

(en.wikipedia.org)

[Question] What are some of the best introductions/breakdowns of AI existential risk for those unfamiliar?

Isaac KingMay 29, 2023, 5:04 PM

17 points

5 votes

Overall karma indicates overall quality.

2 comments1 min readLW link

Creating Flashcards with LLMs

Diogo CruzMay 29, 2023, 4:55 PM

15 points

12 votes

Overall karma indicates overall quality.

3 comments9 min readLW link

On the Impossibility of Intelligent Paperclip Maximizers

Michael SimkinMay 29, 2023, 4:55 PM

−21 points

13 votes

Overall karma indicates overall quality.

5 comments4 min readLW link

Minimum Viable Exterminator

Richard HorvathMay 29, 2023, 4:32 PM

14 points

12 votes

Overall karma indicates overall quality.

5 comments5 min readLW link

An LLM-based “exemplary actor”

Roman LeventovMay 29, 2023, 11:12 AM

16 points

5 votes

Overall karma indicates overall quality.

0 comments12 min readLW link

Aligning an H-JEPA agent via training on the outputs of an LLM-based “exemplary actor”

Roman LeventovMay 29, 2023, 11:08 AM

12 points

8 votes

Overall karma indicates overall quality.

10 comments30 min readLW link

Gemini will bring the next big timeline update

p.b.May 29, 2023, 6:05 AM

50 points

36 votes

Overall karma indicates overall quality.

6 comments1 min readLW link

Proposed Alignment Technique: OSNR (Output Sanitization via Noising and Reconstruction) for Safer Usage of Potentially Misaligned AGI

sudoMay 29, 2023, 1:35 AM

14 points

4 votes

Overall karma indicates overall quality.

9 comments6 min readLW link

Morality is Accidental & Self-Congratulatory

ymeskhoutMay 29, 2023, 12:40 AM

26 points

32 votes

Overall karma indicates overall quality.

40 comments5 min readLW link

TinyStories: Small Language Models That Still Speak Coherent English

Ulisse MiniMay 28, 2023, 10:23 PM

67 points

34 votes

Overall karma indicates overall quality.

8 comments2 min readLW link

(arxiv.org)

“Membranes” is better terminology than “boundaries” alone

Chris Lakin and the gears to ascension

May 28, 2023, 10:16 PM

30 points

14 votes

Overall karma indicates overall quality.

12 comments3 min readLW link

The king token

p.b.May 28, 2023, 7:18 PM

17 points

8 votes

Overall karma indicates overall quality.

0 comments4 min readLW link

Language Agents Reduce the Risk of Existential Catastrophe

cdkg and Simon Goldstein

May 28, 2023, 7:10 PM

39 points

35 votes

Overall karma indicates overall quality.

14 comments26 min readLW link

Devil’s Advocate: Adverse Selection Against Conscientiousness

lionhearted (Sebastian Marshall)May 28, 2023, 5:53 PM

10 points

6 votes

Overall karma indicates overall quality.

2 comments1 min readLW link

Reacts now enabled on 100% of posts, though still just experimenting

RubyMay 28, 2023, 5:36 AM

88 points

37 votes

Overall karma indicates overall quality.

73 comments2 min readLW link

Kelly betting vs expectation maximization

MorgneticFieldMay 28, 2023, 1:54 AM

35 points

23 votes

Overall karma indicates overall quality.

33 comments5 min readLW link

Twin Cities ACX Meetup—June 2023

Timothy M.May 27, 2023, 8:11 PM

1 point

1 vote

Overall karma indicates overall quality.

1 comment1 min readLW link

Project Idea: Challenge Groups for Alignment Researchers

Adam ZernerMay 27, 2023, 8:10 PM

13 points

9 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

Introspective Bayes

False NameMay 27, 2023, 7:35 PM

−3 points

7 votes

Overall karma indicates overall quality.

2 comments16 min readLW link

Should Rational Animations invite viewers to read content on LessWrong?

WriterMay 27, 2023, 7:26 PM

40 points

15 votes

Overall karma indicates overall quality.

9 comments3 min readLW link

Who are the Experts on Cryonics?

Mati_RoyMay 27, 2023, 7:24 PM

30 points

15 votes

Overall karma indicates overall quality.

9 comments1 min readLW link

(biostasis.substack.com)

AI and Planet Earth are incompatible.

archeonMay 27, 2023, 6:59 PM

−4 points

7 votes

Overall karma indicates overall quality.

2 comments1 min readLW link

South Bay ACX/LW Meetup

ISMay 27, 2023, 5:25 PM

2 points

1 vote

Overall karma indicates overall quality.

0 comments1 min readLW link

Hands-On Experience Is Not Magic

Thane RuthenisMay 27, 2023, 4:57 PM

22 points

32 votes

Overall karma indicates overall quality.

14 comments5 min readLW link

Is Deontological AI Safe? [Feedback Draft]

Dan H and William D'Alessandro

May 27, 2023, 4:39 PM

19 points

16 votes

Overall karma indicates overall quality.

15 comments20 min readLW link

San Francisco ACX Meetup “First Saturday” June 3, 1 pm

guenaelMay 27, 2023, 1:58 PM

1 point

1 vote

Overall karma indicates overall quality.

0 comments1 min readLW link

Papers on protein design

alexlyzhovMay 27, 2023, 1:18 AM

9 points

5 votes

Overall karma indicates overall quality.

0 comments3 min readLW link

D&D.Sci 5E: Return of the League of Defenders

aphyerMay 26, 2023, 8:39 PM

42 points

14 votes

Overall karma indicates overall quality.

11 comments3 min readLW link

Seeking (Paid) Case Studies on Standards

HoldenKarnofskyMay 26, 2023, 5:58 PM

69 points

21 votes

Overall karma indicates overall quality.

9 comments11 min readLW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer