
[Question] If alignment problem was unsolvable, would that avoid doom?

Kinrany May 7, 2023, 10:13 PM

3 points

6 votes

Overall karma indicates overall quality.

3 comments · 1 min read · LW link

An artificially structured argument for expecting AGI ruin

Rob Bensinger May 7, 2023, 9:52 PM

91 points

44 votes

26 comments · 19 min read · LW link

Where “the Sequences” Are Wrong

Thoth Hermes May 7, 2023, 8:21 PM

−15 points

14 votes

5 comments · 14 min read · LW link

(thothhermes.substack.com)

What’s wrong with being dumb?

Adam Zerner May 7, 2023, 6:31 PM

14 points

5 votes

17 comments · 2 min read · LW link

Categories of Arguing Style : Why being good among rationalists isn’t enough to argue with everyone

Camille Berger May 7, 2023, 5:45 PM

16 points

10 votes

0 comments · 23 min read · LW link

Self-Administered Gell-Mann Amnesia

krs May 7, 2023, 5:44 PM

1 point

1 vote

1 comment · 1 min read · LW link

Understanding mesa-optimization using toy models

tilmanr, rusheb, Guillaume Corlouer, Dan Valentine, afspies, mivanitskiy and Can

May 7, 2023, 5:00 PM

46 points

27 votes

6 comments · 10 min read · LW link

How to have Polygenically Screened Children

GeneSmith May 7, 2023, 4:01 PM

368 points

160 votes

128 comments · 27 min read · LW link · 1 review

Statistical models & the irrelevance of rare exceptions

patrissimo May 7, 2023, 3:59 PM

36 points

9 votes

6 comments · 2 min read · LW link

Let’s look for coherence theorems

Valdes May 7, 2023, 2:45 PM

25 points

12 votes

18 comments · 6 min read · LW link

Graphical Representations of Paul Christiano’s Doom Model

Nathan Young May 7, 2023, 1:03 PM

9 points

5 votes

0 comments · 1 min read · LW link

An anthropomorphic AI dilemma

TsviBT May 7, 2023, 12:44 PM

26 points

13 votes

0 comments · 7 min read · LW link

Violin Supports

jefftk May 7, 2023, 12:10 PM

12 points

5 votes

1 comment · 1 min read · LW link

(www.jefftk.com)

Properties of Good Textbooks

niplav May 7, 2023, 8:38 AM

50 points

19 votes

11 comments · 1 min read · LW link

Against sacrificing AI transparency for generality gains

Ape in the coat May 7, 2023, 6:52 AM

4 points

7 votes

0 comments · 2 min read · LW link

TED talk by Eliezer Yudkowsky: Unleashing the Power of Artificial Intelligence

bayesed May 7, 2023, 5:45 AM

49 points

31 votes

36 comments · 1 min read · LW link

(www.youtube.com)

Thinking of Convenience as an Economic Term

ozziegooen May 7, 2023, 1:21 AM

6 points

3 votes

0 comments · 12 min read · LW link

(forum.effectivealtruism.org)

Corrigibility, Much more detail than anyone wants to Read

Logan Zoellner May 7, 2023, 1:02 AM

27 points

10 votes

3 comments · 7 min read · LW link

Residual stream norms grow exponentially over the forward pass

StefanHex and TurnTrout

May 7, 2023, 12:46 AM

77 points

35 votes

24 comments · 9 min read · LW link

On the Loebner Silver Prize (a Turing test)

hold_my_fish May 7, 2023, 12:39 AM

18 points

9 votes

2 comments · 2 min read · LW link

Time and Energy Costs to Erase a Bit

DaemonicSigil May 6, 2023, 11:29 PM

24 points

11 votes

32 comments · 7 min read · LW link

How much do you believe your results?

Eric Neyman May 6, 2023, 8:31 PM

514 points

235 votes

18 comments · 15 min read · LW link · 4 reviews

(ericneyman.wordpress.com)

Long Covid Risks: 2023 Update

Elizabeth May 6, 2023, 6:20 PM

69 points

31 votes

11 comments · 4 min read · LW link

(acesounderglass.com)

Is “red” for GPT-4 the same as “red” for you?

Yusuke Hayashi May 6, 2023, 5:55 PM

9 points

8 votes

6 comments · 2 min read · LW link

The Broader Fossil Fuel Community

Jeffrey Heninger May 6, 2023, 2:49 PM

16 points

12 votes

1 comment · 3 min read · LW link

Estimating Norovirus Prevalence

jefftk May 6, 2023, 11:40 AM

16 points

4 votes

0 comments · 2 min read · LW link

(www.jefftk.com)

Alignment as Function Fitting

A.H. May 6, 2023, 11:38 AM

7 points

3 votes

0 comments · 12 min read · LW link

My preferred framings for reward misspecification and goal misgeneralisation

Yi-Yang May 6, 2023, 4:48 AM

27 points

10 votes

1 comment · 8 min read · LW link

You don’t need to be a genius to be in AI safety research

Claire Short May 6, 2023, 2:32 AM

15 points

28 votes

1 comment · 6 min read · LW link

Naturalist Collection

LoganStrohl May 6, 2023, 12:37 AM

71 points

21 votes

7 comments · 15 min read · LW link

Do you work at an AI lab? Please quit

Nik Samoylov May 5, 2023, 11:41 PM

−29 points

13 votes

9 comments · 1 min read · LW link

Explaining “Hell is Game Theory Folk Theorems”

electroswing May 5, 2023, 11:33 PM

57 points

31 votes

21 comments · 5 min read · LW link

Sleeping Beauty – the Death Hypothesis

Guillaume Charrier May 5, 2023, 11:32 PM

7 points

10 votes

8 comments · 5 min read · LW link

Orthogonal’s Formal-Goal Alignment theory of change

Tamsin Leake May 5, 2023, 10:36 PM

69 points

33 votes

13 comments · 4 min read · LW link

(carado.moe)

A smart enough LLM might be deadly simply if you run it for long enough

Mikhail Samin May 5, 2023, 8:49 PM

19 points

17 votes

16 comments · 8 min read · LW link

What Jason has been reading, May 2023: “Protopia,” complex systems, Daedalus vs. Icarus, and more

jasoncrawford May 5, 2023, 7:54 PM

26 points

9 votes

2 comments · 11 min read · LW link

(rootsofprogress.org)

CHAT Diplomacy: LLMs and National Security

SebastianG May 5, 2023, 7:45 PM

25 points

10 votes

6 comments · 7 min read · LW link

Linkpost for Accursed Farms Discussion / debate with AI expert Eliezer Yudkowsky

gilch May 5, 2023, 6:20 PM

14 points

9 votes

2 comments · 1 min read · LW link

(www.youtube.com)

Regulate or Compete? The China Factor in U.S. AI Policy (NAIR #2)

charles_m May 5, 2023, 5:43 PM

2 points

2 votes

1 comment · 7 min read · LW link

(navigatingairisks.substack.com)

Kingfisher Live CD Process

jefftk May 5, 2023, 5:00 PM

13 points

3 votes

0 comments · 3 min read · LW link

(www.jefftk.com)

What can we learn from Bayes about reasoning?

jasoncrawford May 5, 2023, 3:52 PM

22 points

7 votes

11 comments · 1 min read · LW link

[Question] Why not use active SETI to prevent AI Doom?

RomanS May 5, 2023, 2:41 PM

13 points

20 votes

13 comments · 1 min read · LW link

Investigating Emergent Goal-Like Behavior in Large Language Models using Experimental Economics

phelps-sg May 5, 2023, 11:15 AM

6 points

4 votes

1 comment · 4 min read · LW link

Monthly Shorts 4/23

Celer May 5, 2023, 7:20 AM

8 points

4 votes

1 comment · 3 min read · LW link

(keller.substack.com)

[Question] What is it like to be a compatibilist?

tslarm May 5, 2023, 2:56 AM

8 points

5 votes

72 comments · 1 min read · LW link

Transcript of a presentation on catastrophic risks from AI

RobertM May 5, 2023, 1:38 AM

6 points

1 vote

0 comments · 8 min read · LW link

How to get good at programming

Ulisse Mini May 5, 2023, 1:14 AM

40 points

25 votes

3 comments · 2 min read · LW link

A brief collection of Hinton’s recent comments on AGI risk

Kaj_Sotala May 4, 2023, 11:31 PM

148 points

58 votes

9 comments · 11 min read · LW link

Robin Hanson and I talk about AI risk

KatjaGrace May 4, 2023, 10:20 PM

39 points

11 votes

8 comments · 1 min read · LW link

(worldspiritsockpuppet.com)

Who regulates the regulators? We need to go beyond the review-and-approval paradigm

jasoncrawford May 4, 2023, 10:11 PM

122 points

41 votes

29 comments · 13 min read · LW link

(rootsofprogress.org)
