AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models

Dan H and Orpheus16

May 9, 2023, 3:26 PM

28 points

1 comment 4 min read LW link

(newsletter.safe.ai)

A Search for More ChatGPT / GPT-3.5 / GPT-4 “Unspeakable” Glitch Tokens

Martin Fell

May 9, 2023, 2:36 PM

26 points

9 comments 6 min read LW link

How to Interpret Prediction Market Prices as Probabilities

SimonM

May 9, 2023, 2:12 PM

14 points

1 comment 4 min read LW link

Stampy’s AI Safety Info—New Distillations #2 [April 2023]

markov

May 9, 2023, 1:31 PM

25 points

1 comment 1 min read LW link

(aisafety.info)

Quote quiz answer

jasoncrawford

May 9, 2023, 1:27 PM

19 points

0 comments 4 min read LW link

(rootsofprogress.org)

[Question] Does reversible computation let you compute the complexity class PSPACE as efficiently as normal computers compute the complexity class P?

Noosphere89

May 9, 2023, 1:18 PM

6 points

14 comments 1 min read LW link

EconTalk podcast: “Eliezer Yudkowsky on the Dangers of AI”

TekhneMakre

May 9, 2023, 11:14 AM

15 points

1 comment 1 min read LW link

(www.econtalk.org)

Most people should probably feel safe most of the time

Kaj_Sotala

May 9, 2023, 9:35 AM

95 points

28 comments 10 min read LW link

Summaries of top forum posts (1st to 7th May 2023)

Zoe Williams

May 9, 2023, 9:30 AM

21 points

0 comments LW link

Focusing on longevity research as a way to avoid the AI apocalypse

Random Trader

May 9, 2023, 4:47 AM

14 points

2 comments 2 min read LW link

When is Goodhart catastrophic?

Drake Thomas and Thomas Kwa

May 9, 2023, 3:59 AM

180 points

29 comments 8 min read LW link 1 review

Chilean AIS Hackathon Retrospective

agucova

May 9, 2023, 1:34 AM

9 points

0 comments LW link

Announcing “Key Phenomena in AI Risk” (facilitated reading group)

Nora_Ammann and particlemania

May 9, 2023, 12:31 AM

65 points

4 comments 2 min read LW link

Yoshua Bengio argues for tool-AI and to ban “executive-AI”

habryka

May 9, 2023, 12:13 AM

53 points

15 comments 7 min read LW link

(yoshuabengio.org)

South Bay ACX/LW Meetup

IS

May 8, 2023, 11:55 PM

2 points

0 comments 1 min read LW link

H-JEPA might be technically alignable in a modified form

Roman Leventov

May 8, 2023, 11:04 PM

12 points

2 comments 7 min read LW link

All AGI Safety questions welcome (especially basic ones) [May 2023]

steven0461

May 8, 2023, 10:30 PM

33 points

44 comments 2 min read LW link

Predictable updating about AI risk

Joe Carlsmith

May 8, 2023, 9:53 PM

294 points

25 comments 36 min read LW link 1 review

Annotated reply to Bengio’s “AI Scientists: Safe and Useful AI?”

Roman Leventov

May 8, 2023, 9:26 PM

18 points

2 comments 7 min read LW link

(yoshuabengio.org)

Are healthy choices effective for improving live expectancy anymore?

Christopher King

May 8, 2023, 9:25 PM

4 points

4 comments 1 min read LW link

LeCun’s “A Path Towards Autonomous Machine Intelligence” has an unsolved technical alignment problem

Steven Byrnes

May 8, 2023, 7:35 PM

140 points

37 comments 15 min read LW link

Product Endorsement: Apollo Neuro

Elizabeth

May 8, 2023, 7:00 PM

46 points

28 comments 5 min read LW link

(acesounderglass.com)

Acausal trade naturally results in the Nash bargaining solution

Christopher King

May 8, 2023, 6:13 PM

3 points

0 comments 4 min read LW link

Inference Speed is Not Unbounded

OneManyNone

May 8, 2023, 4:24 PM

35 points

32 comments 16 min read LW link

[Crosspost] Unveiling the American Public Opinion on AI Moratorium and Government Intervention: The Impact of Media Exposure

otto.barten

May 8, 2023, 2:09 PM

7 points

0 comments 6 min read LW link

(forum.effectivealtruism.org)

Thriving in the Weird Times: Preparing for the 100X Economy

Lucie Philippon and Charbel-Raphaël

May 8, 2023, 1:44 PM

23 points

16 comments 2 min read LW link

Housing and Transit Roundup #4

Zvi

May 8, 2023, 1:30 PM

25 points

0 comments 11 min read LW link

(thezvi.wordpress.com)

Dance Profit Sharing

jefftk

May 8, 2023, 1:10 PM

11 points

3 comments 2 min read LW link

(www.jefftk.com)

How “AGI” could end up being many different specialized AI’s stitched together

titotal

May 8, 2023, 12:32 PM

9 points

2 comments LW link

What does it take to ban a thing?

qbolec

May 8, 2023, 11:00 AM

66 points

18 comments 5 min read LW link

Solomonoff’s solipsism

Mergimio H. Doefevmil

May 8, 2023, 6:55 AM

−13 points

9 comments 1 min read LW link

A technical note on bilinear layers for interpretability

Lee Sharkey

May 8, 2023, 6:06 AM

59 points

0 comments 1 min read LW link

(arxiv.org)

[Question] Is EDT correct? Does “EDT” == “logical EDT” == “logical CDT”?

Vivek Hebbar

May 8, 2023, 2:07 AM

13 points

2 comments 1 min read LW link

LLM cognition is probably not human-like

Max H

May 8, 2023, 1:22 AM

26 points

15 comments 7 min read LW link

[Question] If alignment problem was unsolvable, would that avoid doom?

Kinrany

May 7, 2023, 10:13 PM

3 points

3 comments 1 min read LW link

An artificially structured argument for expecting AGI ruin

Rob Bensinger

May 7, 2023, 9:52 PM

91 points

26 comments 19 min read LW link

Where “the Sequences” Are Wrong

Thoth Hermes

May 7, 2023, 8:21 PM

−15 points

5 comments 14 min read LW link

(thothhermes.substack.com)

What’s wrong with being dumb?

Adam Zerner

May 7, 2023, 6:31 PM

14 points

17 comments 2 min read LW link

Categories of Arguing Style : Why being good among rationalists isn’t enough to argue with everyone

Camille Berger

May 7, 2023, 5:45 PM

16 points

0 comments 23 min read LW link

Self-Administered Gell-Mann Amnesia

krs

May 7, 2023, 5:44 PM

1 point

1 comment 1 min read LW link

Understanding mesa-optimization using toy models

tilmanr, rusheb, Guillaume Corlouer, Dan Valentine, afspies, mivanitskiy and Can

May 7, 2023, 5:00 PM

45 points

6 comments 10 min read LW link

How to have Polygenically Screened Children

GeneSmith

May 7, 2023, 4:01 PM

367 points

128 comments 27 min read LW link 1 review

Statistical models & the irrelevance of rare exceptions

patrissimo

May 7, 2023, 3:59 PM

36 points

6 comments 2 min read LW link

Let’s look for coherence theorems

Valdes

May 7, 2023, 2:45 PM

25 points

18 comments 6 min read LW link

Graphical Representations of Paul Christiano’s Doom Model

Nathan Young

May 7, 2023, 1:03 PM

7 points

0 comments LW link

An anthropomorphic AI dilemma

TsviBT

May 7, 2023, 12:44 PM

26 points

0 comments 7 min read LW link

Violin Supports

jefftk

May 7, 2023, 12:10 PM

12 points

1 comment 1 min read LW link

(www.jefftk.com)

Properties of Good Textbooks

niplav

May 7, 2023, 8:38 AM

50 points

11 comments 1 min read LW link

Against sacrificing AI transparency for generality gains

Ape in the coat

May 7, 2023, 6:52 AM

4 points

0 comments 2 min read LW link

TED talk by Eliezer Yudkowsky: Unleashing the Power of Artificial Intelligence

bayesed

May 7, 2023, 5:45 AM

49 points

36 comments 1 min read LW link

(www.youtube.com)
