LessWrong Archive, page 1 (posts from late June 2023)
Bengio’s FAQ on Catastrophic AI Risks · Vaniver · Jun 29, 2023, 11:04 PM · 39 points · 0 comments · 1 min read · (yoshuabengio.org)
AGI & War · Calecute · Jun 29, 2023, 10:20 PM · 9 points · 1 comment · 1 min read
Biosafety Regulations (BMBL) and their relevance for AI · Štěpán Los · Jun 29, 2023, 7:22 PM · 4 points · 0 comments · 4 min read
Nature Releases A Stupid Editorial On AI Risk · Bentham's Bulldog · Jun 29, 2023, 7:00 PM · 2 points · 1 comment · 3 min read
AI Safety without Alignment: How humans can WIN against AI · vicchain · Jun 29, 2023, 5:53 PM · 1 point · 1 comment · 2 min read
Challenge proposal: smallest possible self-hardening backdoor for RLHF · Christopher King · Jun 29, 2023, 4:56 PM · 7 points · 0 comments · 2 min read
AI #18: The Great Debate Debate · Zvi · Jun 29, 2023, 4:20 PM · 47 points · 9 comments · 52 min read · (thezvi.wordpress.com)
Bruce Sterling on the AI mania of 2023 · Mitchell_Porter · Jun 29, 2023, 5:00 AM · 25 points · 1 comment · 1 min read · (www.newsweek.com)
Cheat sheet of AI X-risk · momom2 · Jun 29, 2023, 4:28 AM · 19 points · 1 comment · 7 min read
Anthropically Blind: the anthropic shadow is reflectively inconsistent · Christopher King · Jun 29, 2023, 2:36 AM · 43 points · 40 comments · 10 min read
One path to coherence: conditionalization · porby · Jun 29, 2023, 1:08 AM · 28 points · 4 comments · 4 min read
AXRP announcement: Survey, Store Closing, Patreon · DanielFilan · Jun 28, 2023, 11:40 PM · 14 points · 0 comments · 1 min read
Metaphors for AI, and why I don’t like them · boazbarak · Jun 28, 2023, 10:47 PM · 38 points · 18 comments · 12 min read
Transforming Democracy: A Unique Funding Opportunity for US Federal Approval Voting · Aaron Hamlin · Jun 28, 2023, 10:07 PM · 25 points · 6 comments · 2 min read
AGI x Animal Welfare: A High-EV Outreach Opportunity? · simeon_c · Jun 28, 2023, 8:44 PM · 29 points · 0 comments
A “weak” AGI may attempt an unlikely-to-succeed takeover · RobertM · Jun 28, 2023, 8:31 PM · 56 points · 17 comments · 3 min read
Progress links and tweets, 2023-06-28: “We can do big things again in Pennsylvania” · jasoncrawford · Jun 28, 2023, 8:23 PM · 14 points · 1 comment · 1 min read · (rootsofprogress.org)
[Question] What money-pumps exist, if any, for deontologists? · Daniel Kokotajlo · Jun 28, 2023, 7:08 PM · 39 points · 35 comments · 1 min read
[Question] What is your financial portfolio? · Algon · Jun 28, 2023, 6:39 PM · 11 points · 11 comments · 1 min read
Levels of safety for AI and other technologies · jasoncrawford · Jun 28, 2023, 6:35 PM · 16 points · 0 comments · 2 min read · (rootsofprogress.org)
LeCun says making a utility function is intractable · Iknownothing · Jun 28, 2023, 6:02 PM · 2 points · 3 comments · 1 min read
My research agenda in agent foundations · Alex_Altair · Jun 28, 2023, 6:00 PM · 75 points · 9 comments · 11 min read
AI Incident Sharing—Best practices from other fields and a comprehensive list of existing platforms · Štěpán Los · Jun 28, 2023, 5:21 PM · 20 points · 0 comments · 4 min read
The Case for Overconfidence is Overstated · Kevin Dorst · Jun 28, 2023, 5:21 PM · 50 points · 13 comments · 8 min read · (kevindorst.substack.com)
When do “brains beat brawn” in Chess? An experiment · titotal · Jun 28, 2023, 1:33 PM · 318 points · 106 comments · 7 min read · 2 reviews · (titotal.substack.com)
Giving an evolutionary explanation for Kahneman and Tversky’s insights on subjective satisfaction · Lionel · Jun 28, 2023, 12:17 PM · −7 points · 1 comment · 1 min read · (lionelpage.substack.com)
Nature: “Stop talking about tomorrow’s AI doomsday when AI poses risks today” · Ben Smith · Jun 28, 2023, 5:59 AM · 40 points · 8 comments · 2 min read · (www.nature.com)
Request: Put Carl Shulman’s recent podcast into an organized written format · Aryeh Englander · Jun 28, 2023, 2:58 AM · 19 points · 4 comments · 1 min read
Prediction Market: Will I Pull “The One Ring To Rule Them All?” · Connor Tabarrok · Jun 28, 2023, 2:41 AM · 1 point · 0 comments · 1 min read · (manifold.markets)
Carl Shulman on The Lunar Society (7 hour, two-part podcast) · ESRogs · Jun 28, 2023, 1:23 AM · 79 points · 17 comments · 1 min read · (www.dwarkeshpatel.com)
Brief summary of ai-plans.com · Iknownothing · Jun 28, 2023, 12:33 AM · 9 points · 4 comments · 2 min read · (ai-plans.com)
Catastrophic Risks from AI #6: Discussion and FAQ · Dan H, Mantas Mazeika and TW123 · Jun 27, 2023, 11:23 PM · 24 points · 1 comment · 13 min read · (arxiv.org)
Catastrophic Risks from AI #5: Rogue AIs · Dan H, Mantas Mazeika and TW123 · Jun 27, 2023, 10:06 PM · 15 points · 0 comments · 22 min read · (arxiv.org)
AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence · Dan H · Jun 27, 2023, 5:20 PM · 6 points · 0 comments
The Weight of the Future (Why The Apocalypse Can Be A Relief) · Sable · Jun 27, 2023, 5:18 PM · 18 points · 14 comments · 3 min read · (affablyevil.substack.com)
Aligning AI by optimizing for “wisdom” · JustinShovelain and Elliot Mckernon · Jun 27, 2023, 3:20 PM · 28 points · 8 comments · 12 min read
Freedom under Naturalistic Dualism · Arturo Macias · Jun 27, 2023, 2:34 PM · 1 point · 36 comments · 1 min read · (www.jneurophilosophy.com)
Munk AI debate: confusions and possible cruxes · Steven Byrnes · Jun 27, 2023, 2:18 PM · 244 points · 21 comments · 8 min read
Ateliers: Motivation · Stephen Fowler · Jun 27, 2023, 1:07 PM · 7 points · 0 comments · 2 min read
Self-Blinded Caffeine RCT · niplav · Jun 27, 2023, 12:38 PM · 45 points · 9 comments · 8 min read
An overview of the points system · Iknownothing · Jun 27, 2023, 9:09 AM · 3 points · 4 comments · 1 min read · (ai-plans.com)
AISC team report: Soft-optimization, Bayes and Goodhart · Simon Fischer, benjaminko, jazcarretao, DFNaiff and Jeremy Gillen · Jun 27, 2023, 6:05 AM · 38 points · 2 comments · 15 min read
Epistemic spot checking one claim in The Precipice · Isaac King · Jun 27, 2023, 1:03 AM · 33 points · 3 comments · 1 min read
nuclear costs are inflation · bhauth · Jun 26, 2023, 10:30 PM · 8 points · 42 comments · 5 min read · (www.bhauth.com)
Man in the Arena · Richard_Ngo · Jun 26, 2023, 9:57 PM · 66 points · 6 comments · 8 min read
Catastrophic Risks from AI #4: Organizational Risks · Dan H, Mantas Mazeika and TW123 · Jun 26, 2023, 7:36 PM · 23 points · 0 comments · 21 min read · (arxiv.org)
The fraught voyage of aligned novelty · TsviBT · Jun 26, 2023, 7:10 PM · 13 points · 0 comments · 17 min read
[Question] Deceptive AI vs. shifting instrumental incentives · Aryeh Englander · Jun 26, 2023, 6:09 PM · 7 points · 2 comments · 3 min read
On the Cost of Thriving Index · Zvi · Jun 26, 2023, 3:30 PM · 33 points · 6 comments · 9 min read · (thezvi.wordpress.com)
“Safety Culture for AI” is important, but isn’t going to be easy · Davidmanheim · Jun 26, 2023, 12:52 PM · 47 points · 2 comments · 2 min read · (forum.effectivealtruism.org)