All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb Mar AprMayJun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 111213 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

[Question] Term/Category for AI with Neutral Impact?

isomicMay 11, 2023, 10:00 PM

6 points

1 comment1 min readLW link

Thoughts on LessWrong norms, the Art of Discourse, and moderator mandate

RubyMay 11, 2023, 9:20 PM

37 points

20 comments5 min readLW link

Alignment, Goals, and The Gut-Head Gap: A Review of Ngo. et al.

Violet HourMay 11, 2023, 6:06 PM

20 points

2 comments13 min readLW link

Sequence opener: Jordan Harbinger’s 6 minute networking

Severin T. SeehrichMay 11, 2023, 5:06 PM

4 points

0 comments1 min readLW link

Advice for newly busy people

Severin T. SeehrichMay 11, 2023, 4:46 PM

150 points

3 comments5 min readLW link

AI #11: In Search of a Moat

ZviMay 11, 2023, 3:40 PM

67 points

28 comments81 min readLW link

(thezvi.wordpress.com)

[Question] Bayesian update from sensationalistic sources

houkimeMay 11, 2023, 3:26 PM

1 point

0 comments1 min readLW link

I bet $500 on AI winning the IMO gold medal by 2026

azsantoskMay 11, 2023, 2:46 PM

37 points

29 comments1 min readLW link

Fatebook for Slack: Track your forecasts, right where your team works

Sage Future and Adam B

May 11, 2023, 2:11 PM

24 points

3 comments1 min readLW link

Contra Caller Signs

jefftkMay 11, 2023, 1:10 PM

10 points

0 comments1 min readLW link

(www.jefftk.com)

Notes on the importance and implementation of safety-first cognitive architectures for AI

Brendon_WongMay 11, 2023, 10:03 AM

3 points

0 comments3 min readLW link

A more grounded idea of AI risk

IknownothingMay 11, 2023, 9:48 AM

3 points

4 comments1 min readLW link

Separating the “control problem” from the “alignment problem”

Yi-YangMay 11, 2023, 9:41 AM

12 points

1 comment4 min readLW link

[Question] Is Infra-Bayesianism Applicable to Value Learning?

RogerDearnaleyMay 11, 2023, 8:17 AM

5 points

4 comments1 min readLW link

[Question] How should we think about the decision relevance of models estimating p(doom)?

Mo PuteraMay 11, 2023, 4:16 AM

11 points

1 comment3 min readLW link

The Academic Field Pyramid—any point to encouraging broad but shallow AI risk engagement?

Matthew_OpitzMay 11, 2023, 1:32 AM

20 points

1 comment6 min readLW link

[Question] How should one feel morally about using chatbots?

Adam ZernerMay 11, 2023, 1:01 AM

18 points

4 comments1 min readLW link

[Question] AI interpretability could be harmful?

Roman LeventovMay 10, 2023, 8:43 PM

13 points

2 comments1 min readLW link

Athens, Greece – ACX Meetups Everywhere Spring 2023

Spyros DovasMay 10, 2023, 7:45 PM

1 point

0 comments1 min readLW link

Better debates

TsviBTMay 10, 2023, 7:34 PM

78 points

7 comments3 min readLW link

Mental Health and the Alignment Problem: A Compilation of Resources (updated April 2023)

Chris Scammell and DivineMango

May 10, 2023, 7:04 PM

256 points

54 comments21 min readLW link

A Corrigibility Metaphore—Big Gambles

WCargoMay 10, 2023, 6:13 PM

16 points

0 comments4 min readLW link

Roadmap for a collaborative prototype of an Open Agency Architecture

Deger TuranMay 10, 2023, 5:41 PM

31 points

0 comments12 min readLW link

AGI-Automated Interpretability is Suicide

__RicG__May 10, 2023, 2:20 PM

25 points

33 comments7 min readLW link

Class-Based Addressing

jefftkMay 10, 2023, 1:40 PM

22 points

6 comments1 min readLW link

(www.jefftk.com)

In defence of epistemic modesty [distillation]

LuiseMay 10, 2023, 9:44 AM

17 points

2 comments9 min readLW link

[Question] How much of a concern are open-source LLMs in the short, medium and long terms?

JavierCCMay 10, 2023, 9:14 AM

5 points

0 comments1 min readLW link

10 great reasons why Lex Fridman should invite Eliezer and Robin to re-do the FOOM debate on his podcast

chaosmageMay 10, 2023, 8:27 AM

−7 points

1 comment1 min readLW link

(www.reddit.com)

New OpenAI Paper—Language models can explain neurons in language models

MrThinkMay 10, 2023, 7:46 AM

47 points

14 comments1 min readLW link

Naturalist Experimentation

LoganStrohlMay 10, 2023, 4:28 AM

62 points

14 comments10 min readLW link

[Question] Could A Superintelligence Out-Argue A Doomer?

tjaffeeMay 10, 2023, 2:40 AM

−16 points

6 comments1 min readLW link

Gradient hacking via actual hacking

Max HMay 10, 2023, 1:57 AM

12 points

7 comments3 min readLW link

Red teaming: challenges and research directions

joshcMay 10, 2023, 1:40 AM

31 points

1 comment10 min readLW link

[Question] Looking for a post I read if anyone recognizes it

SilverFlameMay 10, 2023, 1:24 AM

2 points

2 comments1 min readLW link

Research Report: Incorrectness Cascades (Corrected)

Robert_AIZIMay 9, 2023, 9:54 PM

9 points

0 comments9 min readLW link

(aizi.substack.com)

Stopping dangerous AI: Ideal US behavior

Zach Stein-PerlmanMay 9, 2023, 9:00 PM

17 points

0 comments3 min readLW link

Stopping dangerous AI: Ideal lab behavior

Zach Stein-PerlmanMay 9, 2023, 9:00 PM

8 points

0 comments2 min readLW link

Progress links and tweets, 2023-05-09

jasoncrawfordMay 9, 2023, 8:22 PM

14 points

0 comments2 min readLW link

(rootsofprogress.org)

[Question] Have you heard about MIT’s “liquid neural networks”? What do you think about them?

PpauMay 9, 2023, 8:16 PM

35 points

14 comments1 min readLW link

Respect for Boundaries as non-arbirtrary coordination norms

Jonas HallgrenMay 9, 2023, 7:42 PM

9 points

3 comments7 min readLW link

Solving the Mechanistic Interpretability challenges: EIS VII Challenge 1

StefanHex and Marius Hobbhahn

May 9, 2023, 7:41 PM

119 points

1 comment10 min readLW link

Forecasting as a tool for teaching the general public to make better judgements?

Dominik Hajduk | České priorityMay 9, 2023, 5:35 PM

3 points

0 comments3 min readLW link

Language models can explain neurons in language models

nzMay 9, 2023, 5:29 PM

23 points

0 comments1 min readLW link

(openai.com)

Asimov on building robots without the First Law

rossryMay 9, 2023, 4:44 PM

4 points

1 comment2 min readLW link

Making Up Baby Signs

jefftkMay 9, 2023, 4:40 PM

44 points

6 comments2 min readLW link

(www.jefftk.com)

Exciting New Interpretability Paper!

research_prime_spaceMay 9, 2023, 4:39 PM

12 points

1 comment1 min readLW link

Result Of The Bounty/Contest To Explain Infra-Bayes In The Language Of Game Theory

johnswentworthMay 9, 2023, 4:35 PM

79 points

0 comments1 min readLW link

The Bleak Harmony of Diets and Survival: A Glimpse into Nature’s Unforgiving Balance

bardstaleMay 9, 2023, 4:08 PM

−16 points

0 comments1 min readLW link

Entropic Abyss

bardstaleMay 9, 2023, 3:59 PM

−12 points

0 comments2 min readLW link

AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models

Dan H and Orpheus16

May 9, 2023, 3:26 PM

28 points

1 comment4 min readLW link

(newsletter.safe.ai)

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer