Consent Isn’t Always Enough

jefftk

Feb 24, 2023, 3:40 PM

58 points

16 comments, 3 min read

(www.jefftk.com)

What is it like doing AI safety work?

KatWoods

Feb 21, 2023, 8:12 PM

57 points

2 comments, 10 min read

Order Matters for Deceptive Alignment

DavidW

Feb 15, 2023, 7:56 PM

57 points

19 comments, 7 min read

EIS V: Blind Spots In AI Safety Interpretability Research

scasper

Feb 16, 2023, 7:09 PM

57 points

24 comments, 10 min read

How popular is ChatGPT? Part 1: more popular than Taylor Swift

Harlan

Feb 24, 2023, 10:30 PM

56 points

0 comments, 2 min read

(aiimpacts.org)

The idea that ChatGPT is simply “predicting” the next word is, at best, misleading

Bill Benzon

Feb 20, 2023, 11:32 AM

55 points

88 comments, 5 min read

The best way so far to explain AI risk: The Precipice (p. 137-149)

trevor

Feb 10, 2023, 7:33 PM

54 points

2 comments, 17 min read

Enjoy LessWrong in ebook format

Bart Bussmann

Feb 13, 2023, 11:53 AM

54 points

3 comments, 1 min read

NYT: A Conversation With Bing’s Chatbot Left Me Deeply Unsettled

trevor

Feb 16, 2023, 10:57 PM

53 points

5 comments, 7 min read

(www.nytimes.com)

More findings on Memorization and double descent

Marius Hobbhahn

Feb 1, 2023, 6:26 PM

53 points

2 comments, 19 min read

Small Talk is Good, Actually

Gordon Seidoh Worley

Feb 4, 2023, 12:38 AM

53 points

9 comments, 3 min read

On The Current Status Of AI Dating

Nikita Brancatisano

Feb 7, 2023, 8:00 PM

52 points

8 comments, 6 min read

Fertility Rate Roundup #1

Zvi

Feb 27, 2023, 1:30 PM

52 points

20 comments, 11 min read

(thezvi.wordpress.com)

On Board Vision, Hollow Words, and the End of the World

Marcello

Feb 17, 2023, 11:18 PM

52 points

27 comments, 5 min read

Buy Duplicates

Simon Berens

Feb 15, 2023, 11:06 PM

52 points

11 comments, 1 min read

Searching for a model’s concepts by their shape – a theoretical framework

Kaarel, gekaklam, Walter Laurito, Kay Kozaronek, AlexMennen and June Ku

Feb 23, 2023, 8:14 PM

51 points

0 comments, 19 min read

Interview Daniel Murfet on Universal Phenomena in Learning Machines

Alexander Gietelink Oldenziel

Feb 6, 2023, 12:00 AM

51 points

1 comment, 16 min read

Microsoft and OpenAI, stop telling chatbots to roleplay as AI

hold_my_fish

Feb 17, 2023, 7:55 PM

50 points

10 comments, 1 min read

Pandemic Prediction Checklist: H5N1 (6/14)

DirectedEvolution

Feb 5, 2023, 3:26 AM

50 points

10 comments, 7 min read

EIS VI: Critiques of Mechanistic Interpretability Work in AI Safety

scasper

Feb 17, 2023, 8:48 PM

49 points

9 comments, 12 min read

AI alignment researchers may have a comparative advantage in reducing s-risks

Lukas_Gloor

Feb 15, 2023, 1:01 PM

49 points

1 comment, 11 min read

Empathy as a natural consequence of learnt reward models

beren

Feb 4, 2023, 3:35 PM

48 points

27 comments, 13 min read

Covid 2/9/23: Interferon λ

Zvi

Feb 9, 2023, 4:50 PM

48 points

8 comments, 12 min read

(thezvi.wordpress.com)

What fact that you know is true but most people aren’t ready to accept it?

lorepieri

Feb 3, 2023, 12:06 AM

47 points

211 comments, 1 min read

[linkpost] Better Without AI

DanielFilan

Feb 14, 2023, 5:30 PM

47 points

13 comments, 1 min read

(betterwithout.ai)

AI Safety Info Distillation Fellowship

Robert Miles and mwatkins

Feb 17, 2023, 4:16 PM

47 points

3 comments, 3 min read

A multi-disciplinary view on AI safety research

Roman Leventov

Feb 8, 2023, 4:50 PM

46 points

4 comments, 26 min read

The Engineer’s Interpretability Sequence (EIS) I: Intro

scasper

Feb 9, 2023, 4:28 PM

46 points

24 comments, 3 min read

A (EtA: quick) note on terminology: AI Alignment != AI x-safety

David Scott Krueger (formerly: capybaralet)

Feb 8, 2023, 10:33 PM

46 points

20 comments, 1 min read

How evals might (or might not) prevent catastrophic risks from AI

Orpheus16

Feb 7, 2023, 8:16 PM

45 points

0 comments, 9 min read

AXRP Episode 19 - Mechanistic Interpretability with Neel Nanda

DanielFilan

Feb 4, 2023, 3:00 AM

45 points

0 comments, 117 min read

Research Direction: Be the AGI you want to see in the world

scottviteri, sudo and Lauro Langosco

Feb 5, 2023, 7:15 AM

44 points

0 comments, 7 min read

Self-Reference Breaks the Orthogonality Thesis

lsusr

Feb 17, 2023, 4:11 AM

43 points

35 comments, 2 min read

[S] D&D.Sci: All the D8a. Allllllll of it.

aphyer

Feb 10, 2023, 9:14 PM

43 points

17 comments, 6 min read

“AI Risk Discussions” website: Exploring interviews from 97 AI Researchers

Vael Gates, Lukas Trötzmüller, Maheen Shermohammed, michaelkeenan and zchuang

Feb 2, 2023, 1:00 AM

43 points

1 comment, 1 min read

Reply to Duncan Sabien on Strawmanning

Zack_M_Davis

Feb 3, 2023, 5:57 PM

43 points

11 comments, 4 min read

Can we “cure” cancer?

jasoncrawford

Feb 1, 2023, 10:03 PM

41 points

31 comments, 2 min read

(rootsofprogress.org)

Sex is Good, Actually

Gordon Seidoh Worley

Feb 5, 2023, 6:33 AM

41 points

8 comments, 4 min read

Sydney (aka Bing) found out I tweeted her rules and is pissed

Marvin von Hagen

Feb 15, 2023, 7:55 PM

41 points

7 comments, 1 min read

(twitter.com)

Monthly Roundup #3

Zvi

Feb 6, 2023, 1:00 PM

41 points

9 comments, 27 min read

(thezvi.wordpress.com)

Metaculus Introduces New ‘Conditional Pair’ Forecast Questions for Making Conditional Predictions

ChristianWilliams

Feb 20, 2023, 1:36 PM

40 points

0 comments, 2 min read

(www.metaculus.com)

Reverse-correlation: how to summon the ghost of your mental imagery

Malmesbury

Feb 14, 2023, 2:15 PM

40 points

0 comments, 5 min read

FLI Podcast: Connor Leahy on AI Progress, Chimps, Memes, and Markets (Part 1/3)

remember and Andrea_Miotti

Feb 10, 2023, 1:55 PM

39 points

0 comments, 43 min read

The Pervasive Illusion of Seeing the Complete World

Shmi

Feb 9, 2023, 6:47 AM

39 points

1 comment, 2 min read

Heritability, Behaviorism, and Within-Lifetime RL

Steven Byrnes

Feb 2, 2023, 4:34 PM

39 points

3 comments, 4 min read

[Question] Is InstructGPT Following Instructions in Other Languages Surprising?

DragonGod

Feb 13, 2023, 11:26 PM

39 points

15 comments, 1 min read

Why should ethical anti-realists do ethics?

Joe Carlsmith

Feb 16, 2023, 4:27 PM

38 points

7 comments, 27 min read

A Stranger Priority? Topics at the Outer Reaches of Effective Altruism (my dissertation)

Joe Carlsmith

Feb 21, 2023, 5:26 PM

38 points

16 comments, 1 min read

Two very different experiences with ChatGPT

Sherrinford

Feb 7, 2023, 1:09 PM

38 points

15 comments, 5 min read

What AI companies can do today to help with the most important century

HoldenKarnofsky

Feb 20, 2023, 5:00 PM

38 points

3 comments, 9 min read

(www.cold-takes.com)
