The Talk: a brief explanation of sexual dimorphism

Malmesbury · 18 Sep 2023 16:23 UTC
481 points
72 comments · 16 min read · LW link

Inside Views, Impostor Syndrome, and the Great LARP

johnswentworth · 25 Sep 2023 16:08 UTC
325 points
53 comments · 5 min read · LW link

Sharing Information About Nonlinear

Ben Pace · 7 Sep 2023 6:51 UTC
322 points
323 comments · 34 min read · LW link

EA Vegan Advocacy is not truthseeking, and it’s everyone’s problem

Elizabeth · 28 Sep 2023 23:30 UTC
317 points
246 comments · 22 min read · LW link
(acesounderglass.com)

Sum-threshold attacks

TsviBT · 8 Sep 2023 17:13 UTC
222 points
52 comments · 10 min read · LW link
(tsvibt.blogspot.com)

AI presidents discuss AI alignment agendas

9 Sep 2023 18:55 UTC
216 points
22 comments · 1 min read · LW link
(www.youtube.com)

What I would do if I wasn’t at ARC Evals

LawrenceC · 5 Sep 2023 19:19 UTC
212 points
8 comments · 13 min read · LW link

UDT shows that decision theory is more puzzling than ever

Wei Dai · 13 Sep 2023 12:26 UTC
197 points
51 comments · 1 min read · LW link

How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

28 Sep 2023 18:53 UTC
183 points
37 comments · 3 min read · LW link

A Golden Age of Building? Excerpts and lessons from Empire State, Pentagon, Skunk Works and SpaceX

jacobjacob · 1 Sep 2023 4:03 UTC
181 points
23 comments · 24 min read · LW link

There should be more AI safety orgs

Marius Hobbhahn · 21 Sep 2023 14:53 UTC
175 points
25 comments · 17 min read · LW link

Defunding My Mistake

ymeskhout · 4 Sep 2023 14:43 UTC
167 points
41 comments · 6 min read · LW link

The King and the Golem

Richard_Ngo · 25 Sep 2023 19:51 UTC
159 points
15 comments · 5 min read · LW link
(narrativeark.substack.com)

Sparse Autoencoders Find Highly Interpretable Directions in Language Models

21 Sep 2023 15:30 UTC
156 points
7 comments · 5 min read · LW link

Meta Questions about Metaphilosophy

Wei Dai · 1 Sep 2023 1:17 UTC
148 points
78 comments · 3 min read · LW link

“Diamondoid bacteria” nanobots: deadly threat or dead-end? A nanotech investigation

titotal · 29 Sep 2023 14:01 UTC
145 points
81 comments · 1 min read · LW link
(titotal.substack.com)

One Minute Every Moment

abramdemski · 1 Sep 2023 20:23 UTC
125 points
23 comments · 3 min read · LW link

Paper: LLMs trained on “A is B” fail to learn “B is A”

23 Sep 2023 19:55 UTC
120 points
73 comments · 4 min read · LW link
(arxiv.org)

The smallest possible button (or: moth traps!)

Neil · 2 Sep 2023 15:24 UTC
113 points
17 comments · 3 min read · LW link
(neilwarren.substack.com)

Interpreting OpenAI’s Whisper

EllenaR · 24 Sep 2023 17:53 UTC
112 points
10 comments · 7 min read · LW link

Paper: On measuring situational awareness in LLMs

4 Sep 2023 12:54 UTC
106 points
16 comments · 5 min read · LW link
(arxiv.org)

ActAdd: Steering Language Models without Optimization

6 Sep 2023 17:21 UTC
105 points
3 comments · 2 min read · LW link
(arxiv.org)

PSA: The community is in Berkeley/Oakland, not “the Bay Area”

maia · 11 Sep 2023 15:59 UTC
103 points
7 comments · 1 min read · LW link

Reproducing ARC Evals’ recent report on language model agents

Thomas Broadley · 1 Sep 2023 16:52 UTC
102 points
17 comments · 3 min read · LW link
(thomasbroadley.com)

Cohabitive Games so Far

mako yass · 28 Sep 2023 15:41 UTC
102 points
116 comments · 19 min read · LW link
(makopool.com)

Explaining grokking through circuit efficiency

8 Sep 2023 14:39 UTC
98 points
10 comments · 3 min read · LW link
(arxiv.org)

Closing Notes on Nonlinear Investigation

Ben Pace · 15 Sep 2023 22:44 UTC
97 points
47 comments · 11 min read · LW link

“X distracts from Y” as a thinly-disguised fight over group status / politics

Steven Byrnes · 25 Sep 2023 15:18 UTC
96 points
14 comments · 8 min read · LW link

Announcing FAR Labs, an AI safety coworking space

bgold · 29 Sep 2023 16:52 UTC
95 points
0 comments · 1 min read · LW link

Atoms to Agents Proto-Lectures

johnswentworth · 22 Sep 2023 6:22 UTC
93 points
13 comments · 2 min read · LW link
(www.youtube.com)

Anthropic’s Responsible Scaling Policy & Long-Term Benefit Trust

Zac Hatfield-Dodds · 19 Sep 2023 15:09 UTC
90 points
23 comments · 3 min read · LW link
(www.anthropic.com)

AI #31: It Can Do What Now?

Zvi · 28 Sep 2023 16:00 UTC
90 points
6 comments · 40 min read · LW link
(thezvi.wordpress.com)

Making AIs less likely to be spiteful

26 Sep 2023 14:12 UTC
89 points
2 comments · 10 min read · LW link

Logical Share Splitting

DaemonicSigil · 11 Sep 2023 4:08 UTC
88 points
16 comments · 9 min read · LW link
(pbement.com)

I compiled an ebook of `Project Lawful` for eBook readers

OrwellGoesShopping · 15 Sep 2023 18:09 UTC
87 points
4 comments · 1 min read · LW link
(www.mikescher.com)

Highlights: Wentworth, Shah, and Murphy on “Retargeting the Search”

RobertM · 14 Sep 2023 2:18 UTC
85 points
4 comments · 8 min read · LW link

Benchmarks for Detecting Measurement Tampering [Redwood Research]

5 Sep 2023 16:44 UTC
84 points
15 comments · 20 min read · LW link
(arxiv.org)

Navigating an ecosystem that might or might not be bad for the world

15 Sep 2023 23:58 UTC
77 points
20 comments · 1 min read · LW link

Memory bandwidth constraints imply economies of scale in AI inference

Ege Erdil · 17 Sep 2023 14:01 UTC
76 points
33 comments · 4 min read · LW link

[Question] How have you become more hard-working?

Chi Nguyen · 25 Sep 2023 12:37 UTC
76 points
40 comments · 1 min read · LW link

AI #30: Dalle-3 and GPT-3.5-Instruct-Turbo

Zvi · 21 Sep 2023 12:00 UTC
75 points
8 comments · 47 min read · LW link
(thezvi.wordpress.com)

Text Posts from the Kids Group: 2023 I

jefftk · 5 Sep 2023 2:00 UTC
75 points
3 comments · 7 min read · LW link
(www.jefftk.com)

Find Hot French Food Near Me: A Follow-up

aphyer · 6 Sep 2023 12:32 UTC
75 points
19 comments · 2 min read · LW link

Luck based medicine: angry eldritch sugar gods edition

Elizabeth · 19 Sep 2023 4:40 UTC
74 points
13 comments · 9 min read · LW link
(acesounderglass.com)

[Question] How to talk about reasons why AGI might not be near?

Kaj_Sotala · 17 Sep 2023 8:18 UTC
73 points
19 comments · 2 min read · LW link

A quick update from Nonlinear

KatWoods · 7 Sep 2023 21:28 UTC
72 points
23 comments · 2 min read · LW link

Would You Work Harder In The Least Convenient Possible World?

Firinn · 22 Sep 2023 5:17 UTC
69 points
93 comments · 9 min read · LW link

High-level interpretability: detecting an AI’s objectives

28 Sep 2023 19:30 UTC
69 points
4 comments · 21 min read · LW link

Contra Yudkowsky on Epistemic Conduct for Author Criticism

Zack_M_Davis · 13 Sep 2023 15:33 UTC
69 points
38 comments · 7 min read · LW link

Influence functions—why, what and how

Nina Rimsky · 15 Sep 2023 20:42 UTC
69 points
6 comments · 8 min read · LW link