The Overton Window widens: Examples of AI risk in the media

Orpheus16 Mar 23, 2023, 5:10 PM

107 points

24 comments 6 min read LW link

Want to predict/explain/control the output of GPT-4? Then learn about the world, not about transformers.

Cleo Nardo Mar 16, 2023, 3:08 AM

107 points

26 comments 5 min read LW link

Predictions for shard theory mechanistic interpretability results

TurnTrout, Ulisse Mini and peligrietzer

Mar 1, 2023, 5:16 AM

105 points

10 comments 5 min read LW link

Introducing Leap Labs, an AI interpretability startup

Jessica Rumbelow Mar 6, 2023, 4:16 PM

103 points

12 comments 1 min read LW link

On the FLI Open Letter

Zvi Mar 30, 2023, 4:00 PM

102 points

11 comments 22 min read LW link

(thezvi.wordpress.com)

AI #4: Introducing GPT-4

Zvi Mar 21, 2023, 2:00 PM

101 points

32 comments 103 min read LW link

(thezvi.wordpress.com)

Selective, Corrective, Structural: Three Ways of Making Social Systems Work

Said Achmiz Mar 5, 2023, 8:45 AM

100 points

13 comments 2 min read LW link

LLM Modularity: The Separability of Capabilities in Large Language Models

NickyP Mar 26, 2023, 9:57 PM

99 points

3 comments 41 min read LW link

Truth and Advantage: Response to a draft of “AI safety seems hard to measure”

So8res Mar 22, 2023, 3:36 AM

98 points

10 comments 5 min read LW link 1 review

Learn the mathematical structure, not the conceptual structure

Adam Shai Mar 1, 2023, 10:24 PM

98 points

35 comments 2 min read LW link

New blog: Planned Obsolescence

Ajeya Cotra Mar 27, 2023, 7:46 PM

96 points

7 comments 1 min read LW link

(www.planned-obsolescence.org)

RLHF does not appear to differentially cause mode-collapse

Arthur Conmy and beren

Mar 20, 2023, 3:39 PM

95 points

9 comments 3 min read LW link

AI #5: Level One Bard

Zvi Mar 30, 2023, 11:00 PM

95 points

9 comments 47 min read LW link

(thezvi.wordpress.com)

Nobody’s on the ball on AGI alignment

leopold Mar 29, 2023, 5:40 PM

94 points

38 comments 9 min read LW link

(www.forourposterity.com)

Shell games

TsviBT Mar 19, 2023, 10:43 AM

93 points

9 comments 4 min read LW link 1 review

Abstracts should be either Actually Short™, or broken into paragraphs

Raemon Mar 24, 2023, 12:51 AM

93 points

27 comments 5 min read LW link

Google’s PaLM-E: An Embodied Multimodal Language Model

SandXbox Mar 7, 2023, 4:11 AM

87 points

7 comments 1 min read LW link

(palm-e.github.io)

Practical Pitfalls of Causal Scrubbing

Jérémy Scheurer, Phil3, tony, jacquesthibs and David Lindner

Mar 27, 2023, 7:47 AM

87 points

17 comments 13 min read LW link

reflections on lockdown, two years out

mingyuan Mar 1, 2023, 6:58 AM

86 points

9 comments 3 min read LW link

Contract Fraud

jefftk Mar 1, 2023, 3:10 AM

86 points

10 comments 1 min read LW link

(www.jefftk.com)

The epistemic virtue of scope matching

jasoncrawford Mar 15, 2023, 1:31 PM

85 points

15 comments 5 min read LW link

(rootsofprogress.org)

The Kids are Not Okay

Zvi Mar 8, 2023, 1:30 PM

85 points

43 comments 32 min read LW link

(thezvi.wordpress.com)

The 0.2 OOMs/year target

Cleo Nardo Mar 30, 2023, 6:15 PM

84 points

24 comments 5 min read LW link

$500 Bounty/Contest: Explain Infra-Bayes In The Language Of Game Theory

johnswentworth Mar 25, 2023, 5:29 PM

83 points

7 comments 2 min read LW link

Yudkowsky on AGI risk on the Bankless podcast

Rob Bensinger Mar 13, 2023, 12:42 AM

83 points

5 comments LW link

[Question] Are there specific books that it might slightly help alignment to have on the internet?

AnnaSalamon Mar 29, 2023, 5:08 AM

77 points

25 comments 1 min read LW link

Sunlight is yellow parallel rays plus blue isotropic light

Thomas Kehrenberg Mar 1, 2023, 5:58 PM

77 points

5 comments 2 min read LW link

How to Support Someone Who is Struggling

David Zeller Mar 11, 2023, 6:52 PM

76 points

13 comments 5 min read LW link

Success without dignity: a nearcasting story of avoiding catastrophe by luck

HoldenKarnofsky Mar 14, 2023, 7:23 PM

76 points

17 comments 15 min read LW link

Response to Tyler Cowen’s Existential risk, AI, and the inevitable turn in human history

Zvi Mar 28, 2023, 4:00 PM

72 points

27 comments 20 min read LW link

(thezvi.wordpress.com)

A bunch of videos for intuition building (2x speed, skip ones that bore you)

the gears to ascension Mar 12, 2023, 12:51 AM

72 points

5 comments 4 min read LW link

Microsoft Research Paper Claims Sparks of Artificial Intelligence in GPT-4

Zvi Mar 24, 2023, 1:20 PM

72 points

14 comments 6 min read LW link

(thezvi.wordpress.com)

Imitation Learning from Language Feedback

Jérémy Scheurer, Tomek Korbak and Ethan Perez

Mar 30, 2023, 2:11 PM

71 points

3 comments 10 min read LW link

Dealing with infinite entropy

Alex_Altair Mar 1, 2023, 3:01 PM

70 points

9 comments 11 min read LW link

AI Safety in a World of Vulnerable Machine Learning Systems

AdamGleave and EuanMcLean

Mar 8, 2023, 2:40 AM

70 points

29 comments 29 min read LW link

(far.ai)

Probabilistic Payor Lemma?

abramdemski Mar 19, 2023, 5:57 PM

69 points

7 comments 4 min read LW link

Sparks of Artificial General Intelligence: Early experiments with GPT-4 | Microsoft Research

DragonGod Mar 23, 2023, 5:45 AM

68 points

23 comments 1 min read LW link

(arxiv.org)

Plan for mediocre alignment of brain-like [model-based RL] AGI

Steven Byrnes Mar 13, 2023, 2:11 PM

68 points

25 comments 12 min read LW link

AI #2

Zvi Mar 2, 2023, 2:50 PM

66 points

18 comments 55 min read LW link

(thezvi.wordpress.com)

Tabooing “Frame Control”

Raemon Mar 19, 2023, 11:33 PM

66 points

41 comments 10 min read LW link

[Question] What happened to the OpenPhil OpenAI board seat?

ChristianKl Mar 15, 2023, 4:59 PM

65 points

2 comments 1 min read LW link

Japan AI Alignment Conference

Chris Scammell and Katrina Joslin

Mar 10, 2023, 6:56 AM

64 points

7 comments 1 min read LW link

(www.conjecture.dev)

Sydney can play chess and kind of keep track of the board state

Erik Jenner Mar 3, 2023, 9:39 AM

64 points

19 comments 6 min read LW link

Some common confusion about induction heads

Alexandre Variengien Mar 28, 2023, 9:51 PM

64 points

4 comments 5 min read LW link

Transcript: NBC Nightly News: AI ‘race to recklessness’ w/ Tristan Harris, Aza Raskin

WilliamKiely Mar 23, 2023, 1:04 AM

63 points

4 comments 3 min read LW link

Why do we assume there is a “real” shoggoth behind the LLM? Why not masks all the way down?

Robert_AIZI Mar 9, 2023, 5:28 PM

63 points

48 comments 2 min read LW link

Sam Altman on GPT-4, ChatGPT, and the Future of AI | Lex Fridman Podcast #367

Gabe M Mar 25, 2023, 7:08 PM

63 points

4 comments 2 min read LW link

(www.youtube.com)

Payor’s Lemma in Natural Language

Andrew_Critch Mar 2, 2023, 12:22 PM

62 points

0 comments 2 min read LW link

The Prospect of an AI Winter

Erich_Grunewald Mar 27, 2023, 8:55 PM

62 points

24 comments 15 min read LW link

(www.erichgrunewald.com)

You Can’t Predict a Game of Pinball

Jeffrey Heninger Mar 30, 2023, 12:40 AM

61 points

13 comments 6 min read LW link 1 review

(aiimpacts.org)
