All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Failures in Kindness

silentbob26 Mar 2024 21:30 UTC

436 points

61 comments9 min readLW link 1 review

On green

Joe Carlsmith21 Mar 2024 17:38 UTC

339 points

47 comments31 min readLW link 3 reviews

My Clients, The Liars

ymeskhout5 Mar 2024 21:06 UTC

288 points

92 comments7 min readLW link 2 reviews

My PhD thesis: Algorithmic Bayesian Epistemology

Eric Neyman16 Mar 2024 22:56 UTC

262 points

14 comments7 min readLW link

(arxiv.org)

“How could I have thought that faster?”

mesaoptimizer11 Mar 2024 10:56 UTC

256 points

37 comments2 min readLW link 4 reviews

(twitter.com)

Modern Transformers are AGI, and Human-Level

abramdemski26 Mar 2024 17:46 UTC

232 points

88 comments5 min readLW link 1 review

ChatGPT can learn indirect control

Raymond Douglas21 Mar 2024 21:11 UTC

213 points

27 comments1 min readLW link

My Interview With Cade Metz on His Reporting About Slate Star Codex

Zack_M_Davis26 Mar 2024 17:18 UTC

211 points

187 comments6 min readLW link

Daniel Kahneman has died

DanielFilan27 Mar 2024 15:59 UTC

194 points

11 comments1 min readLW link

(www.washingtonpost.com)

Towards a Broader Conception of Adverse Selection

Ricki Heicklen14 Mar 2024 22:40 UTC

189 points

66 comments13 min readLW link 3 reviews

(bayesshammai.substack.com)

Many arguments for AI x-risk are wrong

TurnTrout5 Mar 2024 2:31 UTC

182 points

96 comments12 min readLW link 2 reviews

If you weren’t such an idiot...

kave and Mark Xu

2 Mar 2024 0:01 UTC

180 points

76 comments2 min readLW link

(markxu.com)

‘Empiricism!’ as Anti-Epistemology

Eliezer Yudkowsky14 Mar 2024 2:02 UTC

172 points

96 comments25 min readLW link 1 review

Some (problematic) aesthetics of what constitutes good work in academia

Steven Byrnes11 Mar 2024 17:47 UTC

158 points

12 comments12 min readLW link

Using axis lines for good or evil

dynomight6 Mar 2024 14:47 UTC

153 points

39 comments4 min readLW link

(dynomight.net)

Vernor Vinge, who coined the term “Technological Singularity”, dies at 79

Kaj_Sotala21 Mar 2024 22:14 UTC

151 points

25 comments1 min readLW link

(arstechnica.com)

On Devin

Zvi18 Mar 2024 13:20 UTC

148 points

35 comments11 min readLW link

(thezvi.wordpress.com)

The Worst Form Of Government (Except For Everything Else We’ve Tried)

johnswentworth17 Mar 2024 18:11 UTC

140 points

52 comments4 min readLW link 1 review

Read the Roon

Zvi5 Mar 2024 13:50 UTC

136 points

6 comments19 min readLW link

(thezvi.wordpress.com)

Community Notes by X

Niki Dupuis18 Mar 2024 17:13 UTC

129 points

15 comments7 min readLW link

Social status part 1/2: negotiations over object-level preferences

Steven Byrnes5 Mar 2024 16:29 UTC

119 points

16 comments21 min readLW link 1 review

The Parable Of The Fallen Pendulum—Part 1

johnswentworth1 Mar 2024 0:25 UTC

117 points

32 comments2 min readLW link

Simple versus Short: Higher-order degeneracy and error-correction

Daniel Murfet11 Mar 2024 7:52 UTC

115 points

12 comments13 min readLW link 3 reviews

Anthropic release Claude 3, claims >GPT-4 Performance

LawrenceC4 Mar 2024 18:23 UTC

115 points

41 comments2 min readLW link

(www.anthropic.com)

Notes from a Prompt Factory

Richard_Ngo10 Mar 2024 5:13 UTC

114 points

19 comments9 min readLW link

(www.narrativeark.xyz)

On attunement

Joe Carlsmith25 Mar 2024 12:47 UTC

111 points

13 comments22 min readLW link 1 review

SAE reconstruction errors are (empirically) pathological

wesg29 Mar 2024 16:37 UTC

108 points

16 comments8 min readLW link

LessOnline (May 31—June 2, Berkeley, CA)

Ben Pace26 Mar 2024 2:34 UTC

101 points

26 comments1 min readLW link

(Less.Online)

General Thoughts on Secular Solstice

Jeffrey Heninger23 Mar 2024 18:48 UTC

101 points

60 comments8 min readLW link

“Deep Learning” Is Function Approximation

Zack_M_Davis21 Mar 2024 17:50 UTC

99 points

29 comments10 min readLW link 1 review

(zackmdavis.net)

Stagewise Development in Neural Networks

Jesse Hoogland, Liam Carroll and Daniel Murfet

20 Mar 2024 19:54 UTC

96 points

1 comment11 min readLW link

Natural Latents: The Concepts

johnswentworth and David Lorell

20 Mar 2024 18:21 UTC

96 points

26 comments19 min readLW link

Announcing Neuronpedia: Platform for accelerating research into Sparse Autoencoders

Johnny Lin and Joseph Bloom

25 Mar 2024 21:17 UTC

96 points

7 comments7 min readLW link

Notes on Dwarkesh Patel’s Podcast with Demis Hassabis

Zvi1 Mar 2024 16:30 UTC

93 points

0 comments8 min readLW link

(thezvi.wordpress.com)

New report: Safety Cases for AI

joshc20 Mar 2024 16:45 UTC

92 points

14 comments1 min readLW link

(twitter.com)

OpenAI: The Board Expands

Zvi12 Mar 2024 14:00 UTC

92 points

1 comment30 min readLW link

(thezvi.wordpress.com)

Anxiety vs. Depression

Sable17 Mar 2024 0:15 UTC

91 points

34 comments3 min readLW link

(affablyevil.substack.com)

The Cognitive-Theoretic Model of the Universe: A Partial Summary and Review

jessicata27 Mar 2024 19:59 UTC

90 points

38 comments36 min readLW link

(unstablerontology.substack.com)

Introducing METR’s Autonomy Evaluation Resources

Megan Kinniment and Beth Barnes

15 Mar 2024 23:16 UTC

90 points

0 comments1 min readLW link

(metr.github.io)

[Question] What could a policy banning AGI look like?

TsviBT13 Mar 2024 14:19 UTC

80 points

23 comments3 min readLW link

The Parable Of The Fallen Pendulum—Part 2

johnswentworth12 Mar 2024 21:41 UTC

79 points

8 comments4 min readLW link

Grief is a fire sale

Nathan Young4 Mar 2024 1:11 UTC

77 points

1 comment4 min readLW link

Claude 3 claims it’s conscious, doesn’t want to die or be modified

Mikhail Samin4 Mar 2024 23:05 UTC

76 points

118 comments14 min readLW link

On Claude 3.0

Zvi6 Mar 2024 18:50 UTC

76 points

5 comments31 min readLW link

(thezvi.wordpress.com)

Vote on Anthropic Topics to Discuss

Ben Pace6 Mar 2024 19:43 UTC

75 points

55 comments1 min readLW link

“Artificial General Intelligence”: an extremely brief FAQ

Steven Byrnes11 Mar 2024 17:49 UTC

75 points

6 comments2 min readLW link

The World in 2029

Nathan Young2 Mar 2024 18:03 UTC

74 points

37 comments3 min readLW link

MATS AI Safety Strategy Curriculum

Ronny Fernandez and Ryan Kidd

7 Mar 2024 19:59 UTC

74 points

2 comments16 min readLW link

Nick Bostrom’s new book, “Deep Utopia”, is out today

peter_hartree27 Mar 2024 11:24 UTC

73 points

5 comments1 min readLW link

(nickbostrom.com)

Understanding SAE Features with the Logit Lens

Joseph Bloom and Johnny Lin

11 Mar 2024 0:16 UTC

71 points

2 comments14 min readLW link