All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 171819 20 21 22 23 24 25 26 27 28 29 30 31

The Strange Science of Interpretability: Recent Papers and a Reading List for the Philosophy of Interpretability

Kola Ayonrinde17 Aug 2025 23:38 UTC

29 points

0 comments2 min readLW link

(arxiv.org)

The parable of the underdog

Said Achmiz17 Aug 2025 22:39 UTC

22 points

4 comments2 min readLW link

(www.datasecretslox.com)

Underdog bias rules everything around me

Richard_Ngo17 Aug 2025 19:21 UTC

191 points

56 comments7 min readLW link

(www.mindthefuture.info)

Apply for the 2025 Dovetail fellowship

Alex_Altair and Alfred Harwood

17 Aug 2025 19:09 UTC

42 points

2 comments4 min readLW link

Writing Out My Tunes

jefftk17 Aug 2025 17:00 UTC

11 points

2 comments3 min readLW link

(www.jefftk.com)

Plan E for AI Doom

Ihor Kendiukhov17 Aug 2025 15:26 UTC

72 points

15 comments3 min readLW link

[Question] Meaning in life—should I have it? How did you find yours?

Aprillion17 Aug 2025 9:49 UTC

13 points

21 comments3 min readLW link

Legal Personhood—Types of Consequences

Stephen Martin17 Aug 2025 6:52 UTC

6 points

0 comments4 min readLW link

Agent foundations: not really math, not really science

Alex_Altair17 Aug 2025 5:48 UTC

121 points

29 comments5 min readLW link

Why Latter-day Saints Have Strong Communities

Jeffrey Heninger17 Aug 2025 4:20 UTC

102 points

31 comments9 min readLW link

Immortalism—A Rational Case for Solving Death

vampiretooth17 Aug 2025 3:56 UTC

11 points

4 comments18 min readLW link

My Interview With Cade Metz on His Reporting About Lighthaven

Zack_M_Davis17 Aug 2025 2:30 UTC

157 points

15 comments5 min readLW link

On Pessimization

Richard_Ngo17 Aug 2025 1:10 UTC

59 points

3 comments10 min readLW link

(www.mindthefuture.info)

Debugging for Mid Coders

Raemon16 Aug 2025 22:32 UTC

82 points

41 comments7 min readLW link

Church Planting: When Venture Capital Finds Jesus

Elizabeth16 Aug 2025 19:40 UTC

242 points

23 comments16 min readLW link

(acesounderglass.com)

35 Thoughts About AGI and 1 About GPT-5

snewman16 Aug 2025 19:20 UTC

21 points

20 comments16 min readLW link

(secondthoughts.ai)

The Comprehensive Case Against Trump

Bentham's Bulldog16 Aug 2025 17:30 UTC

−11 points

34 comments26 min readLW link

The Collider Bias Theory of (Not Quite) Everything

Jack_S16 Aug 2025 16:53 UTC

90 points

3 comments10 min readLW link

How we hacked business school

Logan Kieller and Michael Samoilov

16 Aug 2025 15:22 UTC

17 points

2 comments6 min readLW link

(agenticconjectures.substack.com)

[Question] Why did interest in “AI risk” and “AI safety” spike in June and July 2025? (Google Trends)

WilliamKiely16 Aug 2025 15:22 UTC

32 points

4 comments1 min readLW link

Four types of approaches for your emotional problems

Kaj_Sotala16 Aug 2025 13:59 UTC

45 points

5 comments15 min readLW link

‘Just Tax Land’ - what’s the point?

Hruss16 Aug 2025 12:37 UTC

−3 points

1 comment1 min readLW link

(open.substack.com)

Mind Conditioning

Gabriel Alfour16 Aug 2025 11:20 UTC

5 points

0 comments1 min readLW link

(cognition.cafe)

Anthropic Lets Claude Opus 4 & 4.1 End Conversations

Stephen Martin16 Aug 2025 5:01 UTC

53 points

3 comments1 min readLW link

(www.anthropic.com)

The Inheritors: a book review

Alex_Altair16 Aug 2025 2:47 UTC

74 points

4 comments3 min readLW link

BIDA Masking and Attendance

jefftk16 Aug 2025 1:50 UTC

11 points

0 comments1 min readLW link

(www.jefftk.com)

Rights & Liberties—are opposites

James Stephen Brown16 Aug 2025 0:20 UTC

1 point

0 comments4 min readLW link

TT Self Study Journal # 4

TristanTrim15 Aug 2025 23:47 UTC

3 points

3 comments5 min readLW link

N Dimensional Interactive Scatter Plot (ndisp)

TristanTrim15 Aug 2025 23:08 UTC

10 points

3 comments12 min readLW link

SE Gyges’ response to AI-2027

StanislavKrym15 Aug 2025 21:54 UTC

32 points

13 comments46 min readLW link

(www.verysane.ai)

Towards data-centric interpretability with sparse autoencoders

Nick Jiang, lilysun004, lewis smith and Neel Nanda

15 Aug 2025 20:10 UTC

57 points

2 comments18 min readLW link

Music taste is (also) a next token prediction

eamag15 Aug 2025 17:49 UTC

6 points

0 comments2 min readLW link

(eamag.me)

Theory of culture as waste.

Laureana Bonaparte15 Aug 2025 17:34 UTC

−3 points

15 comments2 min readLW link

Spending Too Much Time At Airports

Zvi15 Aug 2025 16:10 UTC

59 points

24 comments7 min readLW link

(thezvi.wordpress.com)

How to make the future better (other than by reducing extinction risk)

wdmacaskill15 Aug 2025 15:40 UTC

17 points

1 comment3 min readLW link

Should you start a for-profit AI safety org?

KatWoods15 Aug 2025 13:52 UTC

8 points

4 comments1 min readLW link

How to get ChatGPT to really thoroughly research something

KatWoods15 Aug 2025 12:54 UTC

18 points

1 comment1 min readLW link

Thoughts on Gradual Disempowerment

Tom Davidson15 Aug 2025 11:56 UTC

65 points

32 comments19 min readLW link

Misalignment classifiers: Why they’re hard to evaluate adversarially, and why we’re studying them anyway

Charlie Griffin, ollie, oliverfm, Rogan Inglis and Alan Cooney

15 Aug 2025 11:48 UTC

68 points

3 comments17 min readLW link

A Phylogeny of Agents

Jonas Hallgren and markov

15 Aug 2025 10:47 UTC

40 points

12 comments6 min readLW link

(substack.com)

My kids won’t be workers

Gauraventh15 Aug 2025 7:06 UTC

3 points

0 comments6 min readLW link

(y1d2.com)

European Links (15.08.25)

Martin Sustrik15 Aug 2025 4:20 UTC

21 points

8 comments2 min readLW link

(www.250bpm.com)

Legal Personhood—Three Prong Bundle Theory

Stephen Martin15 Aug 2025 4:13 UTC

13 points

6 comments4 min readLW link

Mental Gymnastics.

Laureana Bonaparte15 Aug 2025 4:08 UTC

3 points

0 comments13 min readLW link

Rare AI and the Fermi Paradox

dawnstrata15 Aug 2025 4:05 UTC

11 points

6 comments9 min readLW link

Tristan’s Projects

TristanTrim15 Aug 2025 3:46 UTC

10 points

4 comments3 min readLW link

Trialing Far UVC and Glycol Vapors at BIDA

jefftk15 Aug 2025 2:20 UTC

19 points

1 comment2 min readLW link

(www.jefftk.com)

A philosophical kernel: biting analytic bullets

jessicata15 Aug 2025 1:35 UTC

64 points

21 comments13 min readLW link

(unstableontology.com)

A letter to Kyle Fish on the Retirement of Claude 3 Sonnet

bridgebot15 Aug 2025 1:08 UTC

−4 points

3 comments5 min readLW link

Conceptual Rhyme and Metaphor

Jordan Rubin15 Aug 2025 0:05 UTC

2 points

0 comments9 min readLW link

(jordanmrubin.substack.com)