All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 151617 18 19 20 21 22 23 24 25 26 27 28 29 30 31

TT Self Study Journal # 4

TristanTrim15 Aug 2025 23:47 UTC

3 points

3 comments5 min readLW link

N Dimensional Interactive Scatter Plot (ndisp)

TristanTrim15 Aug 2025 23:08 UTC

10 points

3 comments12 min readLW link

SE Gyges’ response to AI-2027

StanislavKrym15 Aug 2025 21:54 UTC

32 points

13 comments46 min readLW link

(www.verysane.ai)

Towards data-centric interpretability with sparse autoencoders

Nick Jiang, lilysun004, lewis smith and Neel Nanda

15 Aug 2025 20:10 UTC

57 points

2 comments18 min readLW link

Music taste is (also) a next token prediction

eamag15 Aug 2025 17:49 UTC

6 points

0 comments2 min readLW link

(eamag.me)

Theory of culture as waste.

Laureana Bonaparte15 Aug 2025 17:34 UTC

−3 points

15 comments2 min readLW link

Spending Too Much Time At Airports

Zvi15 Aug 2025 16:10 UTC

59 points

24 comments7 min readLW link

(thezvi.wordpress.com)

How to make the future better (other than by reducing extinction risk)

wdmacaskill15 Aug 2025 15:40 UTC

17 points

1 comment3 min readLW link

Should you start a for-profit AI safety org?

KatWoods15 Aug 2025 13:52 UTC

8 points

4 comments1 min readLW link

How to get ChatGPT to really thoroughly research something

KatWoods15 Aug 2025 12:54 UTC

18 points

1 comment1 min readLW link

Thoughts on Gradual Disempowerment

Tom Davidson15 Aug 2025 11:56 UTC

65 points

32 comments19 min readLW link

Misalignment classifiers: Why they’re hard to evaluate adversarially, and why we’re studying them anyway

Charlie Griffin, ollie, oliverfm, Rogan Inglis and Alan Cooney

15 Aug 2025 11:48 UTC

68 points

3 comments17 min readLW link

A Phylogeny of Agents

Jonas Hallgren and markov

15 Aug 2025 10:47 UTC

40 points

12 comments6 min readLW link

(substack.com)

My kids won’t be workers

Gauraventh15 Aug 2025 7:06 UTC

3 points

0 comments6 min readLW link

(y1d2.com)

European Links (15.08.25)

Martin Sustrik15 Aug 2025 4:20 UTC

21 points

8 comments2 min readLW link

(www.250bpm.com)

Legal Personhood—Three Prong Bundle Theory

Stephen Martin15 Aug 2025 4:13 UTC

13 points

6 comments4 min readLW link

Mental Gymnastics.

Laureana Bonaparte15 Aug 2025 4:08 UTC

3 points

0 comments13 min readLW link

Rare AI and the Fermi Paradox

dawnstrata15 Aug 2025 4:05 UTC

11 points

6 comments9 min readLW link

Tristan’s Projects

TristanTrim15 Aug 2025 3:46 UTC

10 points

4 comments3 min readLW link

Trialing Far UVC and Glycol Vapors at BIDA

jefftk15 Aug 2025 2:20 UTC

19 points

1 comment2 min readLW link

(www.jefftk.com)

A philosophical kernel: biting analytic bullets

jessicata15 Aug 2025 1:35 UTC

64 points

21 comments13 min readLW link

(unstableontology.com)

A letter to Kyle Fish on the Retirement of Claude 3 Sonnet

bridgebot15 Aug 2025 1:08 UTC

−4 points

3 comments5 min readLW link

Conceptual Rhyme and Metaphor

Jordan Rubin15 Aug 2025 0:05 UTC

2 points

0 comments9 min readLW link

(jordanmrubin.substack.com)

Training a Reward Hacker Despite Perfect Labels

ariana_azarbal, Victor Gillioz and TurnTrout

14 Aug 2025 23:57 UTC

142 points

47 comments4 min readLW link

AGI: Probably Not 2027

Tomás B.14 Aug 2025 22:24 UTC

19 points

8 comments1 min readLW link

(www.verysane.ai)

Four Axes of Hunger

Brendan Long14 Aug 2025 19:03 UTC

25 points

3 comments2 min readLW link

(www.brendanlong.com)

Somebody invented a better bookmark

Alex_Altair14 Aug 2025 17:57 UTC

178 points

23 comments2 min readLW link

In defense of the amyloid hypothesis

dsj14 Aug 2025 17:52 UTC

50 points

0 comments1 min readLW link

(www.astralcodexten.com)

A Practical Tool for Mapping and Quantifying Belief Networks

Zack Friedman14 Aug 2025 17:22 UTC

7 points

0 comments1 min readLW link

AI #129: Comically Unconstitutional

Zvi14 Aug 2025 14:10 UTC

47 points

3 comments55 min readLW link

(thezvi.wordpress.com)

Healthcare as education

Coafos14 Aug 2025 13:31 UTC

4 points

0 comments3 min readLW link

About Stress

Gabriel Alfour14 Aug 2025 10:33 UTC

25 points

0 comments1 min readLW link

(cognition.cafe)

Legal Personhood—The “Enforcement Gap”

Stephen Martin14 Aug 2025 6:07 UTC

8 points

0 comments3 min readLW link

Sleeping Machines: Why Our AI Agents Still Behave Like Talented Children

Michal Barodkin14 Aug 2025 2:31 UTC

23 points

4 comments8 min readLW link

Exploring the “Anti-TESCREAL” Ideology and the Roots of (Anti-)Progress

Ottokar Hochman14 Aug 2025 2:30 UTC

24 points

2 comments2 min readLW link

(recapitulation.substack.com)

A YouTube Video Will Probably Never Help You Quit YouTube

boundary_condition14 Aug 2025 0:59 UTC

26 points

11 comments10 min readLW link

Should you make stone tools?

Alex_Altair14 Aug 2025 0:15 UTC

196 points

48 comments3 min readLW link

METR Research Update: Algorithmic vs. Holistic Evaluation

David Rein13 Aug 2025 22:47 UTC

101 points

7 comments1 min readLW link

(metr.org)

Interiors can be more fun

Nina Panickssery13 Aug 2025 22:42 UTC

34 points

6 comments4 min readLW link

(blog.ninapanickssery.com)

Against Epistemic Democracy: A Epistemic Tier List of What Actually Works

Linch13 Aug 2025 21:28 UTC

9 points

3 comments1 min readLW link

(linch.substack.com)

Good Faith Arguments

Gordon Seidoh Worley13 Aug 2025 20:50 UTC

1 point

0 comments3 min readLW link

(uncertainupdates.substack.com)

Doing A Thing Puts You in The Top 10% (And That Sucks)

Brendan Long13 Aug 2025 19:50 UTC

81 points

26 comments2 min readLW link

(www.brendanlong.com)

Intriguing Properties of gpt-oss Jailbreaks

Zephaniah Roe and jcksanderson

13 Aug 2025 19:42 UTC

20 points

0 comments10 min readLW link

(xlabaisecurity.com)

ChatGPT Caused Psychosis via Poisoning

Adele Lopez13 Aug 2025 19:15 UTC

19 points

2 comments1 min readLW link

Tech Tree for Secure Multipolar AI

Allison Duettmann and LindaPetrini

13 Aug 2025 17:18 UTC

11 points

3 comments2 min readLW link

Launching new AIXI research community website + reading group(s)

Cole Wyeth13 Aug 2025 17:09 UTC

46 points

2 comments1 min readLW link

AI development as the first fully-automated job

tailcalled13 Aug 2025 16:45 UTC

19 points

4 comments1 min readLW link

Probing Power-Seeking in LLMs

Moksh Nirvaan13 Aug 2025 16:04 UTC

8 points

0 comments12 min readLW link

GPT-5s Are Alive: Synthesis

Zvi13 Aug 2025 14:10 UTC

44 points

1 comment31 min readLW link

(thezvi.wordpress.com)

Books, maps, and teachings

Richard_Kennaway13 Aug 2025 11:44 UTC

14 points

1 comment3 min readLW link