All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 234 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

How To Become A Mechanistic Interpretability Researcher

Neel Nanda2 Sep 2025 23:38 UTC

155 points

12 comments55 min readLW link

[Question] When Both People Are Interested, How Often Is Flirtatious Escalation Mutual?

johnswentworth2 Sep 2025 23:37 UTC

51 points

15 comments2 min readLW link

Scaling AI Safety in Europe: From Local Groups to International Coordination

MariusWenk2 Sep 2025 23:36 UTC

21 points

3 comments11 min readLW link

Simulating the rest of the political disagreement

Raemon2 Sep 2025 22:06 UTC

128 points

16 comments2 min readLW link

AI Safety at the Frontier: Paper Highlights, August ’25

gasteigerjo2 Sep 2025 20:29 UTC

12 points

0 comments7 min readLW link

(open.substack.com)

Structural engineering in software engineering

Biff Wiff2 Sep 2025 19:07 UTC

25 points

2 comments4 min readLW link

But Have They Engaged With The Arguments? [Linkpost]

Noosphere892 Sep 2025 18:25 UTC

79 points

17 comments2 min readLW link

(philiptrammell.com)

Models vs beliefs

Biff Wiff2 Sep 2025 17:27 UTC

29 points

14 comments2 min readLW link

Non-Dualism and AI Morality

Marcio Díaz2 Sep 2025 17:21 UTC

3 points

4 comments5 min readLW link

%CPU Utilization Is A Lie

Brendan Long2 Sep 2025 17:05 UTC

75 points

9 comments3 min readLW link

(www.brendanlong.com)

Your LLM-assisted scientific breakthrough probably isn’t real

eggsyntax2 Sep 2025 15:05 UTC

161 points

42 comments7 min readLW link

xAI’s new safety framework is dreadful

Zach Stein-Perlman2 Sep 2025 15:00 UTC

108 points

6 comments3 min readLW link

Notes on Dark Sun (The Making of the Hydrogen Bomb)

Joel Burget2 Sep 2025 13:20 UTC

22 points

0 comments23 min readLW link

Three main views on the future of AI

Alex Amadori, Eva_B, Gabriel Alfour and Andrea_Miotti

2 Sep 2025 13:06 UTC

48 points

1 comment1 min readLW link

Traffic and Transit Roundup #1

Zvi2 Sep 2025 12:00 UTC

37 points

4 comments21 min readLW link

(thezvi.wordpress.com)

Gradient routing is better than pretraining filtering

Cleo Nardo2 Sep 2025 9:05 UTC

51 points

3 comments5 min readLW link

Time’s arrow ⇒ decision theory

Aram Ebtekar2 Sep 2025 6:20 UTC

33 points

0 comments2 min readLW link

(doi.org)

The Cats are On To Something

Hastings2 Sep 2025 2:30 UTC

251 points

29 comments3 min readLW link

(www.hgreer.com)

Will Non-Dual Crap Cause Emergent Misalignment?

Marcio Díaz2 Sep 2025 0:12 UTC

25 points

3 comments4 min readLW link

Category-Theoretic Wanderings into Interpretability

unruly abstractions2 Sep 2025 0:03 UTC

19 points

2 comments1 min readLW link

(www.unrulyabstractions.com)

Anthropic’s leading researchers acted as moderate accelerationists

Remmelt1 Sep 2025 23:23 UTC

129 points

72 comments42 min readLW link

⿻ Plurality & 6pack.care

Audrey Tang1 Sep 2025 20:54 UTC

183 points

26 comments13 min readLW link

The Insight Gacha

The Dao of Bayes1 Sep 2025 17:15 UTC

13 points

0 comments3 min readLW link

Dating Roundup #7: Back to Basics

Zvi1 Sep 2025 11:40 UTC

23 points

11 comments29 min readLW link

(thezvi.wordpress.com)

Want to make AI go well for all sentient beings? Apply to a Sentient Futures fellowship or conference!

Damin Curtis1 Sep 2025 8:50 UTC

17 points

0 comments2 min readLW link

Support the movement against extinction risk due to AI

samuelshadrach1 Sep 2025 5:35 UTC

−34 points

8 comments2 min readLW link

(samuelshadrach.com)

Should we align AI with maternal instinct?

Priyanka Bharadwaj1 Sep 2025 3:56 UTC

34 points

16 comments3 min readLW link

Generative AI is not causing YCombinator companies to grow more quickly than usual (yet)

Xodarap1 Sep 2025 3:38 UTC

95 points

8 comments9 min readLW link

Help me understand: how do multiverse acausal trades work?

Aram Ebtekar1 Sep 2025 3:25 UTC

46 points

26 comments2 min readLW link

Newcomber

Charlie Sanders1 Sep 2025 2:29 UTC

6 points

0 comments2 min readLW link

(www.dailymicrofiction.com)

Evaluating Prediction in Acausal Mixed-Motive Settings

Tim Chan31 Aug 2025 22:58 UTC

14 points

0 comments6 min readLW link

My AI Predictions for 2027

Taylor G. Lunt31 Aug 2025 22:00 UTC

39 points

76 comments16 min readLW link

Hedonium is AI Alignment

Tahmatem and Coil

31 Aug 2025 19:46 UTC

−17 points

0 comments6 min readLW link

Legal Personhood—The First Amendment (Part 2)

Stephen Martin31 Aug 2025 12:06 UTC

2 points

0 comments2 min readLW link

A quantum equivalent to Bayes’ rule

dr_s31 Aug 2025 10:06 UTC

51 points

18 comments8 min readLW link

ACX Meetup Wellington

NotEvil31 Aug 2025 5:13 UTC

1 point

1 comment1 min readLW link

Sleeping Experts in the (reflective) Solomonoff Prior

Daniel C and Cole Wyeth

31 Aug 2025 4:55 UTC

16 points

0 comments3 min readLW link

Hacking The Spectrum For Profit (Maybe Fun)

Elek Szid31 Aug 2025 4:49 UTC

7 points

3 comments3 min readLW link

AI agents and painted facades

leni, zef and kaivu

30 Aug 2025 23:13 UTC

38 points

3 comments2 min readLW link

(fulcrumresearch.ai)

ACX Everywhere fall 2025 - Newton, MA

duck_master30 Aug 2025 22:02 UTC

1 point

1 comment1 min readLW link

Female sexual attractiveness seems more egalitarian than people acknowledge

lc30 Aug 2025 18:09 UTC

60 points

35 comments3 min readLW link

AI Sleeper Agents: How Anthropic Trains and Catches Them—Video

Writer30 Aug 2025 17:53 UTC

9 points

0 comments7 min readLW link

(youtu.be)

Understanding LLMs: Insights from Mechanistic Interpretability

Stephen McAleese30 Aug 2025 16:50 UTC

45 points

2 comments30 min readLW link

Legal Personhood—The First Amendment (Part 1)

Stephen Martin30 Aug 2025 13:20 UTC

4 points

0 comments3 min readLW link

Method Iteration: An LLM Prompting Technique

Davey Morse30 Aug 2025 0:08 UTC

−12 points

1 comment2 min readLW link

[Question] How to bet on myself? From expectations to robust goals

Fire Brito de S, Gabriel29 Aug 2025 18:33 UTC

4 points

1 comment1 min readLW link

AI Security London Hackathon

Prince Kumar29 Aug 2025 18:23 UTC

4 points

0 comments1 min readLW link

Summary of our Workshop on Post-AGI Outcomes

David Duvenaud, Raymond Douglas, Nora_Ammann and Jan_Kulveit

29 Aug 2025 17:14 UTC

112 points

3 comments3 min readLW link

Wikipedia, but written by AIs

Viliam29 Aug 2025 16:37 UTC

32 points

10 comments4 min readLW link

60 U.K. Lawmakers Accuse Google of Breaking AI Safety Pledge

Joseph Miller29 Aug 2025 16:09 UTC

51 points

1 comment1 min readLW link

(time.com)