All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 20252026

All Jan Feb Mar AprMayJun

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 232425 26 27 28 29 30 31

Goodhart’s Law and a Minimum Viable Sugarscape: Karpathy Pattern ABM Autoresearch

Raven Of Empire23 May 2026 23:24 UTC

2 points

0 comments6 min readLW link

Veganism is Virtuous, not Obligatory

Hide23 May 2026 23:19 UTC

10 points

10 comments25 min readLW link

(hidefromit.substack.com)

Low Expectancy is Not a Confidence Problem

Alex A23 May 2026 22:48 UTC

13 points

1 comment2 min readLW link

Basic principles for dressing better.

spookycat23 May 2026 19:59 UTC

69 points

24 comments5 min readLW link

Boltzmann brains, like Doomsday, require no explaining

Steff23 May 2026 16:16 UTC

−18 points

3 comments12 min readLW link

Probabilities are not the right concept

David Matolcsi23 May 2026 16:10 UTC

82 points

30 comments15 min readLW link

Your Left Brain Doesn’t Trade With Your Right

Alexander Gietelink Oldenziel23 May 2026 15:12 UTC

53 points

22 comments5 min readLW link

Out-of-Context Reasoning (OOCR) in LLMs: A Short Primer and Reading List

Owain_Evans23 May 2026 2:46 UTC

41 points

2 comments5 min readLW link

(outofcontextreasoning.com)

Capitalism is only the first of our problems

Ian Matson23 May 2026 2:22 UTC

−9 points

1 comment5 min readLW link

A political movement will save us from extinction

rohantohab23 May 2026 2:12 UTC

1 point

1 comment1 min readLW link

(open.substack.com)

How should we update on AI-enabled coups post-Mythos?

callumzc23 May 2026 2:10 UTC

16 points

3 comments5 min readLW link

How Humans Will Achieve Immortality (by transcending biology and becoming machine intelligence)

SeymourJReid23 May 2026 2:08 UTC

−4 points

0 comments8 min readLW link

PLA Daily Translation: Reflections on Warfare Brought by AGI

eeeee23 May 2026 0:52 UTC

51 points

1 comment11 min readLW link

Can Large Language Models Identify Novel Threats? Part 1: Mirror Life and the Classification Gap

Failfinder7023 May 2026 0:15 UTC

8 points

0 comments3 min readLW link

The Leaky AI Safety Pipeline

Nikhil Kalidasu23 May 2026 0:14 UTC

12 points

0 comments5 min readLW link

(crosscurrents.ink)

The Fundamentals of Cogitism: Grounding Ethics in the Nature of Consciousness

ArtiFabian23 May 2026 0:13 UTC

3 points

4 comments4 min readLW link

(sikerspot.com)

Looking for backdoors in Jane Street LLMs

Cipolla23 May 2026 0:06 UTC

16 points

0 comments14 min readLW link

Will we really put data centers in space?

Avi Parrack and fin

22 May 2026 23:51 UTC

91 points

23 comments5 min readLW link

(www.forethought.org)

We made a map of the doom debate

Sean Herrington, Paul Hindoian, mikaelacankosyan, David Bravo, keivnc, Josh Tuffy, Khai Tran, Maryam Hampaei and Christopher A. Davis

22 May 2026 23:24 UTC

40 points

9 comments6 min readLW link

Which technical AI safety fields are going to be automated first?

Chamod Kalupahana22 May 2026 17:32 UTC

21 points

5 comments6 min readLW link

Gemini 3.5 Flash Looks Good For How Fast It Is

Zvi22 May 2026 17:30 UTC

34 points

4 comments7 min readLW link

(thezvi.wordpress.com)

The AI Industrial Explosion — Part 3: Going faster

djbinder22 May 2026 16:38 UTC

19 points

2 comments14 min readLW link

(defensesindepth.bio)

Strong Longtermism Is Simply Correct

Bentham's Bulldog22 May 2026 15:57 UTC

1 point

1 comment19 min readLW link

Notes on Collaborating with Claude Opus

Nissa Seru22 May 2026 15:35 UTC

40 points

2 comments1 min readLW link

Proposal for “Timelines to what”: DIAL distribution

tlevin22 May 2026 14:40 UTC

21 points

0 comments1 min readLW link

AI is Not Normal Technology

Olivia Scharfman22 May 2026 10:27 UTC

16 points

2 comments19 min readLW link

Counting Arguments in AI Safety

Samuel Ratnam22 May 2026 8:43 UTC

16 points

13 comments3 min readLW link

(substack.com)

Insurance Premiums To The Moon

PossiblyElaine22 May 2026 6:09 UTC

18 points

1 comment4 min readLW link

(possiblyelaine.substack.com)

Moderator’s Principle of Least Surprise

Czynski21 May 2026 21:02 UTC

21 points

6 comments6 min readLW link

(dangeroussincerity.substack.com)

You can opt out of allergies

Rattengift21 May 2026 19:54 UTC

31 points

12 comments1 min readLW link

Possible red is red

avturchin21 May 2026 17:30 UTC

1 point

9 comments4 min readLW link

Apr-May 2026 AI Security via Formal Methods

Quinn21 May 2026 15:40 UTC

12 points

0 comments1 min readLW link

(newsletter.for-all.dev)

An Introduction to Neo-Fatalism

julius vidal21 May 2026 15:18 UTC

4 points

0 comments11 min readLW link

(cyberzenics.substack.com)

Loss of Oversight: How AI Systems May Become Harder to Audit, Monitor, and Investigate

Jordan Taylor, Max H, Ed Fage, Thomas Read and Joseph Bloom

21 May 2026 14:52 UTC

83 points

0 comments6 min readLW link

(www.aisi.gov.uk)

AI #169: New Knowledge

Zvi21 May 2026 13:20 UTC

39 points

10 comments47 min readLW link

(thezvi.wordpress.com)

What am I, if not an AI?

makiba21 May 2026 13:14 UTC

84 points

14 comments7 min readLW link

Learned Chain-of-Thought Obfuscation Generalises to Unseen Tasks

Nathaniel Mitrani, sassanb, Cam and Puria

21 May 2026 10:11 UTC

31 points

0 comments5 min readLW link

(arxiv.org)

Why is having a child inherently selfish?

JacksonTan21 May 2026 3:51 UTC

0 points

1 comment1 min readLW link

Numb mental state shifts

KatjaGrace21 May 2026 3:50 UTC

36 points

2 comments1 min readLW link

(worldspiritsockpuppet.com)

Women should be able to open things

KatjaGrace21 May 2026 3:50 UTC

340 points

134 comments2 min readLW link

(worldspiritsockpuppet.com)

Why are people so scared of causing fear?

KatjaGrace21 May 2026 3:50 UTC

39 points

4 comments2 min readLW link

(worldspiritsockpuppet.com)

Document-tuning instills durable animal compassion in LLMs (and generalizes to humans)

Jasmine Brazilek and MilesTS

21 May 2026 3:29 UTC

11 points

0 comments6 min readLW link

What About Us?

James Stephen Brown21 May 2026 2:48 UTC

4 points

0 comments5 min readLW link

(nonzerosum.games)

The Whole Kitten-Cavoodle

James Stephen Brown21 May 2026 2:32 UTC

5 points

0 comments5 min readLW link

Why does off-model SFT degrade capabilities?

SebastianP, Dylan Xu, Alek Westover, Julian Stastny and Vivek Hebbar

21 May 2026 0:35 UTC

42 points

9 comments6 min readLW link

If I Were Emperor of New AI Safety Researcher Training...

Lorxus20 May 2026 23:10 UTC

21 points

3 comments8 min readLW link

(tiled-with-pentagons.blogspot.com)

theory uplift differentially benefits safety & is underleveraged

yudhister20 May 2026 21:43 UTC

133 points

14 comments1 min readLW link

Singular Learning Theory Comprehensive − 1

Agastya Agrawal20 May 2026 20:00 UTC

35 points

1 comment12 min readLW link

Sparse Efficiency vs. Superposition: The Interpretability Tradeoff

hillz20 May 2026 19:14 UTC

8 points

0 comments1 min readLW link

The Case for Evaluating Model Behaviors

jsteinhardt20 May 2026 18:42 UTC

40 points

3 comments3 min readLW link