All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 20252026

All Jan Feb Mar Apr MayJun

All12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19

NLA Thought Anchors

Realmbird31 May 2026 23:38 UTC

10 points

3 comments4 min readLW link

Lighthaven East—A Feasibility Study

JohnofCharleston31 May 2026 22:53 UTC

218 points

46 comments20 min readLW link

Barriers to a Prosperous Future

AJ Weeks31 May 2026 21:34 UTC

8 points

0 comments6 min readLW link

(ajweeks.com)

Notes on axes of variation in third-party risk assessment

Buck31 May 2026 20:48 UTC

38 points

2 comments10 min readLW link

The main impact from automated AI production: concentration of power?

Oliver Sourbut31 May 2026 20:42 UTC

20 points

2 comments7 min readLW link

(www.oliversourbut.net)

A Song About No

jefftk31 May 2026 20:40 UTC

14 points

1 comment1 min readLW link

(www.jefftk.com)

Financial Costs of an AI Pause?

PeterMcCluskey31 May 2026 18:55 UTC

66 points

10 comments6 min readLW link

(bayesianinvestor.com)

Links #2: 2026/05 Part 2

papetoast31 May 2026 13:41 UTC

8 points

0 comments20 min readLW link

Outrunning your headlights

mattshu041031 May 2026 10:42 UTC

41 points

3 comments3 min readLW link

Braining World Models; Predicting Latent Structure via EEG

Raghul Chandramouli31 May 2026 10:41 UTC

1 point

0 comments5 min readLW link

(brain-jepa)

Abstention Geometry: Knowledge and Behaviour Are Dissociable in Llama 3.1 8B

AdeOlu31 May 2026 10:41 UTC

1 point

0 comments9 min readLW link

Food, water and power from thin desert air

Bruce Middleton31 May 2026 8:28 UTC

19 points

2 comments1 min readLW link

Why AI safety researchers should consider a contract research manager position

Mikhail Mironov31 May 2026 8:27 UTC

7 points

0 comments3 min readLW link

Visualize Cyclical Structure in Llama Model

Talib Mirza31 May 2026 8:27 UTC

3 points

0 comments2 min readLW link

Features of SAEs are universal—but only up to an unknown random rotation

Jordan McCann31 May 2026 8:27 UTC

9 points

0 comments10 min readLW link

Agriculture needs another revolution

Dinesh Natesan31 May 2026 8:26 UTC

1 point

4 comments3 min readLW link

Why I think evals are pretty important and most worth working on (for me)

Troy Tian31 May 2026 8:26 UTC

7 points

4 comments1 min readLW link

Fundamental Uncertainty: Alternate Framework and Pointwise Reduction

StanislavKrym31 May 2026 3:47 UTC

12 points

3 comments8 min readLW link

Tween Contra Dance

jefftk31 May 2026 2:00 UTC

20 points

0 comments2 min readLW link

(www.jefftk.com)

Ensemble monitoring for AI control: diverse signals outweigh more compute

Yejun Y., Sam Tetef, eugenekoran, Benjamin Arnav and Pablo Bernabeu-Pérez

31 May 2026 1:21 UTC

12 points

0 comments7 min readLW link

How’s it going? Reinforcement learning in language models recruits a functional welfare axis

andyqhan30 May 2026 23:14 UTC

29 points

1 comment5 min readLW link

AI is a Meteor. Don’t Be a Dinosaur.

Boaz Barak30 May 2026 19:50 UTC

−2 points

7 comments1 min readLW link

An attempted synthesis on probabilities and infinities

David Matolcsi30 May 2026 19:24 UTC

10 points

0 comments27 min readLW link

Comment on “Banning Said Achmiz”

Zack_M_Davis30 May 2026 17:33 UTC

67 points

90 comments50 min readLW link

A Formula for Fun

Ihor Kendiukhov30 May 2026 13:01 UTC

11 points

3 comments8 min readLW link

Open Thread Summer 2026

habryka30 May 2026 5:00 UTC

28 points

11 comments1 min readLW link

Announcing: Iliad’s Fall 2026 Programs

David Udell, Alexander Gietelink Oldenziel and Leon Lang

30 May 2026 4:37 UTC

64 points

7 comments1 min readLW link

Bloomberg terminals for the rest of us

aiechrl30 May 2026 3:13 UTC

34 points

0 comments20 min readLW link

AI as Biology’s Digital Microscope

Darin Tsui30 May 2026 3:11 UTC

10 points

0 comments3 min readLW link

Ablating Induction Heads Leads to an increase in Local Repetition

Arjun Rao30 May 2026 3:11 UTC

8 points

0 comments5 min readLW link

System Prompts vs. Partner Adaptation in LLMs (or, when LLMs know you’re an adult but keep talking like you’re seven)

hi_im_yasha30 May 2026 3:07 UTC

4 points

0 comments7 min readLW link

Belief manifolds, and how to steer along them

Will Mayner30 May 2026 3:05 UTC

8 points

0 comments16 min readLW link

(willmayner.com)

New RFP on extreme power concentration

bengs30 May 2026 3:04 UTC

9 points

0 comments1 min readLW link

What If We Will Stop Destroying People Because Medicine Is Not Ready Yet?

Andrey Panferov30 May 2026 3:02 UTC

1 point

2 comments6 min readLW link

Why tuning fails: The AI has no self

Michael Trifonov30 May 2026 3:01 UTC

6 points

2 comments12 min readLW link

Wall-Mounted Far-UVC

jefftk30 May 2026 2:20 UTC

18 points

2 comments1 min readLW link

(www.jefftk.com)

A new approach to interpretability: round-trip neural network compilation-decompilation

Emma Leonhart29 May 2026 22:23 UTC

9 points

0 comments3 min readLW link

Claude Opus 4.8: The System Card

Zvi29 May 2026 20:50 UTC

64 points

1 comment23 min readLW link

(thezvi.wordpress.com)

Testing Gemini models for scheming tendencies

Vika, David Lindner, Seb Farquhar and Rohin Shah

29 May 2026 19:24 UTC

47 points

8 comments6 min readLW link

(deepmindsafetyresearch.medium.com)

How much should we worry about secretly loyal AIs?

Dave Banerjee29 May 2026 19:14 UTC

13 points

1 comment13 min readLW link

(www.the-substrate.net)

Data you could have observed but didn’t

Gretta Duleba29 May 2026 18:20 UTC

66 points

3 comments1 min readLW link

Is Progress Inevitable?

frmsaul29 May 2026 17:40 UTC

0 points

5 comments4 min readLW link

Retrying vs Resampling in AI Control

james.lucassen and Adam Kaufman

29 May 2026 17:02 UTC

67 points

4 comments9 min readLW link

(blog.redwoodresearch.org)

When Are Two Networks the Same? Tensor Similarity for Mechanistic Interpretability

Logan Riggs, tdooms, Conflux, lwroe, MLNissenGonzalez and mel83

29 May 2026 15:53 UTC

36 points

3 comments4 min readLW link

It takes a village to support a marriage

Shoshannah Tekofsky29 May 2026 15:16 UTC

21 points

5 comments2 min readLW link

(shoshanigans.substack.com)

AI Researchers, Ask Yourself These 6 Questions to Strengthen Your Moral Muscles

Max Tegmark29 May 2026 15:07 UTC

40 points

13 comments7 min readLW link

Maybe we should pretrain on synthetic data about good-but-reward-hacking AIs

Elliott Thornley (EJT)29 May 2026 14:50 UTC

12 points

4 comments3 min readLW link

Hannibal Mistral: the Mistral family has a problem with persona-conditioned elicitation

vigji29 May 2026 12:16 UTC

21 points

0 comments7 min readLW link

Developmental Cognitive Interpretability: A Research Agenda for Modelling Generalisation and Predicting Agent Behaviour

Jason R Brown and Edward James Young

29 May 2026 9:56 UTC

67 points

0 comments7 min readLW link

Relational Consciousness and AGI.

PaddyC29 May 2026 6:49 UTC

−11 points

8 comments1 min readLW link