All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 20252026

All JanFebMar Apr May Jun

All 1 2 3 4 5 678 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28

Strategy of von Neumann and strategy of Rosenbergs

avturchin6 Feb 2026 22:50 UTC

5 points

4 comments2 min readLW link

Data-Centric Interpretability for LLM-based Multi-Agent Reinforcement Learning

michaelwaves, Yanjo and Yuqi Sun

6 Feb 2026 19:27 UTC

10 points

0 comments4 min readLW link

Parks Aren’t Nature

Sable6 Feb 2026 18:27 UTC

50 points

11 comments8 min readLW link

(affablyevil.substack.com)

Claude Code #4: From The Before Times

Zvi6 Feb 2026 18:01 UTC

42 points

1 comment23 min readLW link

(thezvi.wordpress.com)

Robust Finite Policies are Nontrivially Structured

Winter Cross6 Feb 2026 17:47 UTC

26 points

1 comment11 min readLW link

In (highly contingent!) defense of interpretability-in-the-loop ML training

Steven Byrnes6 Feb 2026 16:32 UTC

85 points

11 comments3 min readLW link

Spectral Signatures of Gradual Disempowerment

Jonas Hallgren6 Feb 2026 15:08 UTC

36 points

4 comments17 min readLW link

(equilibria1.substack.com)

Demands Are All You Need: Prompt Imperativeness Drastically Reduces Hedging In LLMs (n=900, Cohen’s d = 2.67)

fluxxrider6 Feb 2026 13:22 UTC

6 points

0 comments16 min readLW link

[Question] If all humans were turned into high-fidelity mind uploads tomorrow, would we be self-sustaining?

Erich_Grunewald6 Feb 2026 8:35 UTC

11 points

2 comments1 min readLW link

AI benchmarking has a Y-axis problem

Lizka6 Feb 2026 7:45 UTC

79 points

3 comments7 min readLW link

Claude Opus 4.6 is Driven

HunterJay6 Feb 2026 4:15 UTC

113 points

1 comment5 min readLW link

Why ASI Might Preserve Its Progenitors

Luke J. Dawes6 Feb 2026 2:54 UTC

2 points

0 comments12 min readLW link

How Dario Amodei’s “The Adolescence of Technology” Delegitimizes AI X-Risk Concerns

Liron and Harlan

6 Feb 2026 2:07 UTC

12 points

6 comments50 min readLW link

(doomdebates.com)

[Question] Goodfire and Training on Interpretability

Satya Benson6 Feb 2026 1:45 UTC

32 points

5 comments1 min readLW link

Plan ’Straya

William the Kiwi 6 Feb 2026 0:14 UTC

16 points

5 comments5 min readLW link

TT Self Study Journal # 6

TristanTrim5 Feb 2026 23:41 UTC

5 points

3 comments3 min readLW link

The Simplest Case for AI Catastrophe

Linch5 Feb 2026 23:18 UTC

77 points

9 comments10 min readLW link

(linch.substack.com)

TT’s Looking-for-Work Strategy

TristanTrim5 Feb 2026 21:40 UTC

4 points

0 comments1 min readLW link

Agent Economics: a BOTEC on feasibility

Margot5 Feb 2026 20:15 UTC

28 points

0 comments6 min readLW link

(forum.effectivealtruism.org)

Moltbook as a setting to analyze Power Seeking behaviour

Rahul N5 Feb 2026 20:07 UTC

11 points

0 comments1 min readLW link

(propensitylabs.substack.com)

The nature of LLM algorithmic progress (v2)

Steven Byrnes5 Feb 2026 19:17 UTC

116 points

27 comments13 min readLW link

Biotech Startup Stats

sarahconstantin5 Feb 2026 18:40 UTC

21 points

0 comments4 min readLW link

(sarahconstantin.substack.com)

On The Lies Depression Tells

sonicrocketman5 Feb 2026 17:13 UTC

25 points

2 comments3 min readLW link

(brianschrader.com)

Speedrunning a Mech Interp Research Setup (Remote GPU, Torch, TransformerLens, Cuda, SSH, VS Code)

J Rosser5 Feb 2026 16:45 UTC

38 points

3 comments4 min readLW link

What’s the concrete plan to become an incredibly agentic person?

Peter Berggren5 Feb 2026 16:27 UTC

12 points

3 comments3 min readLW link

AI #154: Claw Your Way To The Top

Zvi5 Feb 2026 16:10 UTC

38 points

2 comments43 min readLW link

(thezvi.wordpress.com)

Preparing for a Warning Shot

Noah Birnbaum5 Feb 2026 15:10 UTC

43 points

5 comments4 min readLW link

A Proposal for TruesightBench

David Africa5 Feb 2026 14:33 UTC

14 points

0 comments4 min readLW link

Scratching the sore: how pleasure relates to suffering

Vadim Golub5 Feb 2026 12:05 UTC

−1 points

35 comments2 min readLW link

What’s the Point of the Math?

Ashe Vazquez Nuñez5 Feb 2026 11:30 UTC

46 points

3 comments5 min readLW link

Short List of Public Rationalist Online Discussion Groups in 2026

Shoshannah Tekofsky5 Feb 2026 10:33 UTC

14 points

2 comments1 min readLW link

Idea: the intelligence explosion convention

wdmacaskill5 Feb 2026 9:11 UTC

20 points

0 comments9 min readLW link

(www.forethought.org)

Is Note-taking a favor or a burden to my future-self?

CstineSublime5 Feb 2026 6:22 UTC

18 points

17 comments1 min readLW link

Episodic memory in AI agents poses new safety risks

Chad DeChant5 Feb 2026 5:28 UTC

13 points

1 comment10 min readLW link

Finding Cruxes: Help Reality Punch You In the Face

Raemon5 Feb 2026 2:11 UTC

71 points

0 comments8 min readLW link

How to train any multiagent systems end-to-end from AI feedback

Ed Li5 Feb 2026 2:00 UTC

1 point

0 comments1 min readLW link

In Search of Lost Time—A Review

eniteris5 Feb 2026 1:46 UTC

17 points

1 comment10 min readLW link

Solemn Courage

aysja4 Feb 2026 23:09 UTC

128 points

1 comment6 min readLW link

p-values are good actually

speck14474 Feb 2026 22:04 UTC

9 points

8 comments3 min readLW link

Chess bots do not have goals

zulupineapple4 Feb 2026 21:11 UTC

2 points

10 comments1 min readLW link

Preventing the apocalypse with power distribution theory

Rationalist112354 Feb 2026 18:44 UTC

2 points

0 comments4 min readLW link

Post-AGI Economics As If Nothing Ever Happens

Jan_Kulveit4 Feb 2026 17:39 UTC

254 points

43 comments8 min readLW link

(boundedlyrational.substack.com)

Vibestemics

Gordon Seidoh Worley4 Feb 2026 16:40 UTC

13 points

10 comments5 min readLW link

(www.uncertainupdates.com)

Kimi K2.5

Zvi4 Feb 2026 15:30 UTC

33 points

0 comments10 min readLW link

(thezvi.wordpress.com)

Ralph-wiggum is Bad and Anthropic Should Fix It

d4hines4 Feb 2026 15:26 UTC

27 points

11 comments1 min readLW link

Who does a right to compute actually protect?

TFD4 Feb 2026 15:09 UTC

25 points

0 comments5 min readLW link

(www.thefloatingdroid.com)

Reconciling Shannon and Bayes.

Laureana Bonaparte4 Feb 2026 14:33 UTC

−24 points

1 comment1 min readLW link

(wallstreetweather.org)

Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)

RobertM4 Feb 2026 6:30 UTC

288 points

28 comments6 min readLW link

A Black Box Made Less Opaque (part 2)

Matthew McDonnell4 Feb 2026 4:12 UTC

6 points

0 comments15 min readLW link

Thoughts on Toby Ords’ AI Scaling Series

Srdjan Miletic4 Feb 2026 0:41 UTC

10 points

1 comment4 min readLW link

(www.dissent.blog)