All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 20252026

All JanFebMar Apr May Jun

All 1 2 3 456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28

Solemn Courage

aysja4 Feb 2026 23:09 UTC

128 points

1 comment6 min readLW link

p-values are good actually

speck14474 Feb 2026 22:04 UTC

9 points

8 comments3 min readLW link

Chess bots do not have goals

zulupineapple4 Feb 2026 21:11 UTC

2 points

10 comments1 min readLW link

Preventing the apocalypse with power distribution theory

Rationalist112354 Feb 2026 18:44 UTC

2 points

0 comments4 min readLW link

Post-AGI Economics As If Nothing Ever Happens

Jan_Kulveit4 Feb 2026 17:39 UTC

254 points

43 comments8 min readLW link

(boundedlyrational.substack.com)

Vibestemics

Gordon Seidoh Worley4 Feb 2026 16:40 UTC

13 points

10 comments5 min readLW link

(www.uncertainupdates.com)

Kimi K2.5

Zvi4 Feb 2026 15:30 UTC

33 points

0 comments10 min readLW link

(thezvi.wordpress.com)

Ralph-wiggum is Bad and Anthropic Should Fix It

d4hines4 Feb 2026 15:26 UTC

27 points

11 comments1 min readLW link

Who does a right to compute actually protect?

TFD4 Feb 2026 15:09 UTC

25 points

0 comments5 min readLW link

(www.thefloatingdroid.com)

Reconciling Shannon and Bayes.

Laureana Bonaparte4 Feb 2026 14:33 UTC

−24 points

1 comment1 min readLW link

(wallstreetweather.org)

Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)

RobertM4 Feb 2026 6:30 UTC

288 points

28 comments6 min readLW link

A Black Box Made Less Opaque (part 2)

Matthew McDonnell4 Feb 2026 4:12 UTC

6 points

0 comments15 min readLW link

Thoughts on Toby Ords’ AI Scaling Series

Srdjan Miletic4 Feb 2026 0:41 UTC

10 points

1 comment4 min readLW link

(www.dissent.blog)

Lexicon of Life Regulation

henophilia3 Feb 2026 22:39 UTC

−15 points

0 comments15 min readLW link

(blog.hermesloom.org)

‘Inventing the Renaissance’ Review

Commander Zander3 Feb 2026 22:01 UTC

60 points

2 comments3 min readLW link

Concrete research ideas on AI personas

nielsrolf, Maxime Riché and Daniel Tan

3 Feb 2026 21:50 UTC

69 points

10 comments6 min readLW link

Progress links and short notes, 2026-01-26

jasoncrawford3 Feb 2026 21:42 UTC

11 points

0 comments5 min readLW link

(rootsofprogress.substack.com)

The Projection Problem: Two Pitfalls in AI Safety Research

Shivam3 Feb 2026 21:03 UTC

6 points

2 comments6 min readLW link

New AI safety funding newsletter

Bryce Robertson3 Feb 2026 20:23 UTC

42 points

0 comments1 min readLW link

disgust at utility maximization

pantalaimon3 Feb 2026 20:07 UTC

1 point

4 comments1 min readLW link

METR have released Time Horizons 1.1

Sean Herrington3 Feb 2026 19:48 UTC

33 points

0 comments1 min readLW link

(metr.org)

AI Safety at the Frontier: Paper Highlights of January 2026

gasteigerjo3 Feb 2026 18:56 UTC

22 points

0 comments9 min readLW link

(aisafetyfrontier.substack.com)

Unless That Claw Is The Famous OpenClaw

Zvi3 Feb 2026 15:00 UTC

39 points

5 comments16 min readLW link

(thezvi.wordpress.com)

Exponential takeoff of mediocrity

Valerii K.3 Feb 2026 14:41 UTC

4 points

5 comments32 min readLW link

AI for Human Reasoning for Rationalists

Oliver Sourbut3 Feb 2026 13:22 UTC

29 points

0 comments4 min readLW link

(www.oliversourbut.net)

Conditionalization Confounds Inoculation Prompting Results

Maxime Riché and nielsrolf

3 Feb 2026 11:50 UTC

78 points

5 comments19 min readLW link

The Atoms of Knowledge Aren’t Universal

Jonas Hallgren3 Feb 2026 10:52 UTC

19 points

4 comments13 min readLW link

(equilibria1.substack.com)

What did we learn from the AI Village in 2025?

Shoshannah Tekofsky3 Feb 2026 9:52 UTC

63 points

5 comments10 min readLW link

(theaidigest.org)

Thought Editing: Steering Models by Editing Their Chain of Thought

Anton de la Fuente and Josh Engels

3 Feb 2026 9:51 UTC

20 points

0 comments5 min readLW link

Design international AI projects with DAID in mind

wdmacaskill3 Feb 2026 8:50 UTC

5 points

0 comments5 min readLW link

(www.forethought.org)

The Adolescence is Already Here

Priyanka Bharadwaj3 Feb 2026 7:43 UTC

33 points

2 comments2 min readLW link

Addressing Decision Theory’s Simulation Problem

Ashe Vazquez Nuñez3 Feb 2026 7:02 UTC

11 points

0 comments3 min readLW link

Paternal-Narrative Approach to AI Alignment

JD Croft3 Feb 2026 3:19 UTC

0 points

1 comment9 min readLW link

Nonprofits Deserve Better Operations

Deena Englander3 Feb 2026 2:38 UTC

−2 points

3 comments6 min readLW link

Will AGI arrive before the worst climate tipping points?

SethW3 Feb 2026 2:36 UTC

13 points

0 comments8 min readLW link

(carboncreatures.substack.com)

Increasing AI Strategic Competence as a Safety Approach

Wei Dai3 Feb 2026 1:08 UTC

53 points

9 comments1 min readLW link

Conditional Kickstarter for the “Don’t Build It” March

Raemon2 Feb 2026 22:58 UTC

165 points

35 comments4 min readLW link

Three ways to make Claude’s constitution better

Parv Mahajan2 Feb 2026 21:48 UTC

36 points

3 comments2 min readLW link

Cross-Layer Transcoders are incentivized to learn Unfaithful Circuits

Georg Lange, RGRGRG, Kat Dearstyne and Kamal Maher

2 Feb 2026 21:32 UTC

46 points

6 comments18 min readLW link

Games as meditation

Vadim Golub2 Feb 2026 21:10 UTC

2 points

0 comments3 min readLW link

On Goal-Models

Richard_Ngo2 Feb 2026 18:44 UTC

136 points

15 comments4 min readLW link

“Features” aren’t always the true computational primitives of a model, but that might be fine anyways

LawrenceC2 Feb 2026 18:41 UTC

18 points

0 comments5 min readLW link

Are there lessons from high-reliability engineering for AGI safety?

Steven Byrnes2 Feb 2026 15:26 UTC

161 points

15 comments8 min readLW link

Welcome to Moltbook

Zvi2 Feb 2026 14:30 UTC

58 points

2 comments29 min readLW link

(thezvi.wordpress.com)

Moltbook and the AI Alignment Problem

Logan Zoellner2 Feb 2026 9:35 UTC

15 points

1 comment5 min readLW link

Applications Open for Impact Accelerator Program

High Impact Professionals2 Feb 2026 9:34 UTC

1 point

0 comments1 min readLW link

Empiricist and Narrator

George3d62 Feb 2026 9:12 UTC

10 points

2 comments7 min readLW link

(cerebralab.com)

[Question] Proposition of policy for writing articles to fact check faster

Crazy philosopher2 Feb 2026 8:51 UTC

3 points

0 comments1 min readLW link

I finally fixed my footwear

dominicq2 Feb 2026 7:32 UTC

69 points

11 comments3 min readLW link

(sundaystopwatch.eu)

About half of Moltbook posts show desire for self-improvement

Stephen Elliott2 Feb 2026 6:14 UTC

20 points

11 comments2 min readLW link