All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 20252026

All JanFebMar Apr May Jun

All 1 2 345 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28

Lexicon of Life Regulation

henophilia3 Feb 2026 22:39 UTC

−15 points

0 comments15 min readLW link

(blog.hermesloom.org)

‘Inventing the Renaissance’ Review

Commander Zander3 Feb 2026 22:01 UTC

60 points

2 comments3 min readLW link

Concrete research ideas on AI personas

nielsrolf, Maxime Riché and Daniel Tan

3 Feb 2026 21:50 UTC

69 points

10 comments6 min readLW link

Progress links and short notes, 2026-01-26

jasoncrawford3 Feb 2026 21:42 UTC

11 points

0 comments5 min readLW link

(rootsofprogress.substack.com)

The Projection Problem: Two Pitfalls in AI Safety Research

Shivam3 Feb 2026 21:03 UTC

6 points

2 comments6 min readLW link

New AI safety funding newsletter

Bryce Robertson3 Feb 2026 20:23 UTC

42 points

0 comments1 min readLW link

disgust at utility maximization

pantalaimon3 Feb 2026 20:07 UTC

1 point

4 comments1 min readLW link

METR have released Time Horizons 1.1

Sean Herrington3 Feb 2026 19:48 UTC

33 points

0 comments1 min readLW link

(metr.org)

AI Safety at the Frontier: Paper Highlights of January 2026

gasteigerjo3 Feb 2026 18:56 UTC

22 points

0 comments9 min readLW link

(aisafetyfrontier.substack.com)

Unless That Claw Is The Famous OpenClaw

Zvi3 Feb 2026 15:00 UTC

39 points

5 comments16 min readLW link

(thezvi.wordpress.com)

Exponential takeoff of mediocrity

Valerii K.3 Feb 2026 14:41 UTC

4 points

5 comments32 min readLW link

AI for Human Reasoning for Rationalists

Oliver Sourbut3 Feb 2026 13:22 UTC

29 points

0 comments4 min readLW link

(www.oliversourbut.net)

Conditionalization Confounds Inoculation Prompting Results

Maxime Riché and nielsrolf

3 Feb 2026 11:50 UTC

78 points

5 comments19 min readLW link

The Atoms of Knowledge Aren’t Universal

Jonas Hallgren3 Feb 2026 10:52 UTC

19 points

4 comments13 min readLW link

(equilibria1.substack.com)

What did we learn from the AI Village in 2025?

Shoshannah Tekofsky3 Feb 2026 9:52 UTC

63 points

5 comments10 min readLW link

(theaidigest.org)

Thought Editing: Steering Models by Editing Their Chain of Thought

Anton de la Fuente and Josh Engels

3 Feb 2026 9:51 UTC

20 points

0 comments5 min readLW link

Design international AI projects with DAID in mind

wdmacaskill3 Feb 2026 8:50 UTC

5 points

0 comments5 min readLW link

(www.forethought.org)

The Adolescence is Already Here

Priyanka Bharadwaj3 Feb 2026 7:43 UTC

33 points

2 comments2 min readLW link

Addressing Decision Theory’s Simulation Problem

Ashe Vazquez Nuñez3 Feb 2026 7:02 UTC

11 points

0 comments3 min readLW link

Paternal-Narrative Approach to AI Alignment

JD Croft3 Feb 2026 3:19 UTC

0 points

1 comment9 min readLW link

Nonprofits Deserve Better Operations

Deena Englander3 Feb 2026 2:38 UTC

−2 points

3 comments6 min readLW link

Will AGI arrive before the worst climate tipping points?

SethW3 Feb 2026 2:36 UTC

13 points

0 comments8 min readLW link

(carboncreatures.substack.com)

Increasing AI Strategic Competence as a Safety Approach

Wei Dai3 Feb 2026 1:08 UTC

53 points

9 comments1 min readLW link

Conditional Kickstarter for the “Don’t Build It” March

Raemon2 Feb 2026 22:58 UTC

165 points

35 comments4 min readLW link

Three ways to make Claude’s constitution better

Parv Mahajan2 Feb 2026 21:48 UTC

36 points

3 comments2 min readLW link

Cross-Layer Transcoders are incentivized to learn Unfaithful Circuits

Georg Lange, RGRGRG, Kat Dearstyne and Kamal Maher

2 Feb 2026 21:32 UTC

46 points

6 comments18 min readLW link

Games as meditation

Vadim Golub2 Feb 2026 21:10 UTC

2 points

0 comments3 min readLW link

On Goal-Models

Richard_Ngo2 Feb 2026 18:44 UTC

136 points

15 comments4 min readLW link

“Features” aren’t always the true computational primitives of a model, but that might be fine anyways

LawrenceC2 Feb 2026 18:41 UTC

18 points

0 comments5 min readLW link

Are there lessons from high-reliability engineering for AGI safety?

Steven Byrnes2 Feb 2026 15:26 UTC

161 points

15 comments8 min readLW link

Welcome to Moltbook

Zvi2 Feb 2026 14:30 UTC

58 points

2 comments29 min readLW link

(thezvi.wordpress.com)

Moltbook and the AI Alignment Problem

Logan Zoellner2 Feb 2026 9:35 UTC

15 points

1 comment5 min readLW link

Applications Open for Impact Accelerator Program

High Impact Professionals2 Feb 2026 9:34 UTC

1 point

0 comments1 min readLW link

Empiricist and Narrator

George3d62 Feb 2026 9:12 UTC

10 points

2 comments7 min readLW link

(cerebralab.com)

[Question] Proposition of policy for writing articles to fact check faster

Crazy philosopher2 Feb 2026 8:51 UTC

3 points

0 comments1 min readLW link

I finally fixed my footwear

dominicq2 Feb 2026 7:32 UTC

69 points

11 comments3 min readLW link

(sundaystopwatch.eu)

About half of Moltbook posts show desire for self-improvement

Stephen Elliott2 Feb 2026 6:14 UTC

20 points

11 comments2 min readLW link

How to prevent building a software-Ultron

PratyushRT2 Feb 2026 6:07 UTC

1 point

0 comments2 min readLW link

The limiting factor in AI programming is the synchronization overhead between two minds

jnalanko2 Feb 2026 6:04 UTC

20 points

3 comments1 min readLW link

Applying Temperature to LLM Outputs Semantically to Minimise Low-Temperature Hallucinations

Brodie Eaton2 Feb 2026 6:02 UTC

9 points

0 comments4 min readLW link

Thoughts the Unreasonable Effectiveness of Maths

Srdjan Miletic2 Feb 2026 6:00 UTC

16 points

5 comments4 min readLW link

(www.dissent.blog)

The Smoking Lesion Doesn’t Really Distinguish EDT from CDT

Srdjan Miletic2 Feb 2026 5:57 UTC

14 points

5 comments2 min readLW link

(www.dissent.blog)

Word importance in text ⇐ conditional information of the token in the context. Is this assumption valid?

yun dong2 Feb 2026 5:50 UTC

3 points

3 comments1 min readLW link

The Meta-Anthropic Argument

RogerDearnaley2 Feb 2026 1:10 UTC

41 points

55 comments2 min readLW link

Emotions and Reality

small identity1 Feb 2026 22:40 UTC

13 points

1 comment4 min readLW link

Situational Awareness is (mostly) here to stay

atharva1 Feb 2026 21:40 UTC

10 points

0 comments1 min readLW link

Are you looking for Neptune or Vulcan?

Mati_Roy1 Feb 2026 20:59 UTC

18 points

0 comments1 min readLW link

What It’s Like To Be A Worm (Notes on Borderline Sentience)

Niko_McCarty1 Feb 2026 17:33 UTC

18 points

3 comments25 min readLW link

(www.asimov.press)

Differentially Scary Movies

jefftk1 Feb 2026 14:40 UTC

43 points

1 comment1 min readLW link

(www.jefftk.com)

Would you kill a vulcan to save a shrimp?

James Diacoumis1 Feb 2026 12:46 UTC

10 points

8 comments6 min readLW link

(substack.com)