All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 20252026

All JanFebMar Apr May Jun

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 242526 27 28

A simple rule for causation

Vivek Hebbar24 Feb 2026 23:14 UTC

37 points

2 comments3 min readLW link

SWE-Bench Pro is even worse

Jonathan Gabor24 Feb 2026 22:51 UTC

24 points

0 comments1 min readLW link

(jonathanpgabor.substack.com)

We are all legal realists now

TFD24 Feb 2026 21:51 UTC

−12 points

1 comment4 min readLW link

(www.thefloatingdroid.com)

Responsible Scaling Policy v3

HoldenKarnofsky24 Feb 2026 20:20 UTC

179 points

82 comments36 min readLW link

[Question] What was the most effective team you’ve ever been on, and what made it excellent?

Eli Tyre24 Feb 2026 20:18 UTC

77 points

7 comments2 min readLW link

Why Attack Success Rate Gives a False Picture of Backdoor Removal

Geoffrey Voyer24 Feb 2026 20:02 UTC

3 points

0 comments12 min readLW link

How I Started Being Productive

atomic24 Feb 2026 19:49 UTC

8 points

0 comments10 min readLW link

Solving The RAISE Act Like a (fictional) New York Detective

Josephine Schwab24 Feb 2026 19:35 UTC

3 points

1 comment6 min readLW link

Exclusive: Hegseth gives Anthropic until Friday to back down on AI safeguards

Matrice Jacobine24 Feb 2026 19:19 UTC

95 points

9 comments3 min readLW link

(www.axios.com)

Cigarette Ads for Babies from Microsoft Bing Image Generator

Edd Schneider24 Feb 2026 19:06 UTC

−4 points

1 comment4 min readLW link

Realistic Evaluations Will Not Prevent Evaluation Awareness

Adam Karvonen24 Feb 2026 17:51 UTC

37 points

9 comments6 min readLW link

The Easiest Route to Secret Loyalty May Be Hijacking the Model’s Chain of Command

Joe Kwon24 Feb 2026 17:47 UTC

16 points

1 comment5 min readLW link

Large-Scale Online Deanonymization with LLMs

Simon Lermen and Daniel Paleka

24 Feb 2026 17:02 UTC

69 points

5 comments4 min readLW link

(simonlermen.substack.com)

Open sourcing a browser extension that shows when people are wrong on the internet

lc24 Feb 2026 16:36 UTC

227 points

34 comments2 min readLW link

(github.com)

Rascal’s Wager

corticalcircuitry24 Feb 2026 16:13 UTC

3 points

2 comments3 min readLW link

(sergey.substack.com)

Citrini’s Scenario Is A Great But Deeply Flawed Thought Experiment

Zvi24 Feb 2026 15:40 UTC

37 points

6 comments22 min readLW link

(thezvi.wordpress.com)

Observations from Running an Agent Collective

williawa24 Feb 2026 15:34 UTC

45 points

2 comments10 min readLW link

What is a species?

David Goodman24 Feb 2026 14:23 UTC

49 points

15 comments26 min readLW link

Moral public goods are a big deal for whether we get a good future

Mia Taylor, Tom Davidson and wdmacaskill

24 Feb 2026 14:14 UTC

12 points

0 comments18 min readLW link

(www.forethought.org)

Two memos from 2024

Richard_Ngo24 Feb 2026 7:19 UTC

38 points

0 comments7 min readLW link

What is computational mechanics? An explainer

Leo Cymbalista24 Feb 2026 6:09 UTC

16 points

0 comments15 min readLW link

Monday AI Radar #14

Against Moloch24 Feb 2026 5:34 UTC

4 points

0 comments6 min readLW link

(againstmoloch.com)

The ML ontology and the alignment ontology

Richard_Ngo24 Feb 2026 4:39 UTC

110 points

9 comments4 min readLW link

[USA Today op-ed]: No, AI isn’t inevitable. We should stop it while we can.

David Scott Krueger24 Feb 2026 2:05 UTC

17 points

0 comments1 min readLW link

(www.usatoday.com)

Bioanchors 2: Electric Bacilli

TsviBT24 Feb 2026 1:07 UTC

38 points

1 comment7 min readLW link

Single Stack LLMs are Split-Brain Patients.

niceminus1924 Feb 2026 0:04 UTC

5 points

0 comments3 min readLW link

Using fiction to imagine a pathway to friendlyAGI

Rick Moss23 Feb 2026 23:48 UTC

3 points

0 comments2 min readLW link

When Benchmarks Lie: Evaluating Malicious Prompt Classifiers Under True Distribution Shift

Max Fomin23 Feb 2026 23:44 UTC

1 point

2 comments6 min readLW link

The persona selection model

Sam Marks23 Feb 2026 22:56 UTC

176 points

53 comments43 min readLW link

(alignment.anthropic.com)

Agenda Reflection: Testing Automated Alignment

Ariel_23 Feb 2026 21:53 UTC

11 points

0 comments2 min readLW link

(zenodo.org)

Claude Sonnet 4.6 Gives You Flexibility

Zvi23 Feb 2026 20:30 UTC

29 points

1 comment9 min readLW link

(thezvi.wordpress.com)

Secrets of the LessWrong RSS Feed

Brendan Long23 Feb 2026 20:12 UTC

36 points

6 comments4 min readLW link

Which questions can’t we punt?

Lizka23 Feb 2026 19:17 UTC

39 points

2 comments15 min readLW link

Exponential GDP growth from linear growth in variety of goods

Will_Howard23 Feb 2026 18:50 UTC

4 points

2 comments5 min readLW link

(open.substack.com)

Pre-training data poisoning likely makes installing secret loyalties easier

Joe Kwon23 Feb 2026 18:12 UTC

12 points

0 comments4 min readLW link

The 2028 Global Intelligence Crisis—a finance-oriented vignette

Rasool23 Feb 2026 17:12 UTC

50 points

13 comments1 min readLW link

(www.citriniresearch.com)

AI Impact Summit 2026 : A Field Report

Aditya and bhishma

23 Feb 2026 16:58 UTC

38 points

1 comment9 min readLW link

The map of the map is not the map

jimmy23 Feb 2026 16:54 UTC

18 points

3 comments9 min readLW link

Fact-checking an AI optimist article in The Economist

ToSummarise23 Feb 2026 13:56 UTC

41 points

3 comments4 min readLW link

(www.tosummarise.com)

Review: “We can’t disagree forever”

Martin Randall23 Feb 2026 13:17 UTC

15 points

0 comments3 min readLW link

Why I Think Pause is Impossible

E.G. Blee-Goldman23 Feb 2026 11:58 UTC

1 point

4 comments6 min readLW link

Can Aha Moments be Fake? Identifying True and Decorative Thinking Steps in CoT

Jiachen Zhao23 Feb 2026 11:51 UTC

24 points

0 comments10 min readLW link

(arxiv.org)

A World Without Violet: Peculiar Consequences of Granting Moral Status to Artificial Intelligences

Sever Topan23 Feb 2026 7:23 UTC

17 points

8 comments4 min readLW link

(severtopan.substack.com)

Was It Owl a Dream?

Yovel Rom23 Feb 2026 5:07 UTC

17 points

4 comments4 min readLW link

(yovelrom.substack.com)

Innate Immunity

joec23 Feb 2026 5:00 UTC

23 points

2 comments6 min readLW link

Why I Transitioned: A Third (FtM) Perspective

Character#273623 Feb 2026 4:39 UTC

22 points

6 comments14 min readLW link

The power of a simple 3-way truth scale

Bruce Lewis23 Feb 2026 2:41 UTC

4 points

2 comments2 min readLW link

Storing Food

jefftk23 Feb 2026 1:40 UTC

77 points

9 comments2 min readLW link

(www.jefftk.com)

Old SUNY Dorm Logic is not helping rural population collapse in NY.

Edd Schneider23 Feb 2026 1:28 UTC

9 points

4 comments3 min readLW link

Changing the world for the worse

mingyuan22 Feb 2026 23:55 UTC

129 points

17 comments3 min readLW link

(mingyuan.substack.com)