My jour­ney to the microwave al­ter­nate timeline

Malmesbury10 Feb 2026 17:59 UTC
780 points
58 comments10 min readLW link

Here’s to the Polypropy­lene Makers

jefftk27 Feb 2026 4:00 UTC
553 points
19 comments2 min readLW link
(www.jefftk.com)

Did Claude 3 Opus al­ign it­self via gra­di­ent hack­ing?

Fiora Starlight21 Feb 2026 22:24 UTC
390 points
49 comments20 min readLW link

Life at the Frontlines of De­mo­graphic Collapse

Martin Sustrik14 Feb 2026 6:30 UTC
289 points
52 comments8 min readLW link
(www.250bpm.com)

An­thropic’s “Hot Mess” pa­per over­states its case (and the blog post is worse)

RobertM4 Feb 2026 6:30 UTC
288 points
28 comments6 min readLW link

mod­els have some pretty funny at­trac­tor states

12 Feb 2026 21:14 UTC
275 points
38 comments18 min readLW link

Why You Don’t Believe in Xhosa Prophecies

Jan_Kulveit13 Feb 2026 2:25 UTC
265 points
28 comments4 min readLW link

Gyre

vgel17 Feb 2026 0:38 UTC
260 points
24 comments8 min readLW link
(vgel.me)

Post-AGI Eco­nomics As If Noth­ing Ever Happens

Jan_Kulveit4 Feb 2026 17:39 UTC
254 points
43 comments8 min readLW link
(boundedlyrational.substack.com)

The Spec­tre haunt­ing the “AI Safety” Community

Gabriel Alfour21 Feb 2026 11:14 UTC
233 points
28 comments6 min readLW link
(cognition.cafe)

Open sourc­ing a browser ex­ten­sion that shows when peo­ple are wrong on the internet

lc24 Feb 2026 16:36 UTC
226 points
34 comments2 min readLW link
(github.com)

Near-In­stantly Abort­ing the Worst Pain Imag­in­able with Psychedelics

eleweek7 Feb 2026 16:11 UTC
217 points
13 comments13 min readLW link
(psychotechnology.substack.com)

The World Keeps Get­ting Saved and You Don’t Notice

Bogoed16 Feb 2026 1:01 UTC
210 points
20 comments2 min readLW link

The op­ti­mal age to freeze eggs is 19

GeneSmith8 Feb 2026 9:44 UTC
195 points
48 comments6 min readLW link

Re­spon­si­ble Scal­ing Policy v3

HoldenKarnofsky24 Feb 2026 20:20 UTC
179 points
82 comments36 min readLW link

You’re an AI Ex­pert – Not an Influencer

Max Winga17 Feb 2026 21:03 UTC
179 points
25 comments6 min readLW link
(maxwinga.substack.com)

Per­sona Parasitology

Raymond Douglas16 Feb 2026 16:22 UTC
176 points
38 comments11 min readLW link

What We Learned from Briefing 140+ Law­mak­ers on the Threat from AI

leticiagarcia12 Feb 2026 19:53 UTC
173 points
7 comments14 min readLW link
(substack.com)

The per­sona se­lec­tion model

Sam Marks23 Feb 2026 22:56 UTC
172 points
53 comments43 min readLW link
(alignment.anthropic.com)

Stone Age Billion­aire Can’t Words Good

Eneasz9 Feb 2026 18:51 UTC
169 points
95 comments12 min readLW link
(deathisbad.substack.com)

Con­di­tional Kick­starter for the “Don’t Build It” March

Raemon2 Feb 2026 22:58 UTC
165 points
35 comments4 min readLW link

Are there les­sons from high-re­li­a­bil­ity en­g­ineer­ing for AGI safety?

Steven Byrnes2 Feb 2026 15:26 UTC
161 points
15 comments8 min readLW link

Prompt in­jec­tion in Google Trans­late re­veals base model be­hav­iors be­hind task-spe­cific fine-tuning

megasilverfist7 Feb 2026 13:56 UTC
160 points
27 comments3 min readLW link

An­thropic: “State­ment from Dario Amodei on our dis­cus­sions with the Depart­ment of War”

Matrice Jacobine26 Feb 2026 23:45 UTC
159 points
22 comments3 min readLW link
(www.anthropic.com)

Why we should ex­pect ruth­less so­ciopath ASI

Steven Byrnes18 Feb 2026 17:28 UTC
156 points
63 comments8 min readLW link

Weight-Sparse Cir­cuits May Be In­ter­pretable Yet Unfaithful

jacob_drori9 Feb 2026 23:25 UTC
136 points
5 comments8 min readLW link

On Goal-Models

Richard_Ngo2 Feb 2026 18:44 UTC
136 points
15 comments4 min readLW link

Fron­tier AI com­pa­nies prob­a­bly can’t leave the US

Anders Cairns Woodruff26 Feb 2026 18:18 UTC
136 points
19 comments7 min readLW link
(blog.redwoodresearch.org)

Honey, I shrunk the brain

Andy_McKenzie7 Feb 2026 0:01 UTC
128 points
1 comment5 min readLW link
(neurobiology.substack.com)

Solemn Courage

aysja4 Feb 2026 23:09 UTC
128 points
1 comment6 min readLW link

Chang­ing the world for the worse

mingyuan22 Feb 2026 23:55 UTC
122 points
17 comments3 min readLW link
(mingyuan.substack.com)

You May Already Be Canadian

jefftk19 Feb 2026 16:00 UTC
120 points
14 comments1 min readLW link
(www.jefftk.com)

Ir­ra­tional­ity is So­cially Strategic

Valentine18 Feb 2026 13:28 UTC
119 points
18 comments13 min readLW link

Opus 4.6 Rea­son­ing Doesn’t Ver­bal­ize Align­ment Fak­ing, but Be­hav­ior Persists

9 Feb 2026 12:55 UTC
118 points
13 comments8 min readLW link

The na­ture of LLM al­gorith­mic progress (v2)

Steven Byrnes5 Feb 2026 19:17 UTC
116 points
27 comments13 min readLW link

Su­per­in­tel­li­gence Align­ment Sem­i­nar (1 month fo­cused up­skil­ling)

Mateusz Bagiński17 Feb 2026 17:03 UTC
115 points
13 comments3 min readLW link

The brain is a ma­chine that runs an algorithm

Steven Byrnes17 Feb 2026 19:36 UTC
114 points
18 comments4 min readLW link

Claude Opus 4.6 is Driven

HunterJay6 Feb 2026 4:15 UTC
113 points
1 comment5 min readLW link

Smokey, This is not ’Nam Or: [Already] over the [red] line!

Davidmanheim8 Feb 2026 12:24 UTC
110 points
22 comments4 min readLW link

The ML on­tol­ogy and the al­ign­ment ontology

Richard_Ngo24 Feb 2026 4:39 UTC
110 points
9 comments4 min readLW link

New ARENA ma­te­rial: 8 ex­er­cise sets on al­ign­ment sci­ence & interpretability

CallumMcDougall27 Feb 2026 17:37 UTC
104 points
1 comment7 min readLW link

It Is Rea­son­able To Re­search How To Use Model In­ter­nals In Training

Neel Nanda8 Feb 2026 3:44 UTC
103 points
15 comments4 min readLW link

Gem­ini’s Hy­po­thet­i­cal Present

jefftk13 Feb 2026 2:20 UTC
101 points
9 comments2 min readLW link
(www.jefftk.com)

Whack-a-Mole is Not a Winnable Game

Sable26 Feb 2026 2:40 UTC
101 points
26 comments18 min readLW link
(affablyevil.substack.com)

Long-term risks from ide­olog­i­cal fanaticism

12 Feb 2026 23:26 UTC
99 points
12 comments84 min readLW link

Vot­ing Re­sults for the 2024 Review

RobertM7 Feb 2026 3:48 UTC
98 points
0 comments1 min readLW link

If you don’t feel deeply con­fused about AGI risk, some­thing’s wrong

Dave Banerjee21 Feb 2026 15:34 UTC
95 points
18 comments5 min readLW link
(open.substack.com)

Ex­clu­sive: Hegseth gives An­thropic un­til Fri­day to back down on AI safeguards

Matrice Jacobine24 Feb 2026 19:19 UTC
95 points
9 comments3 min readLW link
(www.axios.com)

Align­ing to Virtues

Richard_Ngo16 Feb 2026 0:37 UTC
93 points
36 comments4 min readLW link

What se­cret goals does Claude think it has?

loops25 Feb 2026 19:22 UTC
92 points
11 comments4 min readLW link