How AI Is Learning to Think in Secret

Nicholas Andresen

6 Jan 2026 16:31 UTC

382 points

32 comments, 18 min read

(nickandresen.substack.com)

AI found 12 of 12 OpenSSL zero-days (while curl cancelled its bug bounty)

Stanislav Fort

27 Jan 2026 20:21 UTC

359 points

24 comments, 8 min read

In My Misanthropy Era

jenn

4 Jan 2026 18:34 UTC

336 points

153 comments, 8 min read

(jenn.site)

Canada Lost Its Measles Elimination Status Because We Don’t Have Enough Nurses Who Speak Low German

jenn

25 Jan 2026 18:33 UTC

325 points

24 comments, 7 min read

(www.jenn.site)

Ada Palmer: Inventing the Renaissance

Martin Sustrik

26 Jan 2026 4:40 UTC

301 points

20 comments, 13 min read

(www.250bpm.com)

2025 in AI predictions

jessicata

2 Jan 2026 4:29 UTC

245 points

19 comments, 11 min read

“The first two weeks are the hardest”: my first digital declutter

mingyuan

18 Jan 2026 22:04 UTC

219 points

11 comments, 2 min read

(mingyuan.substack.com)

How to Hire a Team

Gretta Duleba

29 Jan 2026 22:39 UTC

206 points

13 comments, 5 min read

AlgZoo: uninterpreted models with fewer than 1,500 parameters

Jacob_Hilton

26 Jan 2026 17:30 UTC

181 points

7 comments, 10 min read

(www.alignment.org)

Claude’s new constitution

Zac Hatfield-Dodds and Drake Thomas

21 Jan 2026 19:37 UTC

176 points

47 comments, 6 min read

(www.anthropic.com)

Backyard cat fight shows Schelling points preexist language

jchan

14 Jan 2026 14:10 UTC

172 points

25 comments, 3 min read

Precedents for the Unprecedented: Historical Analogies for Thirteen Artificial Superintelligence Risks

James_Miller

16 Jan 2026 18:43 UTC

165 points

15 comments, 63 min read

Why I Transitioned: A Response

quinoa marisa

20 Jan 2026 2:06 UTC

156 points

47 comments, 10 min read

On Owning Galaxies

Simon Lermen

6 Jan 2026 18:16 UTC

154 points

62 comments, 3 min read

(simonlermen.substack.com)

Deep learning as program synthesis

Zach Furman

20 Jan 2026 15:35 UTC

150 points

33 comments, 41 min read

Dario Amodei – The Adolescence of Technology

habryka

26 Jan 2026 19:10 UTC

147 points

62 comments, 73 min read

(www.darioamodei.com)

The inaugural Redwood Research podcast

Buck and ryan_greenblatt

4 Jan 2026 22:11 UTC

146 points

10 comments, 142 min read

Does Pentagon Pizza Theory Work?

rba

22 Jan 2026 19:24 UTC

140 points

11 comments, 5 min read

(goflaw.substack.com)

Why we are excited about confession!

Boaz Barak, Gabriel Wu and Manas Joglekar

14 Jan 2026 20:37 UTC

138 points

32 comments, 9 min read

(alignment.openai.com)

What Washington Says About AGI

Zephaniah Roe

17 Jan 2026 5:43 UTC

134 points

7 comments, 6 min read

Recent LLMs can do 2-hop and 3-hop latent (no-CoT) reasoning on natural facts

ryan_greenblatt

1 Jan 2026 13:36 UTC

129 points

11 comments, 3 min read

The Possessed Machines (summary)

L Rudolf L

25 Jan 2026 20:47 UTC

128 points

31 comments, 9 min read

(possessedmachines.com)

Lightcone is hiring a generalist, a designer, and a campus operations co-lead

habryka

17 Jan 2026 1:47 UTC

118 points

0 comments, 5 min read

Bentham’s Bulldog is wrong about AI risk

Max Harms

29 Jan 2026 16:33 UTC

109 points

37 comments, 33 min read

Taiwan war timelines might be shorter than AI timelines

Baram Sosis

1 Jan 2026 22:30 UTC

108 points

21 comments, 5 min read

Pretraining on Aligned AI Data Dramatically Reduces Misalignment—Even After Post-Training

RogerDearnaley

19 Jan 2026 21:24 UTC

106 points

12 comments, 11 min read

(arxiv.org)

Why AIs aren’t power-seeking yet

Eli Tyre

11 Jan 2026 7:07 UTC

105 points

16 comments, 7 min read

Notable Progress Has Been Made in Whole Brain Emulation

Dom Polsinelli

25 Jan 2026 19:07 UTC

103 points

15 comments, 6 min read

(open.substack.com)

Lies, Damned Lies, and Proofs: Formal Methods are not Slopless

Quinn and Max von Hippel

12 Jan 2026 22:32 UTC

102 points

10 comments, 7 min read

To be well-calibrated is to be punctual

moridinamael

25 Jan 2026 18:10 UTC

97 points

17 comments, 2 min read

Every Benchmark is Broken

Jonathan Gabor

24 Jan 2026 2:42 UTC

95 points

0 comments, 4 min read

(jonathanpgabor.substack.com)

Fitness-Seekers: Generalizing the Reward-Seeking Threat Model

Alex Mallen

29 Jan 2026 19:42 UTC

92 points

5 comments, 17 min read

Test your interpretability techniques by de-censoring Chinese models

Khoi Tran, aryaj, Senthooran Rajamanoharan and Neel Nanda

15 Jan 2026 16:33 UTC

91 points

14 comments, 20 min read

IABIED Book Review: Core Arguments and Counterarguments

Stephen McAleese

24 Jan 2026 14:25 UTC

90 points

39 comments, 25 min read

College Was Not That Terrible Now That I’m Not That Crazy

Zack_M_Davis

1 Jan 2026 23:14 UTC

90 points

9 comments, 44 min read

(zackmdavis.net)

Split Personality Training: Revealing Latent Knowledge Through Alternate Personalities (Research Report)

Florian_Dietz

12 Jan 2026 12:29 UTC

87 points

41 comments, 26 min read

Tensor-Transformer Variants are Surprisingly Performant

Logan Riggs

12 Jan 2026 19:43 UTC

87 points

15 comments, 4 min read

We need a better way to evaluate emergent misalignment

yix and Broyojo

11 Jan 2026 16:21 UTC

86 points

9 comments, 6 min read

36,000 AI Agents Are Now Speedrunning Civilization

Michaël Trazzi

30 Jan 2026 21:21 UTC

86 points

27 comments, 1 min read

Oversight Assistants: Turning Compute into Understanding

jsteinhardt

6 Jan 2026 0:50 UTC

85 points

7 comments, 9 min read

(bounded-regret.ghost.io)

Refusals that could become catastrophic

Fabien Roger

30 Jan 2026 4:12 UTC

84 points

12 comments, 7 min read

Are We in a Continual Learning Overhang?

Samuel Knoche

29 Jan 2026 17:09 UTC

83 points

5 comments, 14 min read

The truth behind the 2026 J.P. Morgan Healthcare Conference

Abhishaike Mahajan

17 Jan 2026 17:28 UTC

83 points

35 comments, 9 min read

(www.owlposting.com)

Strong, bipartisan leadership for resistance to Trump.

Raemon

11 Jan 2026 23:08 UTC

82 points

85 comments, 2 min read

When the LLM isn’t the one who’s wrong

Julian Bradshaw

18 Jan 2026 21:37 UTC

81 points

9 comments, 2 min read

Overwhelming Superintelligence

Raemon

1 Jan 2026 20:51 UTC

80 points

30 comments, 1 min read

Reflections on TA-ing Harvard’s first AI safety course

Roy Rinberg

15 Jan 2026 16:28 UTC

79 points

4 comments, 9 min read

Public intellectuals need to say what they actually believe

Aaron Bergman

7 Jan 2026 21:22 UTC

79 points

12 comments, 14 min read

(www.aaronbergman.net)

Why Motivated Reasoning?

johnswentworth

14 Jan 2026 19:55 UTC

78 points

20 comments, 5 min read

Open Problems With Claude’s Constitution

Zvi

28 Jan 2026 14:20 UTC

75 points

1 comment, 24 min read

(thezvi.wordpress.com)