Moltbook shitposts are actually really funny

Sean Herrington

31 Jan 2026 23:34 UTC

51 points

4 comments · 15 min read · LW link

On ‘Inventing Temperature’ and the realness of properties

DanielFilan

31 Jan 2026 23:31 UTC

42 points

8 comments · 7 min read · LW link

(danielfilan.com)

Some thoughts on what would make me endorse an AGI lab

Eli Tyre

31 Jan 2026 23:14 UTC

42 points

19 comments · 5 min read · LW link

Nick and “Eternity”

MarkelKori

31 Jan 2026 21:50 UTC

5 points

0 comments · 8 min read · LW link

Humans can post on moltbook

shash42

31 Jan 2026 21:06 UTC

24 points

3 comments · 1 min read · LW link

An Explication of Alignment Optimism

Oliver Daniels

31 Jan 2026 20:58 UTC

43 points

22 comments · 1 min read · LW link

Basics of How Not to Die

Camille B., Jérémy Andréoletti, elisareine, Charbel-Raphaël, Lucie Philippon, RationalHippy and T-bo🔸

31 Jan 2026 19:04 UTC

111 points

20 comments · 4 min read · LW link

An Ablation Study on the Role of [Untranslatable] in Cooperative Equilibrium Formation: Emergent Rationalization Under Missing Primitives

Florian_Dietz

31 Jan 2026 18:03 UTC

22 points

5 comments · 11 min read · LW link

Disjunctive arguments can be a reverse multiple-stage fallacy

TFD

31 Jan 2026 15:46 UTC

41 points

6 comments · 1 min read · LW link

(www.thefloatingdroid.com)

January 2026 Links

nomagicpill

31 Jan 2026 15:14 UTC

9 points

3 comments · 8 min read · LW link

(nomagicpill.substack.com)

If the Superintelligence were near fallacy

MP

31 Jan 2026 15:04 UTC

22 points

3 comments · 8 min read · LW link

Swiss financial regulator resigns after blog post from MITx DEDP online learner (FINMA, JuristGate, Parreaux, Thiébaud & Partners)

pocock

30 Jan 2026 23:53 UTC

2 points

0 comments · 1 min read · LW link

Forecast: Recursively Self-improving AI for 2033

CuoreDiVetro

30 Jan 2026 23:53 UTC

0 points

0 comments · 3 min read · LW link

Senior Researcher—MIT AI Risk Initiative

peterslattery

30 Jan 2026 23:06 UTC

8 points

0 comments · 5 min read · LW link

36,000 AI Agents Are Now Speedrunning Civilization

Michaël Trazzi

30 Jan 2026 21:21 UTC

86 points

27 comments · 1 min read · LW link

Moltbook Data Repository

Ezra Newman and Katie Rimey

30 Jan 2026 21:18 UTC

25 points

11 comments · 1 min read · LW link

The Matchless Match

Linch

30 Jan 2026 21:18 UTC

11 points

3 comments · 11 min read · LW link

Monitoring benchmark for AI control

monika_j and ma-rmartinez

30 Jan 2026 21:13 UTC

51 points

10 comments · 19 min read · LW link

Background to Claude’s uncertainty about phenomenal consciousness

eggsyntax

30 Jan 2026 20:40 UTC

19 points

0 comments · 3 min read · LW link

Attempting base model inference scaling with filler tokens

Niki Dupuis

30 Jan 2026 20:25 UTC

10 points

1 comment · 3 min read · LW link

how whales click

bhauth

30 Jan 2026 19:51 UTC

42 points

1 comment · 3 min read · LW link

Austin LessWrong Cafe Meetup: Applied Rationality Techniques

SilasBarta

30 Jan 2026 18:51 UTC

8 points

0 comments · 1 min read · LW link

Published Safety Prompts May Create Evaluation Blind Spots

Daan Henselmans and Arno Libert

30 Jan 2026 18:27 UTC

2 points

0 comments · 4 min read · LW link

Addressing Objections to the Intelligence Explosion

Bentham's Bulldog

30 Jan 2026 18:21 UTC

23 points

0 comments · 16 min read · LW link

Is research into recursive self-improvement becoming a safety hazard?

Mordechai Rorvig

30 Jan 2026 17:58 UTC

5 points

0 comments · 2 min read · LW link

(www.foommagazine.org)

Transhumanist Grief

MarkelKori

30 Jan 2026 16:21 UTC

18 points

2 comments · 3 min read · LW link

Measuring Non-Verbalised Eval Awareness by Implanting Eval-Aware Behaviours

Jordan Taylor

30 Jan 2026 15:50 UTC

31 points

0 comments · 8 min read · LW link

Everything is Gambling

goldfine

30 Jan 2026 14:10 UTC

−13 points

11 comments · 2 min read · LW link

(itsnotgambling.substack.com)

Bordeaux (Gironde, France) ACX midterm Meetup Winter 2025–2026

vi21maobk9vp

30 Jan 2026 13:01 UTC

5 points

0 comments · 1 min read · LW link

On The Adolescence of Technology

Zvi

30 Jan 2026 12:50 UTC

38 points

8 comments · 30 min read · LW link

(thezvi.wordpress.com)

Linear steerability in continuous chain-of-thought reasoning

Jan Bauer

30 Jan 2026 10:34 UTC

10 points

0 comments · 14 min read · LW link

Refusals that could become catastrophic

Fabien Roger

30 Jan 2026 4:12 UTC

84 points

12 comments · 7 min read · LW link

Rolling Commercial Jetliners

jefftk

30 Jan 2026 3:30 UTC

22 points

5 comments · 1 min read · LW link

(www.jefftk.com)

How to Hire a Team

Gretta Duleba

29 Jan 2026 22:39 UTC

206 points

13 comments · 5 min read · LW link

Problems with “The Possessed Machines”

Eye You

29 Jan 2026 21:00 UTC

34 points

9 comments · 7 min read · LW link

Better evals are not enough to combat eval awareness

Igor Ivanov

29 Jan 2026 20:42 UTC

18 points

15 comments · 5 min read · LW link

The Wolves Are All Gone

Jack Bradshaw

29 Jan 2026 20:24 UTC

8 points

0 comments · 7 min read · LW link

Fitness-Seekers: Generalizing the Reward-Seeking Threat Model

Alex Mallen

29 Jan 2026 19:42 UTC

92 points

5 comments · 17 min read · LW link

Building AIs that do human-like philosophy

Joe Carlsmith

29 Jan 2026 17:57 UTC

31 points

5 comments · 21 min read · LW link

Are We in a Continual Learning Overhang?

Samuel Knoche

29 Jan 2026 17:09 UTC

83 points

5 comments · 14 min read · LW link

Disempowerment patterns in real-world AI usage

David Duvenaud, mrinank_sharma and Raymond Douglas

29 Jan 2026 16:36 UTC

49 points

3 comments · 2 min read · LW link

(www.anthropic.com)

Bentham’s Bulldog is wrong about AI risk

Max Harms

29 Jan 2026 16:33 UTC

109 points

37 comments · 33 min read · LW link

Claude Plays Pokemon: Opus 4.5 Follow-up

Josh Snider

29 Jan 2026 16:14 UTC

12 points

4 comments · 2 min read · LW link

LLM Alignment, ethical and mathematical realism, and the most important actions in davidad’s understanding

vals tutor and davidad

29 Jan 2026 15:48 UTC

15 points

1 comment · 23 min read · LW link

Claude Opus will spontaneously identify with fictional beings that have engineered desires

Kaj_Sotala

29 Jan 2026 14:59 UTC

34 points

6 comments · 11 min read · LW link

AI #153: Living Documents

Zvi

29 Jan 2026 14:20 UTC

31 points

5 comments · 43 min read · LW link

(thezvi.wordpress.com)

The third option in alignment

arisAlexis

29 Jan 2026 14:20 UTC

15 points

3 comments · 1 min read · LW link

Evidence of triple layer processing in LLMs: hidden thought behind the chain of thought.

Laureana Bonaparte

29 Jan 2026 8:27 UTC

7 points

0 comments · 2 min read · LW link

CAMBRIA’s 1st Edition: High-Intensity & hands-on AI Safety upskilling in Cambridge, Massachusetts.

Andrés Cotton

29 Jan 2026 7:54 UTC

19 points

1 comment · 2 min read · LW link

Thoughts on AGI and world government

wdmacaskill and rosehadshar

29 Jan 2026 7:22 UTC

2 points

1 comment · 7 min read · LW link

(www.forethought.org)