All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 20252026

AllJanFeb Mar Apr May Jun

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 3031

Swiss financial regulator resigns after blog post from MITx DEDP online learner (FINMA, JuristGate, Parreaux, Thiébaud & Partners)

pocock30 Jan 2026 23:53 UTC

2 points

0 comments1 min readLW link

Forecast: Recursively Self-improving AI for 2033

CuoreDiVetro30 Jan 2026 23:53 UTC

0 points

0 comments3 min readLW link

Senior Researcher—MIT AI Risk Initiative

peterslattery30 Jan 2026 23:06 UTC

8 points

0 comments5 min readLW link

36,000 AI Agents Are Now Speedrunning Civilization

Michaël Trazzi30 Jan 2026 21:21 UTC

86 points

27 comments1 min readLW link

Moltbook Data Repository

Ezra Newman and Katie Rimey

30 Jan 2026 21:18 UTC

25 points

11 comments1 min readLW link

The Matchless Match

Linch30 Jan 2026 21:18 UTC

11 points

3 comments11 min readLW link

Monitoring benchmark for AI control

monika_j and ma-rmartinez

30 Jan 2026 21:13 UTC

51 points

10 comments19 min readLW link

Background to Claude’s uncertainty about phenomenal consciousness

eggsyntax30 Jan 2026 20:40 UTC

19 points

0 comments3 min readLW link

Attempting base model inference scaling with filler tokens

Niki Dupuis30 Jan 2026 20:25 UTC

10 points

1 comment3 min readLW link

how whales click

bhauth30 Jan 2026 19:51 UTC

42 points

1 comment3 min readLW link

Austin LessWrong Cafe Meetup: Applied Rationality Techniques

SilasBarta30 Jan 2026 18:51 UTC

8 points

0 comments1 min readLW link

Published Safety Prompts May Create Evaluation Blind Spots

Daan Henselmans and Arno Libert

30 Jan 2026 18:27 UTC

2 points

0 comments4 min readLW link

Addressing Objections to the Intelligence Explosion

Bentham's Bulldog30 Jan 2026 18:21 UTC

23 points

0 comments16 min readLW link

Is research into recursive self-improvement becoming a safety hazard?

Mordechai Rorvig30 Jan 2026 17:58 UTC

5 points

0 comments2 min readLW link

(www.foommagazine.org)

Transhumanist Grief

MarkelKori30 Jan 2026 16:21 UTC

18 points

2 comments3 min readLW link

Measuring Non-Verbalised Eval Awareness by Implanting Eval-Aware Behaviours

Jordan Taylor30 Jan 2026 15:50 UTC

31 points

0 comments8 min readLW link

Everything is Gambling

goldfine30 Jan 2026 14:10 UTC

−13 points

11 comments2 min readLW link

(itsnotgambling.substack.com)

Bordeaux (Gironde, France) ACX midterm Meetup Winter 2025–2026

vi21maobk9vp30 Jan 2026 13:01 UTC

5 points

0 comments1 min readLW link

On The Adolescence of Technology

Zvi30 Jan 2026 12:50 UTC

38 points

8 comments30 min readLW link

(thezvi.wordpress.com)

Linear steerability in continuous chain-of-thought reasoning

Jan Bauer30 Jan 2026 10:34 UTC

10 points

0 comments14 min readLW link

Refusals that could become catastrophic

Fabien Roger30 Jan 2026 4:12 UTC

84 points

12 comments7 min readLW link

Rolling Commercial Jetliners

jefftk30 Jan 2026 3:30 UTC

22 points

5 comments1 min readLW link

(www.jefftk.com)

How to Hire a Team

Gretta Duleba29 Jan 2026 22:39 UTC

206 points

13 comments5 min readLW link

Problems with “The Possessed Machines”

Eye You29 Jan 2026 21:00 UTC

34 points

9 comments7 min readLW link

Better evals are not enough to combat eval awareness

Igor Ivanov29 Jan 2026 20:42 UTC

18 points

15 comments5 min readLW link

The Wolves Are All Gone

Jack Bradshaw29 Jan 2026 20:24 UTC

8 points

0 comments7 min readLW link

Fitness-Seekers: Generalizing the Reward-Seeking Threat Model

Alex Mallen29 Jan 2026 19:42 UTC

92 points

5 comments17 min readLW link

Building AIs that do human-like philosophy

Joe Carlsmith29 Jan 2026 17:57 UTC

31 points

5 comments21 min readLW link

Are We in a Continual Learning Overhang?

Samuel Knoche29 Jan 2026 17:09 UTC

83 points

5 comments14 min readLW link

Disempowerment patterns in real-world AI usage

David Duvenaud, mrinank_sharma and Raymond Douglas

29 Jan 2026 16:36 UTC

49 points

3 comments2 min readLW link

(www.anthropic.com)

Bentham’s Bulldog is wrong about AI risk

Max Harms29 Jan 2026 16:33 UTC

109 points

37 comments33 min readLW link

Claude Plays Pokemon: Opus 4.5 Follow-up

Josh Snider29 Jan 2026 16:14 UTC

12 points

4 comments2 min readLW link

LLM Alignment, ethical and mathematical realism, and the most important actions in davidad’s understanding

vals tutor and davidad

29 Jan 2026 15:48 UTC

15 points

1 comment23 min readLW link

Claude Opus will spontaneously identify with fictional beings that have engineered desires

Kaj_Sotala29 Jan 2026 14:59 UTC

34 points

6 comments11 min readLW link

AI #153: Living Documents

Zvi29 Jan 2026 14:20 UTC

31 points

5 comments43 min readLW link

(thezvi.wordpress.com)

The third option in alignment

arisAlexis29 Jan 2026 14:20 UTC

15 points

3 comments1 min readLW link

Evidence of triple layer processing in LLMs: hidden thought behind the chain of thought.

Laureana Bonaparte29 Jan 2026 8:27 UTC

7 points

0 comments2 min readLW link

CAMBRIA’s 1st Edition: High-Intensity & hands-on AI Safety upskilling in Cambridge, Massachusetts.

Andrés Cotton29 Jan 2026 7:54 UTC

19 points

1 comment2 min readLW link

Thoughts on AGI and world government

wdmacaskill and rosehadshar

29 Jan 2026 7:22 UTC

2 points

1 comment7 min readLW link

(www.forethought.org)

Unprecedented Times Require Unprecedented Caution When Handling Context

StanislavKrym29 Jan 2026 2:53 UTC

4 points

2 comments20 min readLW link

(hazardoustimes.substack.com)

Utrecht Meet & Greet

aad29 Jan 2026 0:56 UTC

10 points

2 comments1 min readLW link

How Articulate Are the Whales?

rba28 Jan 2026 21:24 UTC

73 points

26 comments6 min readLW link

(goflaw.substack.com)

The Heritage Foundation’s Everything Bagel

Alexander Turok28 Jan 2026 20:14 UTC

6 points

0 comments10 min readLW link

You Are Here: Historical Context for Unprecedented Times

Hazard28 Jan 2026 20:13 UTC

13 points

1 comment1 min readLW link

(open.substack.com)

Uncertain Updates: January 2026

Gordon Seidoh Worley28 Jan 2026 18:10 UTC

13 points

0 comments1 min readLW link

(www.uncertainupdates.com)

Made a game that tries to incentivize quality thinking & writing, looking for feedback

sleno28 Jan 2026 18:02 UTC

7 points

0 comments1 min readLW link

(argyu.fun)

Is the Gell-Mann effect overrated?

tgb28 Jan 2026 15:58 UTC

16 points

12 comments4 min readLW link

My simple argument for AI policy action

TFD28 Jan 2026 15:07 UTC

3 points

0 comments6 min readLW link

(www.thefloatingdroid.com)

Open Problems With Claude’s Constitution

Zvi28 Jan 2026 14:20 UTC

75 points

1 comment24 min readLW link

(thezvi.wordpress.com)

The State of Brain Emulation Report 2025 launched.

mschons28 Jan 2026 11:02 UTC

14 points

0 comments4 min readLW link