All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 20252026

AllJanFeb Mar Apr May Jun

All 1 2 3 4 5 678 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Results: A self-randomized study of the impacts of glycine on sleep (Science is still hard)

thedissonance.net6 Jan 2026 20:54 UTC

6 points

1 comment3 min readLW link

(thedissonance.net)

My 2003 Post on the Evolutionary Argument for AI Misalignment

Wei Dai6 Jan 2026 20:45 UTC

37 points

7 comments2 min readLW link

Mainstream approach for alignment evals is a dead end

Igor Ivanov6 Jan 2026 19:52 UTC

60 points

9 comments5 min readLW link

Fertility Roundup #6: The Art of More Dakka

Zvi6 Jan 2026 19:50 UTC

32 points

5 comments26 min readLW link

(thezvi.wordpress.com)

On Owning Galaxies

Simon Lermen6 Jan 2026 18:16 UTC

154 points

62 comments3 min readLW link

(simonlermen.substack.com)

How hard is it to inoculate against misalignment generalization?

Jozdien6 Jan 2026 17:30 UTC

46 points

4 comments14 min readLW link

How AI Is Learning to Think in Secret

Nicholas Andresen6 Jan 2026 16:31 UTC

382 points

32 comments18 min readLW link

(nickandresen.substack.com)

Should you be posting on the open internet

zef6 Jan 2026 15:50 UTC

22 points

9 comments2 min readLW link

Catching misreporting about ML hardware use by turning noise into signal—Part II

Naci Cankaya6 Jan 2026 12:38 UTC

8 points

0 comments1 min readLW link

(nacicankaya.substack.com)

Meditations on Moloch in the AI Rat Race

Alexander Müller6 Jan 2026 9:46 UTC

11 points

1 comment6 min readLW link

[Question] Is anyone doing a real-world test of agentic misalignment?

Jamie Milton Freestone6 Jan 2026 7:45 UTC

2 points

1 comment1 min readLW link

Do we need sparsity afterall?

Giuseppe Birardi6 Jan 2026 6:06 UTC

20 points

5 comments29 min readLW link

Exploring Reinforcement Learning Effects on Chain-of-Thought Legibility

Julian H, RohanS, Baram Sosis, vedant-badoni and The-Turtle

6 Jan 2026 3:04 UTC

41 points

3 comments21 min readLW link

The Evolution Argument Sucks

peralice6 Jan 2026 2:32 UTC

30 points

6 comments8 min readLW link

Festival Stats 2025

jefftk6 Jan 2026 1:40 UTC

10 points

1 comment1 min readLW link

(www.jefftk.com)

Oversight Assistants: Turning Compute into Understanding

jsteinhardt6 Jan 2026 0:50 UTC

85 points

7 comments9 min readLW link

(bounded-regret.ghost.io)

Aether is hiring technical AI safety researchers

Rauno Arike, RohanS and Shubhorup Biswas

5 Jan 2026 22:27 UTC

22 points

0 comments2 min readLW link

[Question] Continual Learning Achieved?

PeterMcCluskey5 Jan 2026 22:22 UTC

−7 points

11 comments1 min readLW link

AGI will not be one specific system, it’ll be the unity of all systems

henophilia5 Jan 2026 18:21 UTC

−4 points

0 comments11 min readLW link

How to tame a complex system

jasoncrawford5 Jan 2026 18:20 UTC

27 points

0 comments2 min readLW link

(newsletter.rootsofprogress.org)

Broadening the training set for alignment

Seth Herd5 Jan 2026 17:30 UTC

40 points

11 comments9 min readLW link

Dos Capital

Zvi5 Jan 2026 16:40 UTC

71 points

10 comments17 min readLW link

(thezvi.wordpress.com)

Announcing the CLR Fundamentals Program

Tristan Cook5 Jan 2026 15:16 UTC

12 points

0 comments2 min readLW link

AI Risk timelines: 10% chance (by year X) should be the headline (and deadline), not 50%. And 10% is _this year_!

Greg C5 Jan 2026 11:57 UTC

61 points

18 comments1 min readLW link

Transformers, Intuitively

atharva5 Jan 2026 11:34 UTC

5 points

0 comments4 min readLW link

The Technology of Liberalism

L Rudolf L5 Jan 2026 11:04 UTC

41 points

7 comments29 min readLW link

(www.nosetgauge.com)

Axiological Stopsigns

JenniferRM5 Jan 2026 7:30 UTC

34 points

6 comments16 min readLW link

Artifical Expert/Expanded Narrow Intelligence, and Proto-AGI

Yuli_Ban5 Jan 2026 3:40 UTC

15 points

0 comments7 min readLW link

An Aphoristic Overview of Technical AI Alignment proposals

wassname5 Jan 2026 3:01 UTC

11 points

3 comments2 min readLW link

Claude Wrote Me a 400-Commit RSS Reader App

Brendan Long5 Jan 2026 2:52 UTC

35 points

11 comments3 min readLW link

(www.brendanlong.com)

The inaugural Redwood Research podcast

Buck and ryan_greenblatt

4 Jan 2026 22:11 UTC

146 points

10 comments142 min readLW link

LessOnline 2026 Improvement Ideas

nomagicpill4 Jan 2026 21:56 UTC

16 points

0 comments1 min readLW link

The economy is a graph, not a pipeline

anithite4 Jan 2026 21:48 UTC

33 points

10 comments4 min readLW link

Calling all college students (and new readers)

neo4 Jan 2026 21:20 UTC

15 points

0 comments1 min readLW link

Rock bottom terminal value

ihatenumbersinusernames74 Jan 2026 20:43 UTC

4 points

9 comments2 min readLW link

In My Misanthropy Era

jenn4 Jan 2026 18:34 UTC

352 points

153 comments8 min readLW link

(jenn.site)

The Thinking Machine

PeterMcCluskey4 Jan 2026 18:24 UTC

36 points

0 comments2 min readLW link

(bayesianinvestor.com)

The Maduro Polymarket bet is not “obviously insider trading”

ceselder4 Jan 2026 10:53 UTC

22 points

18 comments3 min readLW link

The Problem with Democracy

RandStrauss4 Jan 2026 7:11 UTC

−3 points

3 comments2 min readLW link

Examples of Subtle Alignment Failures from Claude and Gemini

Tachikoma4 Jan 2026 4:29 UTC

−9 points

1 comment5 min readLW link

Four Downsides of Training Policies Online

Alek Westover and egan

4 Jan 2026 3:17 UTC

29 points

4 comments3 min readLW link

Humanity’s Gambit

Ben Ihrig4 Jan 2026 3:08 UTC

5 points

5 comments3 min readLW link

Semantic Topological Spaces

TristanTrim4 Jan 2026 0:58 UTC

11 points

16 comments5 min readLW link

The surprising adequacy of the Roblox game marketplace

Esteban Restrepo3 Jan 2026 14:15 UTC

26 points

3 comments8 min readLW link

(papabos.substack.com)

Re: Anthropic Chinese Cyber-Attack. How Do We Protect Open-source Models?

Mayowa Osibodu3 Jan 2026 9:45 UTC

−1 points

2 comments6 min readLW link

Give Skepticism a Try

Ape in the coat3 Jan 2026 8:57 UTC

12 points

17 comments3 min readLW link

(apeinthecoat102771.substack.com)

Why We Should Talk Specifically Amid Uncertainty

sbaumohl3 Jan 2026 3:04 UTC

11 points

1 comment7 min readLW link

Companies as “proto-ASI”

beyarkay (Boyd Kane)3 Jan 2026 0:24 UTC

15 points

3 comments1 min readLW link

(boydkane.com)

AXRP Episode 47 - David Rein on METR Time Horizons

DanielFilan3 Jan 2026 0:10 UTC

21 points

0 comments46 min readLW link

The Weirdness of Dating/Mating: Deep Nonconsent Preference

johnswentworth2 Jan 2026 23:05 UTC

12 points

61 comments6 min readLW link