Taiwan Trip Report

nomagicpill 7 Jan 2026 23:40 UTC

11 points

0 comments 9 min read LW link

(nomagicpill.substack.com)

Public intellectuals need to say what they actually believe

Aaron Bergman 7 Jan 2026 21:22 UTC

79 points

12 comments 14 min read LW link

(www.aaronbergman.net)

Two Aspects of Situational Awareness: World Modelling & Indexical Information

David Scott Krueger 7 Jan 2026 20:24 UTC

40 points

7 comments 2 min read LW link

Advancements In Self-Driving Cars

Zvi 7 Jan 2026 19:50 UTC

30 points

2 comments 17 min read LW link

(thezvi.wordpress.com)

Two ways non-U.S. folks can contribute to AI going well

Joe Rogero 7 Jan 2026 19:37 UTC

21 points

1 comment 2 min read LW link

(subatomicarticles.com)

Everything is Political Now, or, A Review of “Fraggle Rock: Back to the Rock”

Gordon Seidoh Worley 7 Jan 2026 17:00 UTC

13 points

0 comments 8 min read LW link

(www.uncertainupdates.com)

FirstPrinciples Talks: Quantum machines learning quantum

Carly Turini 7 Jan 2026 16:44 UTC

3 points

0 comments 1 min read LW link

Does mindfulness meditation lead to awakening?

Vadim Golub 7 Jan 2026 14:08 UTC

16 points

0 comments 4 min read LW link

An interactive toy model for exploring AI’s effect on the labour market

CharlesD 7 Jan 2026 12:57 UTC

12 points

0 comments 7 min read LW link

OpenForecaster: How to train language models for open-ended forecasting?

nikhilchandak, shash42 and bayesian_kitten

7 Jan 2026 11:03 UTC

10 points

1 comment 7 min read LW link

ML research directions for preventing catastrophic data poisoning

Tom Davidson 7 Jan 2026 10:16 UTC

35 points

1 comment 10 min read LW link

(newsletter.forethought.org)

A Loser’s Reflections

L.M.Sherlock 7 Jan 2026 7:15 UTC

9 points

12 comments 18 min read LW link

(lmsherlock.substack.com)

Algorithmic Dating

denzit 7 Jan 2026 2:39 UTC

−2 points

0 comments 3 min read LW link

(denzit.substack.com)

Simple summary of AI Safety laws

Joseph Miller, Espedair Street and PauseAI UK

7 Jan 2026 1:51 UTC

46 points

4 comments 3 min read LW link

Results: A self-randomized study of the impacts of glycine on sleep (Science is still hard)

thedissonance.net 6 Jan 2026 20:54 UTC

6 points

1 comment 3 min read LW link

(thedissonance.net)

My 2003 Post on the Evolutionary Argument for AI Misalignment

Wei Dai 6 Jan 2026 20:45 UTC

37 points

7 comments 2 min read LW link

Mainstream approach for alignment evals is a dead end

Igor Ivanov 6 Jan 2026 19:52 UTC

60 points

9 comments 5 min read LW link

Fertility Roundup #6: The Art of More Dakka

Zvi 6 Jan 2026 19:50 UTC

32 points

5 comments 26 min read LW link

(thezvi.wordpress.com)

On Owning Galaxies

Simon Lermen 6 Jan 2026 18:16 UTC

154 points

62 comments 3 min read LW link

(simonlermen.substack.com)

How hard is it to inoculate against misalignment generalization?

Jozdien 6 Jan 2026 17:30 UTC

46 points

4 comments 14 min read LW link

How AI Is Learning to Think in Secret

Nicholas Andresen 6 Jan 2026 16:31 UTC

382 points

32 comments 18 min read LW link

(nickandresen.substack.com)

Should you be posting on the open internet

zef 6 Jan 2026 15:50 UTC

22 points

9 comments 2 min read LW link

Catching misreporting about ML hardware use by turning noise into signal—Part II

Naci Cankaya 6 Jan 2026 12:38 UTC

8 points

0 comments 1 min read LW link

(nacicankaya.substack.com)

Meditations on Moloch in the AI Rat Race

Alexander Müller 6 Jan 2026 9:46 UTC

11 points

1 comment 6 min read LW link

[Question] Is anyone doing a real-world test of agentic misalignment?

Jamie Milton Freestone 6 Jan 2026 7:45 UTC

2 points

1 comment 1 min read LW link

Do we need sparsity afterall?

Giuseppe Birardi 6 Jan 2026 6:06 UTC

20 points

5 comments 29 min read LW link

Exploring Reinforcement Learning Effects on Chain-of-Thought Legibility

Julian H, RohanS, Baram Sosis, vedant-badoni and The-Turtle

6 Jan 2026 3:04 UTC

41 points

3 comments 21 min read LW link

The Evolution Argument Sucks

peralice 6 Jan 2026 2:32 UTC

30 points

6 comments 8 min read LW link

Festival Stats 2025

jefftk 6 Jan 2026 1:40 UTC

10 points

1 comment 1 min read LW link

(www.jefftk.com)

Oversight Assistants: Turning Compute into Understanding

jsteinhardt 6 Jan 2026 0:50 UTC

85 points

7 comments 9 min read LW link

(bounded-regret.ghost.io)

Aether is hiring technical AI safety researchers

Rauno Arike, RohanS and Shubhorup Biswas

5 Jan 2026 22:27 UTC

22 points

0 comments 2 min read LW link

[Question] Continual Learning Achieved?

PeterMcCluskey 5 Jan 2026 22:22 UTC

−7 points

11 comments 1 min read LW link

AGI will not be one specific system, it’ll be the unity of all systems

henophilia 5 Jan 2026 18:21 UTC

−4 points

0 comments 11 min read LW link

How to tame a complex system

jasoncrawford 5 Jan 2026 18:20 UTC

27 points

0 comments 2 min read LW link

(newsletter.rootsofprogress.org)

Broadening the training set for alignment

Seth Herd 5 Jan 2026 17:30 UTC

40 points

11 comments 9 min read LW link

Dos Capital

Zvi 5 Jan 2026 16:40 UTC

71 points

10 comments 17 min read LW link

(thezvi.wordpress.com)

Announcing the CLR Fundamentals Program

Tristan Cook 5 Jan 2026 15:16 UTC

12 points

0 comments 2 min read LW link

AI Risk timelines: 10% chance (by year X) should be the headline (and deadline), not 50%. And 10% is _this year_!

Greg C 5 Jan 2026 11:57 UTC

61 points

18 comments 1 min read LW link

Transformers, Intuitively

atharva 5 Jan 2026 11:34 UTC

5 points

0 comments 4 min read LW link

The Technology of Liberalism

L Rudolf L 5 Jan 2026 11:04 UTC

41 points

7 comments 29 min read LW link

(www.nosetgauge.com)

Axiological Stopsigns

JenniferRM 5 Jan 2026 7:30 UTC

34 points

6 comments 16 min read LW link

Artifical Expert/Expanded Narrow Intelligence, and Proto-AGI

Yuli_Ban 5 Jan 2026 3:40 UTC

15 points

0 comments 7 min read LW link

An Aphoristic Overview of Technical AI Alignment proposals

wassname 5 Jan 2026 3:01 UTC

11 points

3 comments 2 min read LW link

Claude Wrote Me a 400-Commit RSS Reader App

Brendan Long 5 Jan 2026 2:52 UTC

35 points

11 comments 3 min read LW link

(www.brendanlong.com)

The inaugural Redwood Research podcast

Buck and ryan_greenblatt

4 Jan 2026 22:11 UTC

146 points

10 comments 142 min read LW link

LessOnline 2026 Improvement Ideas

nomagicpill 4 Jan 2026 21:56 UTC

16 points

0 comments 1 min read LW link

The economy is a graph, not a pipeline

anithite 4 Jan 2026 21:48 UTC

33 points

10 comments 4 min read LW link

Calling all college students (and new readers)

neo 4 Jan 2026 21:20 UTC

15 points

0 comments 1 min read LW link

Rock bottom terminal value

ihatenumbersinusernames7 4 Jan 2026 20:43 UTC

4 points

9 comments 2 min read LW link

In My Misanthropy Era

jenn 4 Jan 2026 18:34 UTC

352 points

153 comments 8 min read LW link

(jenn.site)