5 Jan 2026 22:27 UTC

22 points

0 comments2 min readLW link

[Question] Continual Learning Achieved?

PeterMcCluskey5 Jan 2026 22:22 UTC

−7 points

11 comments1 min readLW link

AGI will not be one specific system, it’ll be the unity of all systems

henophilia5 Jan 2026 18:21 UTC

−4 points

0 comments11 min readLW link

How to tame a complex system

jasoncrawford5 Jan 2026 18:20 UTC

27 points

0 comments2 min readLW link

(newsletter.rootsofprogress.org)

Broadening the training set for alignment

Seth Herd5 Jan 2026 17:30 UTC

40 points

11 comments9 min readLW link

Dos Capital

Zvi5 Jan 2026 16:40 UTC

71 points

10 comments17 min readLW link

(thezvi.wordpress.com)

Announcing the CLR Fundamentals Program

Tristan Cook5 Jan 2026 15:16 UTC

12 points

0 comments2 min readLW link

AI Risk timelines: 10% chance (by year X) should be the headline (and deadline), not 50%. And 10% is _this year_!

Greg C5 Jan 2026 11:57 UTC

61 points

18 comments1 min readLW link

Transformers, Intuitively

atharva5 Jan 2026 11:34 UTC

5 points

0 comments4 min readLW link

The Technology of Liberalism

L Rudolf L5 Jan 2026 11:04 UTC

41 points

7 comments29 min readLW link

(www.nosetgauge.com)

Axiological Stopsigns

JenniferRM5 Jan 2026 7:30 UTC

34 points

6 comments16 min readLW link

Artifical Expert/Expanded Narrow Intelligence, and Proto-AGI

Yuli_Ban5 Jan 2026 3:40 UTC

15 points

0 comments7 min readLW link

An Aphoristic Overview of Technical AI Alignment proposals

wassname5 Jan 2026 3:01 UTC

11 points

3 comments2 min readLW link

Claude Wrote Me a 400-Commit RSS Reader App

Brendan Long5 Jan 2026 2:52 UTC

35 points

11 comments3 min readLW link

(www.brendanlong.com)

The inaugural Redwood Research podcast

Buck and ryan_greenblatt

4 Jan 2026 22:11 UTC

146 points

10 comments142 min readLW link

LessOnline 2026 Improvement Ideas

nomagicpill4 Jan 2026 21:56 UTC

16 points

0 comments1 min readLW link

The economy is a graph, not a pipeline

anithite4 Jan 2026 21:48 UTC

33 points

10 comments4 min readLW link

Calling all college students (and new readers)

neo4 Jan 2026 21:20 UTC

15 points

0 comments1 min readLW link

Rock bottom terminal value

ihatenumbersinusernames74 Jan 2026 20:43 UTC

4 points

9 comments2 min readLW link

In My Misanthropy Era

jenn4 Jan 2026 18:34 UTC

352 points

153 comments8 min readLW link

(jenn.site)

The Thinking Machine

PeterMcCluskey4 Jan 2026 18:24 UTC

36 points

0 comments2 min readLW link

(bayesianinvestor.com)

The Maduro Polymarket bet is not “obviously insider trading”

ceselder4 Jan 2026 10:53 UTC

22 points

18 comments3 min readLW link

The Problem with Democracy

RandStrauss4 Jan 2026 7:11 UTC

−3 points

3 comments2 min readLW link

Examples of Subtle Alignment Failures from Claude and Gemini

Tachikoma4 Jan 2026 4:29 UTC

−9 points

1 comment5 min readLW link

Four Downsides of Training Policies Online

Alek Westover and egan

4 Jan 2026 3:17 UTC

29 points

4 comments3 min readLW link

Humanity’s Gambit

Ben Ihrig4 Jan 2026 3:08 UTC

5 points

5 comments3 min readLW link

Semantic Topological Spaces

TristanTrim4 Jan 2026 0:58 UTC

11 points

16 comments5 min readLW link

The surprising adequacy of the Roblox game marketplace

Esteban Restrepo3 Jan 2026 14:15 UTC

26 points

3 comments8 min readLW link

(papabos.substack.com)

Re: Anthropic Chinese Cyber-Attack. How Do We Protect Open-source Models?

Mayowa Osibodu3 Jan 2026 9:45 UTC

−1 points

2 comments6 min readLW link

Give Skepticism a Try

Ape in the coat3 Jan 2026 8:57 UTC

12 points

17 comments3 min readLW link

(apeinthecoat102771.substack.com)

Why We Should Talk Specifically Amid Uncertainty

sbaumohl3 Jan 2026 3:04 UTC

11 points

1 comment7 min readLW link

Companies as “proto-ASI”

beyarkay (Boyd Kane)3 Jan 2026 0:24 UTC

15 points

3 comments1 min readLW link

(boydkane.com)

AXRP Episode 47 - David Rein on METR Time Horizons

DanielFilan3 Jan 2026 0:10 UTC

21 points

0 comments46 min readLW link

The Weirdness of Dating/Mating: Deep Nonconsent Preference

johnswentworth2 Jan 2026 23:05 UTC

12 points

61 comments6 min readLW link

Can AI learn human societal norms from social feedback (without recapitulating all the ways this has failed in human history?)

foodforthought2 Jan 2026 22:11 UTC

7 points

3 comments4 min readLW link

Fertility Roundup #5: Causation

Zvi2 Jan 2026 22:00 UTC

19 points

5 comments25 min readLW link

(thezvi.wordpress.com)

Scale-Free Goodness

testingthewaters2 Jan 2026 21:00 UTC

10 points

3 comments5 min readLW link

(aclevername.substack.com)

Does developmental cognitive psychology provide any hints for making model alignment more robust?

foodforthought2 Jan 2026 20:31 UTC

7 points

0 comments3 min readLW link

Does evolution provide any hints for making model alignment more robust?

foodforthought2 Jan 2026 19:06 UTC

5 points

0 comments4 min readLW link

Where do AI Safety Fellows go? Analyzing a dataset of 600+ alumni

Christopher_Clay2 Jan 2026 18:14 UTC

20 points

2 comments5 min readLW link

(forum.effectivealtruism.org)

Instruct Vectors—Base models can be instruct with activation vectors

Eriskii2 Jan 2026 18:14 UTC

21 points

0 comments8 min readLW link

[Advanced Intro to AI Alignment] 2. What Values May an AI Learn? — 4 Key Problems

Towards_Keeperhood2 Jan 2026 14:51 UTC

33 points

10 comments19 min readLW link

2025 Letter

zef2 Jan 2026 13:57 UTC

10 points

0 comments14 min readLW link

(zephyyr.substack.com)

2025 in AI predictions

jessicata2 Jan 2026 4:29 UTC

245 points

19 comments11 min readLW link

Debunking claims about subquadratic attention

Vladimir Ivanov2 Jan 2026 4:23 UTC

32 points

5 comments3 min readLW link

The bio-pirate’s guide to GLP-1 agonists

quiet_NaN2 Jan 2026 3:32 UTC

40 points

3 comments5 min readLW link

College Was Not That Terrible Now That I’m Not That Crazy

Zack_M_Davis1 Jan 2026 23:14 UTC

90 points

9 comments44 min readLW link

(zackmdavis.net)

Taiwan war timelines might be shorter than AI timelines

Baram Sosis1 Jan 2026 22:30 UTC

108 points

21 comments5 min readLW link

Split (Part 1)

Shoshannah Tekofsky1 Jan 2026 22:29 UTC

27 points

2 comments4 min readLW link

(shoshanigans.substack.com)

[Question] Who is responsible for shutting down rogue AI?

Cole Wyeth1 Jan 2026 21:36 UTC

45 points

2 comments1 min readLW link