The­o­ret­i­cal pre­dic­tions on the sam­ple effi­ciency of train­ing poli­cies and ac­ti­va­tion monitors

10 Jan 2026 23:50 UTC
18 points
2 comments7 min readLW link

If AI al­ign­ment is only as hard as build­ing the steam en­g­ine, then we likely still die

MichaelDickens10 Jan 2026 23:10 UTC
35 points
8 comments4 min readLW link

How Hu­man­ity Wins

Wes R10 Jan 2026 21:55 UTC
−20 points
10 comments4 min readLW link

Pos­si­ble Prin­ci­ples of Superagency

Mariven10 Jan 2026 21:00 UTC
14 points
0 comments12 min readLW link
(mariven.substack.com)

The Case Against Con­tin­u­ous Chain-of-Thought (Neu­ralese)

RobinHa10 Jan 2026 20:32 UTC
11 points
8 comments5 min readLW link

The false con­fi­dence the­o­rem and Bayesian reasoning

viking_math10 Jan 2026 17:14 UTC
24 points
19 comments6 min readLW link

A Pro­posal for a Bet­ter ARENA: Shift­ing from Teach­ing to Re­search Sprints

TheManxLoiner10 Jan 2026 16:56 UTC
28 points
15 comments6 min readLW link

Mo­ral-Epistemic Scrupu­los­ity: A Cross-Frame­work Failure Mode of Truth-Seeking

Tamara Sofía Falcone10 Jan 2026 2:24 UTC
17 points
2 comments8 min readLW link

Find­ing high sig­nal peo­ple—ap­ply­ing PageRank to Twitter

jfguan10 Jan 2026 2:21 UTC
27 points
0 comments3 min readLW link
(thefourierproject.org)

AI In­ci­dent Forecasting

cluebbers10 Jan 2026 2:17 UTC
8 points
0 comments1 min readLW link
(cluebbers.github.io)

6’7” Is Not Random

Martin Lichstam10 Jan 2026 2:13 UTC
−10 points
2 comments2 min readLW link

What do we mean by “im­pos­si­ble”?

Sniffnoy10 Jan 2026 0:01 UTC
23 points
3 comments2 min readLW link

Where’s the $100k iPhone?

beyarkay (Boyd Kane)9 Jan 2026 23:48 UTC
33 points
32 comments4 min readLW link
(boydkane.com)

Tak­ing LLMs Se­ri­ously (As Lan­guage Models)

abramdemski9 Jan 2026 23:23 UTC
57 points
9 comments17 min readLW link

FirstPrin­ci­ples Talks: Science in the Age of AI

Carly Turini9 Jan 2026 21:18 UTC
1 point
0 comments1 min readLW link

Ob­jec­tive Questions

JenniferRM9 Jan 2026 21:09 UTC
23 points
6 comments8 min readLW link

FirstPrin­ci­ples Talks: Shal­low Re­cur­rent De­coders for the Au­to­mated Dis­cov­ery of Phys­i­cal Models

Carly Turini9 Jan 2026 21:06 UTC
1 point
0 comments1 min readLW link

Cancer-Selec­tive, Pan-Essen­tial Tar­gets from DepMap

sarahconstantin9 Jan 2026 20:50 UTC
21 points
0 comments11 min readLW link
(sarahconstantin.substack.com)

Un­der­stand­ing com­plex con­ju­gates in quan­tum mechanics

jessicata9 Jan 2026 20:45 UTC
17 points
8 comments12 min readLW link
(unstableontology.com)

[Linkpost] On the Ori­gins of Al­gorith­mic Progress in AI

alex_fogelson9 Jan 2026 18:41 UTC
47 points
6 comments1 min readLW link
(open.substack.com)

Claude Codes

Zvi9 Jan 2026 17:10 UTC
75 points
7 comments20 min readLW link
(thezvi.wordpress.com)

Leo in me

Rudaiba9 Jan 2026 15:55 UTC
−1 points
4 comments1 min readLW link

[Question] Another Cost Disease? We are all cap­i­tal­ists now

Oliver Sourbut9 Jan 2026 13:07 UTC
16 points
11 comments2 min readLW link

Align­ment Fak­ing is a Lin­ear Fea­ture in An­thropic’s Hughes Model (Edited 1/​11/​26)

James Hoffend9 Jan 2026 12:03 UTC
34 points
4 comments4 min readLW link

What do peo­ple mean by “re­cur­sive self-im­prove­ment”?

Expertium9 Jan 2026 11:15 UTC
9 points
4 comments1 min readLW link

Pa­ram­e­ters of Me­tacog­ni­tion—The Anes­the­sia Patient

Gunnar_Zarncke9 Jan 2026 1:20 UTC
21 points
0 comments8 min readLW link

I dream ev­ery night now

Declan Molony9 Jan 2026 0:34 UTC
34 points
5 comments4 min readLW link

The Eco­nomics of Trans­for­ma­tive AI

8 Jan 2026 22:22 UTC
64 points
4 comments18 min readLW link
(post-agi.org)

The Hunger Strike To Stop The AI Race

Michaël Trazzi8 Jan 2026 21:05 UTC
37 points
0 comments1 min readLW link
(www.youtube.com)

Skep­ti­cism about In­tro­spec­tion in LLMs

derek shiller8 Jan 2026 20:07 UTC
11 points
7 comments36 min readLW link

On ra­tio­nal­ity skills

dominicq8 Jan 2026 19:23 UTC
3 points
3 comments1 min readLW link
(sundaystopwatch.eu)

Self-Help Tac­tics That Are Work­ing For Me

sarahconstantin8 Jan 2026 18:00 UTC
54 points
2 comments11 min readLW link
(sarahconstantin.substack.com)

Dist­in­guish­ing Sen­sory Qualia Us­ing Neu­ral Structure

Shiva's Right Foot8 Jan 2026 16:45 UTC
4 points
0 comments8 min readLW link

Why LLMs Aren’t Scien­tists Yet.

Dhruv Trehan8 Jan 2026 16:06 UTC
39 points
3 comments5 min readLW link
(arxiv.org)

Can We Make AI Align­ment Fram­ing Less Wrong?

Anurag 8 Jan 2026 15:20 UTC
3 points
0 comments4 min readLW link

AI #150: While Claude Codes

Zvi8 Jan 2026 15:00 UTC
42 points
3 comments20 min readLW link
(thezvi.wordpress.com)

Say­ing What You Want

omegastick8 Jan 2026 14:12 UTC
17 points
0 comments3 min readLW link
(dumbideas.xyz)

Small Steps Towards Prov­ing Stochas­tic → Deter­minis­tic Nat­u­ral Latent

8 Jan 2026 12:27 UTC
57 points
4 comments10 min readLW link

Us­ing Anki to mem­o­rise the names of the MATS 9 cohort

beyarkay (Boyd Kane)8 Jan 2026 4:41 UTC
5 points
0 comments3 min readLW link
(boydkane.com)

Rents Are High, But Not Skyrocketing

jefftk8 Jan 2026 2:40 UTC
29 points
1 comment1 min readLW link
(www.jefftk.com)

The AI In­fras­truc­ture Se­cu­rity Shortlist

Abbey Chaver8 Jan 2026 2:26 UTC
21 points
0 comments6 min readLW link

HIA and X-risk part 2: Why it hurts

TsviBT8 Jan 2026 1:19 UTC
63 points
13 comments21 min readLW link

Beliefs and po­si­tion go­ing into 2026

RussellThor8 Jan 2026 1:11 UTC
5 points
0 comments5 min readLW link

Lu­mina Pro­biotic worked for me!

Eye You8 Jan 2026 0:34 UTC
46 points
3 comments2 min readLW link

Taiwan Trip Report

nomagicpill7 Jan 2026 23:40 UTC
11 points
0 comments9 min readLW link
(nomagicpill.substack.com)

Public in­tel­lec­tu­als need to say what they ac­tu­ally believe

Aaron Bergman7 Jan 2026 21:22 UTC
79 points
12 comments14 min readLW link
(www.aaronbergman.net)

Two Aspects of Si­tu­a­tional Aware­ness: World Model­ling & In­dex­i­cal Information

David Scott Krueger7 Jan 2026 20:24 UTC
40 points
7 comments2 min readLW link

Ad­vance­ments In Self-Driv­ing Cars

Zvi7 Jan 2026 19:50 UTC
30 points
2 comments17 min readLW link
(thezvi.wordpress.com)

Two ways non-U.S. folks can con­tribute to AI go­ing well

Joe Rogero7 Jan 2026 19:37 UTC
21 points
1 comment2 min readLW link
(subatomicarticles.com)

Every­thing is Poli­ti­cal Now, or, A Re­view of “Frag­gle Rock: Back to the Rock”

Gordon Seidoh Worley7 Jan 2026 17:00 UTC
13 points
0 comments8 min readLW link
(www.uncertainupdates.com)