Solemn Courage

aysja4 Feb 2026 23:09 UTC
128 points
1 comment6 min readLW link

p-val­ues are good actually

speck14474 Feb 2026 22:04 UTC
9 points
8 comments3 min readLW link

Chess bots do not have goals

zulupineapple4 Feb 2026 21:11 UTC
2 points
10 comments1 min readLW link

Prevent­ing the apoc­a­lypse with power dis­tri­bu­tion theory

Rationalist112354 Feb 2026 18:44 UTC
2 points
0 comments4 min readLW link

Post-AGI Eco­nomics As If Noth­ing Ever Happens

Jan_Kulveit4 Feb 2026 17:39 UTC
254 points
43 comments8 min readLW link
(boundedlyrational.substack.com)

Vibestemics

Gordon Seidoh Worley4 Feb 2026 16:40 UTC
13 points
10 comments5 min readLW link
(www.uncertainupdates.com)

Kimi K2.5

Zvi4 Feb 2026 15:30 UTC
33 points
0 comments10 min readLW link
(thezvi.wordpress.com)

Ralph-wig­gum is Bad and An­thropic Should Fix It

d4hines4 Feb 2026 15:26 UTC
27 points
11 comments1 min readLW link

Who does a right to com­pute ac­tu­ally pro­tect?

TFD4 Feb 2026 15:09 UTC
25 points
0 comments5 min readLW link
(www.thefloatingdroid.com)

Rec­on­cil­ing Shan­non and Bayes.

Laureana Bonaparte4 Feb 2026 14:33 UTC
−24 points
1 comment1 min readLW link
(wallstreetweather.org)

An­thropic’s “Hot Mess” pa­per over­states its case (and the blog post is worse)

RobertM4 Feb 2026 6:30 UTC
288 points
28 comments6 min readLW link

A Black Box Made Less Opaque (part 2)

Matthew McDonnell4 Feb 2026 4:12 UTC
6 points
0 comments15 min readLW link

Thoughts on Toby Ords’ AI Scal­ing Series

Srdjan Miletic4 Feb 2026 0:41 UTC
10 points
1 comment4 min readLW link
(www.dissent.blog)

Lex­i­con of Life Regulation

henophilia3 Feb 2026 22:39 UTC
−15 points
0 comments15 min readLW link
(blog.hermesloom.org)

‘In­vent­ing the Re­nais­sance’ Review

Commander Zander3 Feb 2026 22:01 UTC
60 points
2 comments3 min readLW link

Con­crete re­search ideas on AI personas

3 Feb 2026 21:50 UTC
69 points
10 comments6 min readLW link

Progress links and short notes, 2026-01-26

jasoncrawford3 Feb 2026 21:42 UTC
11 points
0 comments5 min readLW link
(rootsofprogress.substack.com)

The Pro­jec­tion Prob­lem: Two Pit­falls in AI Safety Research

Shivam3 Feb 2026 21:03 UTC
6 points
2 comments6 min readLW link

New AI safety fund­ing newsletter

Bryce Robertson3 Feb 2026 20:23 UTC
42 points
0 comments1 min readLW link

dis­gust at util­ity maximization

pantalaimon3 Feb 2026 20:07 UTC
1 point
4 comments1 min readLW link

METR have re­leased Time Hori­zons 1.1

Sean Herrington3 Feb 2026 19:48 UTC
33 points
0 comments1 min readLW link
(metr.org)

AI Safety at the Fron­tier: Paper High­lights of Jan­uary 2026

gasteigerjo3 Feb 2026 18:56 UTC
22 points
0 comments9 min readLW link
(aisafetyfrontier.substack.com)

Un­less That Claw Is The Fa­mous OpenClaw

Zvi3 Feb 2026 15:00 UTC
39 points
5 comments16 min readLW link
(thezvi.wordpress.com)

Ex­po­nen­tial take­off of mediocrity

Valerii K.3 Feb 2026 14:41 UTC
4 points
5 comments32 min readLW link

AI for Hu­man Rea­son­ing for Rationalists

Oliver Sourbut3 Feb 2026 13:22 UTC
29 points
0 comments4 min readLW link
(www.oliversourbut.net)

Con­di­tion­al­iza­tion Con­founds Inoc­u­la­tion Prompt­ing Results

3 Feb 2026 11:50 UTC
78 points
5 comments19 min readLW link

The Atoms of Knowl­edge Aren’t Universal

Jonas Hallgren3 Feb 2026 10:52 UTC
19 points
4 comments13 min readLW link
(equilibria1.substack.com)

What did we learn from the AI Village in 2025?

Shoshannah Tekofsky3 Feb 2026 9:52 UTC
63 points
5 comments10 min readLW link
(theaidigest.org)

Thought Edit­ing: Steer­ing Models by Edit­ing Their Chain of Thought

3 Feb 2026 9:51 UTC
20 points
0 comments5 min readLW link

De­sign in­ter­na­tional AI pro­jects with DAID in mind

wdmacaskill3 Feb 2026 8:50 UTC
5 points
0 comments5 min readLW link
(www.forethought.org)

The Ado­les­cence is Already Here

Priyanka Bharadwaj3 Feb 2026 7:43 UTC
33 points
2 comments2 min readLW link

Ad­dress­ing De­ci­sion The­ory’s Si­mu­la­tion Problem

Ashe Vazquez Nuñez3 Feb 2026 7:02 UTC
11 points
0 comments3 min readLW link

Pa­ter­nal-Nar­ra­tive Ap­proach to AI Alignment

JD Croft3 Feb 2026 3:19 UTC
0 points
1 comment9 min readLW link

Non­prof­its De­serve Bet­ter Operations

Deena Englander3 Feb 2026 2:38 UTC
−2 points
3 comments6 min readLW link

Will AGI ar­rive be­fore the worst cli­mate tip­ping points?

SethW3 Feb 2026 2:36 UTC
13 points
0 comments8 min readLW link
(carboncreatures.substack.com)

In­creas­ing AI Strate­gic Com­pe­tence as a Safety Approach

Wei Dai3 Feb 2026 1:08 UTC
53 points
9 comments1 min readLW link

Con­di­tional Kick­starter for the “Don’t Build It” March

Raemon2 Feb 2026 22:58 UTC
165 points
35 comments4 min readLW link

Three ways to make Claude’s con­sti­tu­tion better

Parv Mahajan2 Feb 2026 21:48 UTC
36 points
3 comments2 min readLW link

Cross-Layer Transcoders are in­cen­tivized to learn Un­faith­ful Circuits

2 Feb 2026 21:32 UTC
46 points
6 comments18 min readLW link

Games as meditation

Vadim Golub2 Feb 2026 21:10 UTC
2 points
0 comments3 min readLW link

On Goal-Models

Richard_Ngo2 Feb 2026 18:44 UTC
136 points
15 comments4 min readLW link

“Fea­tures” aren’t always the true com­pu­ta­tional prim­i­tives of a model, but that might be fine any­ways

LawrenceC2 Feb 2026 18:41 UTC
18 points
0 comments5 min readLW link

Are there les­sons from high-re­li­a­bil­ity en­g­ineer­ing for AGI safety?

Steven Byrnes2 Feb 2026 15:26 UTC
161 points
15 comments8 min readLW link

Wel­come to Moltbook

Zvi2 Feb 2026 14:30 UTC
58 points
2 comments29 min readLW link
(thezvi.wordpress.com)

Molt­book and the AI Align­ment Problem

Logan Zoellner2 Feb 2026 9:35 UTC
15 points
1 comment5 min readLW link

Ap­pli­ca­tions Open for Im­pact Ac­cel­er­a­tor Program

High Impact Professionals2 Feb 2026 9:34 UTC
1 point
0 comments1 min readLW link

Em­piri­cist and Narrator

George3d62 Feb 2026 9:12 UTC
10 points
2 comments7 min readLW link
(cerebralab.com)

[Question] Propo­si­tion of policy for writ­ing ar­ti­cles to fact check faster

Crazy philosopher2 Feb 2026 8:51 UTC
3 points
0 comments1 min readLW link

I fi­nally fixed my footwear

dominicq2 Feb 2026 7:32 UTC
69 points
11 comments3 min readLW link
(sundaystopwatch.eu)

About half of Molt­book posts show de­sire for self-improvement

Stephen Elliott2 Feb 2026 6:14 UTC
20 points
11 comments2 min readLW link