Con­di­tional Kick­starter for the “Don’t Build It” March

Raemon2 Feb 2026 22:58 UTC
165 points
35 comments4 min readLW link

Three ways to make Claude’s con­sti­tu­tion better

Parv Mahajan2 Feb 2026 21:48 UTC
36 points
3 comments2 min readLW link

Cross-Layer Transcoders are in­cen­tivized to learn Un­faith­ful Circuits

2 Feb 2026 21:32 UTC
46 points
6 comments18 min readLW link

Games as meditation

Vadim Golub2 Feb 2026 21:10 UTC
2 points
0 comments3 min readLW link

On Goal-Models

Richard_Ngo2 Feb 2026 18:44 UTC
136 points
15 comments4 min readLW link

“Fea­tures” aren’t always the true com­pu­ta­tional prim­i­tives of a model, but that might be fine any­ways

LawrenceC2 Feb 2026 18:41 UTC
18 points
0 comments5 min readLW link

Are there les­sons from high-re­li­a­bil­ity en­g­ineer­ing for AGI safety?

Steven Byrnes2 Feb 2026 15:26 UTC
161 points
15 comments8 min readLW link

Wel­come to Moltbook

Zvi2 Feb 2026 14:30 UTC
58 points
2 comments29 min readLW link
(thezvi.wordpress.com)

Molt­book and the AI Align­ment Problem

Logan Zoellner2 Feb 2026 9:35 UTC
15 points
1 comment5 min readLW link

Ap­pli­ca­tions Open for Im­pact Ac­cel­er­a­tor Program

High Impact Professionals2 Feb 2026 9:34 UTC
1 point
0 comments1 min readLW link

Em­piri­cist and Narrator

George3d62 Feb 2026 9:12 UTC
10 points
2 comments7 min readLW link
(cerebralab.com)

[Question] Propo­si­tion of policy for writ­ing ar­ti­cles to fact check faster

Crazy philosopher2 Feb 2026 8:51 UTC
3 points
0 comments1 min readLW link

I fi­nally fixed my footwear

dominicq2 Feb 2026 7:32 UTC
69 points
11 comments3 min readLW link
(sundaystopwatch.eu)

About half of Molt­book posts show de­sire for self-improvement

Stephen Elliott2 Feb 2026 6:14 UTC
20 points
11 comments2 min readLW link

How to pre­vent build­ing a soft­ware-Ultron

PratyushRT2 Feb 2026 6:07 UTC
1 point
0 comments2 min readLW link

The limit­ing fac­tor in AI pro­gram­ming is the syn­chro­niza­tion over­head be­tween two minds

jnalanko2 Feb 2026 6:04 UTC
20 points
3 comments1 min readLW link

Ap­ply­ing Tem­per­a­ture to LLM Out­puts Se­man­ti­cally to Min­imise Low-Tem­per­a­ture Hallucinations

Brodie Eaton2 Feb 2026 6:02 UTC
9 points
0 comments4 min readLW link

Thoughts the Un­rea­son­able Effec­tive­ness of Maths

Srdjan Miletic2 Feb 2026 6:00 UTC
16 points
5 comments4 min readLW link
(www.dissent.blog)

The Smok­ing Le­sion Doesn’t Really Dist­in­guish EDT from CDT

Srdjan Miletic2 Feb 2026 5:57 UTC
14 points
5 comments2 min readLW link
(www.dissent.blog)

Word im­por­tance in text ⇐ con­di­tional in­for­ma­tion of the to­ken in the con­text. Is this as­sump­tion valid?

yun dong2 Feb 2026 5:50 UTC
3 points
3 comments1 min readLW link

The Meta-An­thropic Argument

RogerDearnaley2 Feb 2026 1:10 UTC
41 points
55 comments2 min readLW link

Emo­tions and Reality

small identity1 Feb 2026 22:40 UTC
13 points
1 comment4 min readLW link

Si­tu­a­tional Aware­ness is (mostly) here to stay

atharva1 Feb 2026 21:40 UTC
10 points
0 comments1 min readLW link

Are you look­ing for Nep­tune or Vul­can?

Mati_Roy1 Feb 2026 20:59 UTC
18 points
0 comments1 min readLW link

What It’s Like To Be A Worm (Notes on Border­line Sen­tience)

Niko_McCarty1 Feb 2026 17:33 UTC
18 points
3 comments25 min readLW link
(www.asimov.press)

Differ­en­tially Scary Movies

jefftk1 Feb 2026 14:40 UTC
43 points
1 comment1 min readLW link
(www.jefftk.com)

Would you kill a vul­can to save a shrimp?

James Diacoumis1 Feb 2026 12:46 UTC
10 points
8 comments6 min readLW link
(substack.com)

Do LLMs Learn Our Prefer­ences or Just Our Be­hav­iors?

wassname1 Feb 2026 11:28 UTC
13 points
0 comments1 min readLW link

[Question] Pre­dic­tions of molt­book, crustafar­i­ans, and SOUL.md

Aprillion1 Feb 2026 9:01 UTC
21 points
6 comments1 min readLW link

What would it mean for the My­ers-Briggs per­son­al­ity test to be pseu­do­scien­tific?

Yair Halberstadt1 Feb 2026 8:32 UTC
20 points
11 comments3 min readLW link

How does rea­son­ing af­fect Eth­i­cal/​Mo­ral task re­sults?

Kaustubh Kislay1 Feb 2026 4:49 UTC
9 points
0 comments3 min readLW link

[Question] Whence un­change­able val­ues?

ihatenumbersinusernames71 Feb 2026 3:49 UTC
9 points
6 comments1 min readLW link

Book re­view: Already Free

Thomas Broadley1 Feb 2026 3:14 UTC
21 points
4 comments10 min readLW link
(thomasbroadley.com)

[LINK] Solv­ing scurvy through deus ex machina: How a sci­en­tific the­ory is born

Kotlopou1 Feb 2026 0:45 UTC
10 points
0 comments1 min readLW link
(beatingthehydra.substack.com)

Gra­di­ent-Based Re­cov­ery of Me­morized Diffu­sion Model Data

RobinHa1 Feb 2026 0:05 UTC
10 points
0 comments3 min readLW link

Molt­book shit­posts are ac­tu­ally re­ally funny

Sean Herrington31 Jan 2026 23:34 UTC
51 points
4 comments15 min readLW link

On ‘In­vent­ing Tem­per­a­ture’ and the re­al­ness of properties

DanielFilan31 Jan 2026 23:31 UTC
42 points
8 comments7 min readLW link
(danielfilan.com)

Some thoughts on what would make me en­dorse an AGI lab

Eli Tyre31 Jan 2026 23:14 UTC
42 points
19 comments5 min readLW link

Nick and “Eter­nity”

MarkelKori31 Jan 2026 21:50 UTC
5 points
0 comments8 min readLW link

Hu­mans can post on moltbook

shash4231 Jan 2026 21:06 UTC
24 points
3 comments1 min readLW link

An Ex­pli­ca­tion of Align­ment Optimism

Oliver Daniels31 Jan 2026 20:58 UTC
43 points
22 comments1 min readLW link

Ba­sics of How Not to Die

31 Jan 2026 19:04 UTC
111 points
20 comments4 min readLW link

An Abla­tion Study on the Role of [Un­trans­lat­able] in Co­op­er­a­tive Equil­ibrium For­ma­tion: Emer­gent Ra­tion­al­iza­tion Un­der Miss­ing Primitives

Florian_Dietz31 Jan 2026 18:03 UTC
22 points
5 comments11 min readLW link

Disjunc­tive ar­gu­ments can be a re­verse mul­ti­ple-stage fallacy

TFD31 Jan 2026 15:46 UTC
41 points
6 comments1 min readLW link
(www.thefloatingdroid.com)

Jan­uary 2026 Links

nomagicpill31 Jan 2026 15:14 UTC
9 points
3 comments8 min readLW link
(nomagicpill.substack.com)

If the Su­per­in­tel­li­gence were near fallacy

MP31 Jan 2026 15:04 UTC
22 points
3 comments8 min readLW link

Swiss fi­nan­cial reg­u­la­tor re­signs af­ter blog post from MITx DEDP on­line learner (FINMA, JuristGate, Par­reaux, Thiébaud & Part­ners)

pocock30 Jan 2026 23:53 UTC
2 points
0 comments1 min readLW link

Fore­cast: Re­cur­sively Self-im­prov­ing AI for 2033

CuoreDiVetro30 Jan 2026 23:53 UTC
0 points
0 comments3 min readLW link

Se­nior Re­searcher—MIT AI Risk Initiative

peterslattery30 Jan 2026 23:06 UTC
8 points
0 comments5 min readLW link

36,000 AI Agents Are Now Speedrun­ning Civilization

Michaël Trazzi30 Jan 2026 21:21 UTC
86 points
27 comments1 min readLW link