No silver bul­let: Les­sons about how to cre­ate safety from the his­tory of fire

jasoncrawford26 Jan 2026 22:18 UTC
28 points
1 comment7 min readLW link
(newsletter.rootsofprogress.org)

List­ing the virtues from Claude’s “Con­sti­tu­tion”

David Gross26 Jan 2026 22:16 UTC
20 points
5 comments2 min readLW link

A Ra­tional Proposal

Archie Chaudhury26 Jan 2026 20:22 UTC
−2 points
0 comments14 min readLW link

Dario Amodei – The Ado­les­cence of Technology

habryka26 Jan 2026 19:10 UTC
147 points
62 comments73 min readLW link
(www.darioamodei.com)

Dialogue: Is there a Nat­u­ral Ab­strac­tion of Good?

26 Jan 2026 18:40 UTC
74 points
12 comments29 min readLW link

AlgZoo: un­in­ter­preted mod­els with fewer than 1,500 parameters

Jacob_Hilton26 Jan 2026 17:30 UTC
181 points
7 comments10 min readLW link
(www.alignment.org)

Aero­drop: far-UVC lamp giveaway

Austin Chen26 Jan 2026 17:03 UTC
34 points
0 comments2 min readLW link
(aerodrop.org)

Claude’s Con­sti­tu­tional Structure

Zvi26 Jan 2026 15:40 UTC
55 points
3 comments17 min readLW link
(thezvi.wordpress.com)

What ac­tu­ally mat­ters in neu­rotech star­tups (and what doesn’t)

Abhishaike Mahajan26 Jan 2026 15:19 UTC
37 points
0 comments16 min readLW link
(www.owlposting.com)

Rea­son­ing Long Jump: Why we shouldn’t rely on CoT mon­i­tor­ing for interpretability

tobypullan26 Jan 2026 10:10 UTC
9 points
2 comments6 min readLW link

Eons of Utopia

ceselder26 Jan 2026 9:26 UTC
13 points
0 comments2 min readLW link
(ceselder.substack.com)

Sun­ny­vale EA/​LW/​ACX meetup

wolverdude26 Jan 2026 7:16 UTC
2 points
1 comment1 min readLW link

The ‘Peo­ple Pleaser’ Prob­lem in LLMs

Kinsey Kappler26 Jan 2026 5:06 UTC
−7 points
2 comments1 min readLW link

Futarchy is Par­a­sitic on What It Tries to Govern

Nicolas Rasmont26 Jan 2026 5:05 UTC
11 points
2 comments12 min readLW link

Ada Palmer: In­vent­ing the Renaissance

Martin Sustrik26 Jan 2026 4:40 UTC
301 points
22 comments13 min readLW link
(www.250bpm.com)

I (well, mostly claude code) simu­lated pro­por­tional rep­re­sen­ta­tion meth­ods.

Charlie Steiner26 Jan 2026 4:23 UTC
11 points
1 comment4 min readLW link

How to do a digi­tal declutter

mingyuan26 Jan 2026 4:12 UTC
21 points
0 comments5 min readLW link
(mingyuan.substack.com)

Can you just vibe vuln­er­a­bil­ities?

Max von Hippel26 Jan 2026 3:07 UTC
19 points
5 comments5 min readLW link

Up­com­ing Dove­tail fel­low talks & discussion

Alex_Altair26 Jan 2026 2:39 UTC
29 points
0 comments3 min readLW link

Chan­nelguessr: A Dis­cord game

Brendan Long25 Jan 2026 23:17 UTC
8 points
0 comments1 min readLW link
(www.brendanlong.com)

[Question] How ac­cu­rate a model of the re­friger­a­tion cy­cle is this doo­dle?

Optimization Process25 Jan 2026 22:09 UTC
14 points
5 comments1 min readLW link

The Possessed Machines (sum­mary)

L Rudolf L25 Jan 2026 20:47 UTC
128 points
31 comments9 min readLW link
(possessedmachines.com)

Notable Progress Has Been Made in Whole Brain Emulation

Dom Polsinelli25 Jan 2026 19:07 UTC
103 points
15 comments6 min readLW link
(open.substack.com)

[Question] Are There Effec­tive In­ter­ven­tions to In­crease Distress Tol­er­ance?

simeon_c25 Jan 2026 18:50 UTC
9 points
1 comment1 min readLW link

Canada Lost Its Measles Elimi­na­tion Sta­tus Be­cause We Don’t Have Enough Nurses Who Speak Low German

jenn25 Jan 2026 18:33 UTC
325 points
24 comments7 min readLW link
(www.jenn.site)

To be well-cal­ibrated is to be punctual

moridinamael25 Jan 2026 18:10 UTC
97 points
17 comments2 min readLW link

A tale of three the­o­ries: spar­sity, frus­tra­tion, and statis­ti­cal field theory

Dmitry Vaintrob25 Jan 2026 18:09 UTC
63 points
0 comments18 min readLW link

Rein­vent­ing the wheel

dr_s25 Jan 2026 11:56 UTC
28 points
3 comments5 min readLW link

Cri­tique of ma­chine unlearning

myyycroft25 Jan 2026 10:50 UTC
2 points
0 comments5 min readLW link

Towards Sub-agent Dy­nam­ics and Con­flict

Ashe Vazquez Nuñez25 Jan 2026 5:27 UTC
13 points
1 comment3 min readLW link

The Vir­tual Mother-in-Law

Priyanka Bharadwaj25 Jan 2026 5:14 UTC
11 points
12 comments2 min readLW link

De­clin­ing Marginal Costs of Alienation

Celer25 Jan 2026 4:40 UTC
18 points
1 comment4 min readLW link
(keller.substack.com)

Struc­ture and func­tion of the hip­pocam­pal CA3 module

Devin Ward25 Jan 2026 1:57 UTC
4 points
0 comments1 min readLW link

What’s a good method­ol­ogy for “is Trump un­usual about ex­ec­u­tive over­reach /​ in­sti­tu­tion ero­sion /​ cor­rup­tion?”

25 Jan 2026 1:35 UTC
53 points
60 comments3 min readLW link

Clawed Abode: Claude Code is Too Cloudy

Brendan Long25 Jan 2026 0:15 UTC
13 points
2 comments2 min readLW link
(www.brendanlong.com)

Skill: cog­ni­tive black box flight recorder

TsviBT24 Jan 2026 22:54 UTC
27 points
2 comments5 min readLW link

In Defense of Memorization

David Goodman24 Jan 2026 22:49 UTC
24 points
7 comments13 min readLW link

Think­ing from the Other Side: Should I Wash My Hair with Sham­poo?

R0sberg24 Jan 2026 22:47 UTC
6 points
1 comment2 min readLW link

Small lan­guage mod­els hal­lu­ci­nate know­ing some­thing’s off.

Toheed24 Jan 2026 22:46 UTC
12 points
0 comments5 min readLW link

IABIED Book Re­view: Core Ar­gu­ments and Counterarguments

Stephen McAleese24 Jan 2026 14:25 UTC
90 points
39 comments25 min readLW link

The Global AI Dataset (GAID) Pro­ject: From Clos­ing Re­search Gaps to Build­ing Re­spon­si­ble and Trust­wor­thy AI

Jason Hung24 Jan 2026 3:23 UTC
7 points
0 comments15 min readLW link

A Black Box Made Less Opaque (part 1)

Matthew McDonnell24 Jan 2026 3:20 UTC
6 points
0 comments12 min readLW link

A Sim­ple Method for Ac­cel­er­at­ing Grokking

josh :)24 Jan 2026 3:19 UTC
14 points
1 comment3 min readLW link

Who is choos­ing your prefer­ences- You or your Mind?

shanzson24 Jan 2026 3:17 UTC
0 points
4 comments1 min readLW link

How I Used Method­able to Have a Nice Tuesday

dnsosebee24 Jan 2026 2:57 UTC
4 points
0 comments10 min readLW link

AI X-Risk Bot­tle­neck = Ad­vo­cacy?

fortytwo24 Jan 2026 2:52 UTC
10 points
0 comments1 min readLW link

Every Bench­mark is Broken

Jonathan Gabor24 Jan 2026 2:42 UTC
95 points
0 comments4 min readLW link
(jonathanpgabor.substack.com)

Thou­sand Year Old Ad­vice on Relin­quish­ing Con­trol to AI

Dom Polsinelli24 Jan 2026 2:20 UTC
−3 points
2 comments3 min readLW link
(dompols.substack.com)

AI Must Learn to Po­lice Itself

savant23 Jan 2026 22:39 UTC
1 point
0 comments2 min readLW link

Con­den­sa­tion & Relevance

abramdemski23 Jan 2026 22:21 UTC
38 points
0 comments5 min readLW link