Non-Scheming Saints (Whether Human Or Digital) Might Be Shirking Their Governance Duties, And, If True, It Is Probably An Objective Tragedy

JenniferRM 16 Dec 2025 23:56 UTC
42 points
3 comments, 9 min read, LW link

A Primer on Operant Conditioning

foodforthought 16 Dec 2025 21:26 UTC
5 points
0 comments, 4 min read, LW link

Towards training-time mitigations for alignment faking in RL

16 Dec 2025 21:01 UTC
39 points
1 comment, 5 min read, LW link
(alignment.anthropic.com)

Measuring Drug Target Success

sarahconstantin 16 Dec 2025 21:00 UTC
19 points
3 comments, 2 min read, LW link
(sarahconstantin.substack.com)

A Study in Attention

hamilton 16 Dec 2025 20:39 UTC
14 points
0 comments, 2 min read, LW link

Emergent Sycophancy

ohdearohdear 16 Dec 2025 20:21 UTC
8 points
0 comments, 5 min read, LW link

Systems of Control

phoenix 16 Dec 2025 19:00 UTC
15 points
3 comments, 22 min read, LW link

Discursive Games, Discursive Warfare

Suspended Reason 16 Dec 2025 18:24 UTC
36 points
0 comments, 30 min read, LW link

Scientific breakthroughs of the year

technicalities 16 Dec 2025 18:00 UTC
185 points
13 comments, 3 min read, LW link
(x.com)

In defense of slop

jasoncrawford 16 Dec 2025 17:36 UTC
20 points
3 comments, 4 min read, LW link
(newsletter.rootsofprogress.org)

TSMC most definitely has a golden record of all AI chips it made

Naci Cankaya 16 Dec 2025 17:20 UTC
3 points
0 comments, 1 min read, LW link
(nacicankaya.substack.com)

The $140,000 Question

Zvi 16 Dec 2025 16:50 UTC
19 points
0 comments, 15 min read, LW link
(thezvi.wordpress.com)

Radiology Automation Does Not Generalize to Other Jobs

Xodarap 16 Dec 2025 14:32 UTC
47 points
5 comments, 1 min read, LW link

Fermi paradox solutions map

avturchin 16 Dec 2025 14:21 UTC
27 points
9 comments, 1 min read, LW link

According to doctors, how feasible is preserving the dying for future revival?

Ariel Zeleznikow-Johnston 16 Dec 2025 13:18 UTC
18 points
2 comments, 2 min read, LW link
(open.substack.com)

A friction in my dealings with friends who have not yet bought into the reality of AI risk

Olle Häggström 16 Dec 2025 8:12 UTC
19 points
13 comments, 4 min read, LW link

A Rationalist Christmas

Ryan Meservey 16 Dec 2025 7:23 UTC
5 points
1 comment, 4 min read, LW link

[Question] Why do LLMs so often say “It’s not an X, it’s a Y”?

ChristianKl 16 Dec 2025 1:02 UTC
27 points
13 comments, 1 min read, LW link

Response to titotal’s critique of our AI 2027 timelines model

16 Dec 2025 0:51 UTC
46 points
6 comments, 43 min read, LW link
(aifuturesnotes.substack.com)

Introducing Lunette: auditing agents for evals and environments

15 Dec 2025 23:17 UTC
23 points
0 comments, 1 min read, LW link
(fulcrumresearch.ai)

Private AI clouds are the future of inference

perfectfwd 15 Dec 2025 23:04 UTC
3 points
0 comments, 9 min read, LW link
(perfectforward.substack.com)

Naming

CTA 15 Dec 2025 23:00 UTC
3 points
0 comments, 4 min read, LW link

Viewing animals as economic agents

foodforthought 15 Dec 2025 18:13 UTC
10 points
2 comments, 5 min read, LW link

Digital Freedom Fund open for grant applications (Deadline: 17th February)

gergogaspar 15 Dec 2025 16:25 UTC
8 points
0 comments, 1 min read, LW link

Luna Lovegood and the Chamber of Secrets, Part 9

15 Dec 2025 16:01 UTC
2 points
0 comments, 1 min read, LW link

Defending Against Model Weight Exfiltration Through Inference Verification

15 Dec 2025 15:26 UTC
120 points
15 comments, 8 min read, LW link

Rotations in Superposition

15 Dec 2025 14:58 UTC
54 points
6 comments, 11 min read, LW link

What is an evaluation, and why this definition matters

Igor Ivanov 15 Dec 2025 14:53 UTC
33 points
1 comment, 7 min read, LW link

Conscious stars

Alexandre Variengien 15 Dec 2025 14:49 UTC
7 points
0 comments, 4 min read, LW link
(alexandrevariengien.com)

A Case for Model Persona Research

15 Dec 2025 13:35 UTC
121 points
11 comments, 4 min read, LW link

GPT-5.2 Is Frontier Only For The Frontier

Zvi 15 Dec 2025 13:20 UTC
33 points
1 comment, 19 min read, LW link
(thezvi.wordpress.com)

[Question] How to account for misinformation when looking for effective altruist causes?

SpectrumDT 15 Dec 2025 13:13 UTC
8 points
2 comments, 1 min read, LW link

Do you love Berkeley, or do you just love Lighthaven conferences?

Screwtape 15 Dec 2025 7:48 UTC
86 points
4 comments, 5 min read, LW link

When bits of optimization imply bits of modeling: the Touchette-Lloyd theorem

15 Dec 2025 4:21 UTC
32 points
0 comments, 11 min read, LW link

Notes on Software-Based Compute-Usage Verification

Alek Westover 15 Dec 2025 3:40 UTC
9 points
0 comments, 12 min read, LW link

Designing a Job Displacement Model

claywren 14 Dec 2025 22:23 UTC
22 points
0 comments, 19 min read, LW link

A high integrity/epistemics political coalition?

Raemon 14 Dec 2025 22:21 UTC
149 points
34 comments, 13 min read, LW link

Fanning Radiators

jefftk 14 Dec 2025 21:10 UTC
14 points
0 comments, 1 min read, LW link
(www.jefftk.com)

Abstraction as a generalization of algorithmic Markov condition

Daniel C 14 Dec 2025 18:55 UTC
8 points
0 comments, 7 min read, LW link

No, Americans Don’t Think Foreign Aid Is 26% of the Budget

Julius 14 Dec 2025 18:47 UTC
67 points
18 comments, 5 min read, LW link
(thegreymatter.substack.com)

A Life That Cannot Be A Failure

Bentham's Bulldog 14 Dec 2025 16:40 UTC
−7 points
0 comments, 5 min read, LW link

Should LLMs accept invites to Epstein’s island?

Lukas Petersson 14 Dec 2025 15:21 UTC
5 points
0 comments, 1 min read, LW link
(lukaspetersson.com)

The Axiom of Choice is Not Controversial

GenericModel 14 Dec 2025 4:08 UTC
44 points
29 comments, 7 min read, LW link
(enrichedjamsham.substack.com)

Open Source Replication of the Auditing Game Model Organism

abhayesian 14 Dec 2025 2:10 UTC
24 points
0 comments, 1 min read, LW link
(alignment.anthropic.com)

Why did I believe Oliver Sacks?

Eye You 13 Dec 2025 23:39 UTC
70 points
17 comments, 1 min read, LW link

In Favor of Inkhaven-But-Less

Alice Blair 13 Dec 2025 23:16 UTC
26 points
6 comments, 2 min read, LW link

Micro-visions for AI-powered online content

Alexandre Variengien 13 Dec 2025 23:05 UTC
11 points
0 comments, 8 min read, LW link
(alexandrevariengien.com)

When is it Worth Working?

foodforthought 13 Dec 2025 21:40 UTC
23 points
1 comment, 6 min read, LW link

[Question] What does “lattice of abstraction” mean?

Adam Zerner 13 Dec 2025 21:19 UTC
11 points
8 comments, 1 min read, LW link

Filler tokens don’t allow sequential reasoning

Brendan Long 13 Dec 2025 20:22 UTC
77 points
5 comments, 1 min read, LW link
(www.brendanlong.com)