Annals of Counterfactual Han

GenericModel
12 Dec 2025 1:11 UTC
30 points
0 comments
6 min read
(enrichedjamsham.substack.com)

Does dissolving newcomb’s paradox matter?

Srdjan Miletic
12 Dec 2025 1:06 UTC
9 points
2 comments
2 min read
(www.dissent.blog)

Designing the World’s Safest AI based on Morality Models & Vipassana

shanzson
11 Dec 2025 23:33 UTC
−5 points
0 comments
21 min read

Anhedoniapolis

Alex Beyman
11 Dec 2025 21:53 UTC
13 points
0 comments
77 min read

ASI Already Knows About Torture—In Defense of Talking Openly About S-Risks

KatWoods
11 Dec 2025 21:15 UTC
−7 points
0 comments
2 min read

Cognitive Tech from Algorithmic Information Theory

Cole Wyeth
11 Dec 2025 20:32 UTC
31 points
6 comments
1 min read

Weird Generalization & Inductive Backdoors

11 Dec 2025 18:18 UTC
88 points
0 comments
8 min read

The tree, the fly, the ant, the dog, the farmer and the businessman

Alexandre Variengien
11 Dec 2025 17:56 UTC
13 points
2 comments
5 min read
(alexandrevariengien.com)

Ships in the Night – A Short Story

Dhruv Sumathi
11 Dec 2025 17:11 UTC
7 points
0 comments
29 min read

My AGI safety research—2025 review, ’26 plans

Steven Byrnes
11 Dec 2025 17:05 UTC
87 points
1 comment
12 min read

Thinking through a lens of physiology

Vadim Golub
11 Dec 2025 16:55 UTC
1 point
0 comments
7 min read

If Anyone Builds It Everyone Dies, another semi-outsider review

manueldelrio
11 Dec 2025 15:43 UTC
50 points
6 comments
8 min read

North Sentinelese Post-Singularity

Cleo Nardo
11 Dec 2025 14:57 UTC
52 points
21 comments
1 min read

Sea snails in a cocaine vaccine

Alexandre Variengien
11 Dec 2025 14:22 UTC
9 points
0 comments
2 min read

Resources for parents

Viliam
11 Dec 2025 10:46 UTC
17 points
7 comments
2 min read

Steganographic Chains of Thought Are Low-Probability but High-Stakes: Evidence and Arguments

artkpv
11 Dec 2025 7:40 UTC
6 points
1 comment
6 min read

Brain-inspired LLM alignment

mtaran
11 Dec 2025 3:08 UTC
11 points
1 comment
3 min read

Seven Perspectives on LLMs

GenericModel
11 Dec 2025 2:11 UTC
12 points
1 comment
12 min read
(enrichedjamsham.substack.com)

Some evidence against the idea strange CoT stems from incentives to compress language

williawa
10 Dec 2025 22:43 UTC
12 points
0 comments
2 min read

Rock Paper Scissors is Not Solved, In Practice

Linch
10 Dec 2025 21:37 UTC
45 points
11 comments
9 min read
(inchpin.substack.com)