All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun Jul Aug Sep Oct NovDec

All 1 2 3 4 5 6 7 8 9 10 111213 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

ASI Already Knows About Torture—In Defense of Talking Openly About S-Risks

KatWoods11 Dec 2025 21:15 UTC

−9 points

0 comments2 min readLW link

Cognitive Tech from Algorithmic Information Theory

Cole Wyeth11 Dec 2025 20:32 UTC

43 points

9 comments1 min readLW link

Announcing Progress in Medicine, a high school summer career exploration program

jasoncrawford11 Dec 2025 18:33 UTC

8 points

0 comments3 min readLW link

(rootsofprogress.org)

Weird Generalization & Inductive Backdoors

Jorio Cocola, Owain_Evans and Dylan Feng

11 Dec 2025 18:18 UTC

153 points

8 comments8 min readLW link

The tree, the fly, the ant, the dog, the farmer and the businessman

Alexandre Variengien11 Dec 2025 17:56 UTC

14 points

2 comments5 min readLW link

(alexandrevariengien.com)

Ships in the Night – A Short Story

Dhruv Sumathi11 Dec 2025 17:11 UTC

15 points

0 comments29 min readLW link

My AGI safety research—2025 review, ’26 plans

Steven Byrnes11 Dec 2025 17:05 UTC

137 points

4 comments12 min readLW link

Thinking through a lens of physiology

Vadim Golub11 Dec 2025 16:55 UTC

1 point

0 comments7 min readLW link

If Anyone Builds It Everyone Dies, another semi-outsider review

manueldelrio11 Dec 2025 15:43 UTC

50 points

18 comments8 min readLW link

North Sentinelese Post-Singularity

Cleo Nardo11 Dec 2025 14:57 UTC

78 points

40 comments1 min readLW link

Flock – work in public with friends (beta testers wanted)

henryaj11 Dec 2025 14:23 UTC

4 points

0 comments1 min readLW link

AI #146: Chipping In

Zvi11 Dec 2025 14:22 UTC

42 points

6 comments44 min readLW link

(thezvi.wordpress.com)

Sea snails in a cocaine vaccine

Alexandre Variengien11 Dec 2025 14:22 UTC

16 points

0 comments2 min readLW link

Resources for parents

Viliam11 Dec 2025 10:46 UTC

21 points

9 comments2 min readLW link

Steganographic Chains of Thought Are Low-Probability but High-Stakes: Evidence and Arguments

Artem Karpov11 Dec 2025 7:40 UTC

20 points

1 comment6 min readLW link

Systems Analysis: AI Alignment and the Principal-Agent Problem

NelsonDP11 Dec 2025 3:48 UTC

1 point

0 comments24 min readLW link

Brain-inspired LLM alignment

mtaran11 Dec 2025 3:08 UTC

13 points

1 comment3 min readLW link

Seven Perspectives on LLMs

GenericModel11 Dec 2025 2:11 UTC

20 points

1 comment12 min readLW link

(enrichedjamsham.substack.com)

MIRI Comms is hiring

Duncan Sabien (Inactive)11 Dec 2025 0:46 UTC

81 points

0 comments3 min readLW link

Some evidence against the idea strange CoT stems from incentives to compress language

williawa10 Dec 2025 22:43 UTC

17 points

0 comments2 min readLW link

Follow-through on Bay Solstice

Raemon10 Dec 2025 22:07 UTC

106 points

22 comments6 min readLW link

Rock Paper Scissors is Not Solved, In Practice

Linch10 Dec 2025 21:37 UTC

59 points

13 comments9 min readLW link

(inchpin.substack.com)

Childhood and Education #15: Got To Get Out

Zvi10 Dec 2025 21:31 UTC

49 points

3 comments26 min readLW link

(thezvi.wordpress.com)

Apply to ESPR & PAIR 2026, Rationality and AI Camps for Ages 16-21

Stag10 Dec 2025 19:39 UTC

25 points

0 comments1 min readLW link

Evaluation as a (Cooperation-Enabling?) Tool

VojtaKovarik10 Dec 2025 18:54 UTC

18 points

0 comments28 min readLW link

Consider calling the NY governor about the RAISE Act

thenoviceoof10 Dec 2025 18:47 UTC

16 points

0 comments11 min readLW link

No ghost in the machine

fin10 Dec 2025 18:35 UTC

10 points

5 comments45 min readLW link

(finmoorhouse.com)

Most Algorithmic Progress is Data Progress [Linkpost]

Noosphere8910 Dec 2025 17:48 UTC

36 points

9 comments5 min readLW link

(www.beren.io)

Fibonacci Holds Information

milanrosko10 Dec 2025 17:16 UTC

11 points

2 comments2 min readLW link

Register for SPAR Demo Day on Saturday, Dec 13

Topaz and agucova

10 Dec 2025 16:58 UTC

7 points

0 comments1 min readLW link

We don’t know what most microbial genes do. Can genomic language models help?

Abhishaike Mahajan10 Dec 2025 16:04 UTC

19 points

0 comments1 min readLW link

Artifacts I’d like to try

Alexandre Variengien10 Dec 2025 14:16 UTC

15 points

5 comments6 min readLW link

(alexandrevariengien.com)

AI Safety – Analyse Affordances

atharva10 Dec 2025 14:09 UTC

3 points

0 comments2 min readLW link

An Approach for Evaluating Self-Boundary Consistency in AI Systems

Anurag 10 Dec 2025 13:57 UTC

3 points

0 comments6 min readLW link

Caesar Derangement Syndrome

GenericModel10 Dec 2025 13:04 UTC

−6 points

3 comments6 min readLW link

(enrichedjamsham.substack.com)

Living on a ball of hair

Alexandre Variengien10 Dec 2025 7:38 UTC

4 points

0 comments1 min readLW link

(alexandrevariengien.com)

The funding conversation we left unfinished

jenn10 Dec 2025 2:17 UTC

151 points

3 comments3 min readLW link

[Question] Do you expect the first AI to cross NY’s RAISE Act’s “Critical Harm” threshold to be contained?

Josh Snider10 Dec 2025 1:04 UTC

4 points

0 comments1 min readLW link

TT Self Study Journal # 5

TristanTrim9 Dec 2025 22:16 UTC

4 points

2 comments5 min readLW link

Lorxus Does Halfhaven: 11/29, 11/30, Highlights, Postmortem

Lorxus9 Dec 2025 21:00 UTC

6 points

0 comments3 min readLW link

(tiled-with-pentagons.blogspot.com)

Tristan’s list of things to write

TristanTrim9 Dec 2025 20:28 UTC

5 points

21 comments1 min readLW link

Tate Modern 2150

GenericModel9 Dec 2025 19:15 UTC

15 points

2 comments9 min readLW link

(enrichedjamsham.substack.com)

Selling H200s to China Is Unwise and Unpopular

Zvi9 Dec 2025 19:11 UTC

47 points

3 comments13 min readLW link

(thezvi.wordpress.com)

Non-optimized beauty

Alexandre Variengien9 Dec 2025 19:04 UTC

7 points

0 comments3 min readLW link

(alexandrevariengien.com)

Auditing Games for Sandbagging [paper]

Jordan Taylor and Joseph Bloom

9 Dec 2025 18:37 UTC

103 points

4 comments10 min readLW link

A Catalog of AI Evaluations

Anurag 9 Dec 2025 17:05 UTC

2 points

0 comments1 min readLW link

Insights into Claude Opus 4.5 from Pokémon

Julian Bradshaw9 Dec 2025 16:57 UTC

222 points

24 comments10 min readLW link

Localizing Finetuned Information in Transformers with Dynamic Weight Grafting

toddknife9 Dec 2025 16:20 UTC

6 points

0 comments5 min readLW link

Gradual Disempowerment Monthly Roundup #3

Raymond Douglas9 Dec 2025 16:02 UTC

49 points

0 comments4 min readLW link

Every house has a chemistry lab

Alexandre Variengien9 Dec 2025 14:17 UTC

5 points

0 comments1 min readLW link

(alexandrevariengien.com)