En­ergy and Ingenuity

datawitch21 Dec 2025 22:22 UTC
9 points
0 comments7 min readLW link

Small Models Can In­tro­spect, Too

vgel21 Dec 2025 22:20 UTC
124 points
8 comments4 min readLW link
(vgel.me)

Two No­tions of a Goal: Tar­get States vs. Suc­cess Metrics

paul_dfr21 Dec 2025 21:28 UTC
10 points
0 comments7 min readLW link

What’s the Cur­rent Stock Mar­ket Bub­ble?

PeterMcCluskey21 Dec 2025 20:08 UTC
48 points
4 comments2 min readLW link
(bayesianinvestor.com)

EA Yale Destiny De­bate Dis­cus­sion:

Nathan Young21 Dec 2025 19:10 UTC
10 points
11 comments1 min readLW link
(www.youtube.com)

Can Claude teach me to make coffee?

philh21 Dec 2025 16:23 UTC
151 points
25 comments16 min readLW link

Ret­ro­spec­tive on Copen­hagen Sec­u­lar Sols­tice 2025

Søren Elverlin21 Dec 2025 15:34 UTC
7 points
0 comments4 min readLW link

Google seem­ingly solved effi­cient attention

ceselder21 Dec 2025 13:54 UTC
26 points
4 comments4 min readLW link

Wit­ness or Wager: En­forc­ing ‘Show Your Work’ in Model Outputs

markacochran21 Dec 2025 13:12 UTC
3 points
2 comments1 min readLW link

Turn­ing 20 in the prob­a­ble pre-apoc­a­lypse

Parv Mahajan21 Dec 2025 10:14 UTC
443 points
66 comments3 min readLW link

Technoromanticism

lsusr21 Dec 2025 9:00 UTC
110 points
20 comments5 min readLW link

Anal­y­sis of Whisper-Tiny Us­ing Sparse Autoencoders

Omar Khursheed21 Dec 2025 8:44 UTC
8 points
0 comments4 min readLW link

Align­ment Pre­train­ing: AI Dis­course Causes Self-Fulfilling (Mis)alignment

21 Dec 2025 0:53 UTC
201 points
25 comments9 min readLW link

The un­rea­son­able deep­ness of num­ber theory

OhadA20 Dec 2025 22:16 UTC
65 points
6 comments9 min readLW link

Digi­tal in­ten­tion­al­ity: What’s the point?

mingyuan20 Dec 2025 21:46 UTC
45 points
7 comments3 min readLW link
(mingyuan.substack.com)

Con­tra­dict my take on OpenPhil’s past AI beliefs

Eliezer Yudkowsky20 Dec 2025 21:15 UTC
197 points
94 comments3 min readLW link

Why the al­chemists couldn’t build rockets

Garrett Baker20 Dec 2025 20:25 UTC
17 points
1 comment2 min readLW link

Ex­per­i­ments to un­der­stand Sin­gu­lar Learn­ing The­ory’s Free En­ergy & Lo­cal Learn­ing Coeffi­cient (LLC)

anish-lakkapragada20 Dec 2025 17:38 UTC
7 points
0 comments6 min readLW link

Chain-of-Thought as Con­tex­tual Sta­bi­liza­tion and As­so­ci­a­tive Retrieval

Aditya Raj20 Dec 2025 17:32 UTC
5 points
1 comment6 min readLW link

How to game the METR plot

shash4220 Dec 2025 13:46 UTC
241 points
32 comments5 min readLW link

No God Can Help You

Ape in the coat20 Dec 2025 8:32 UTC
37 points
0 comments3 min readLW link
(apeinthecoat102771.substack.com)

Claude Opus 4.5 Achieves 50%-Time Hori­zon Of Around 4 hrs 49 Mins

Michaël Trazzi20 Dec 2025 7:13 UTC
92 points
14 comments1 min readLW link

Show LW: Align­ment Scry

Xyra Sinclair20 Dec 2025 2:48 UTC
17 points
4 comments2 min readLW link

Opinionated Takes on Mee­tups Organizing

jenn20 Dec 2025 0:17 UTC
251 points
34 comments9 min readLW link

A Full Epistemic Stack: Knowl­edge Com­mons for the 21st Century

19 Dec 2025 22:48 UTC
46 points
7 comments11 min readLW link
(www.oliversourbut.net)

Opinion Fuzzing: A Pro­posal for Re­duc­ing & Ex­plor­ing Var­i­ance in LLM Judg­ments Via Sampling

ozziegooen19 Dec 2025 21:41 UTC
11 points
0 comments5 min readLW link

Progress links and short notes, 2025-12-19

jasoncrawford19 Dec 2025 19:44 UTC
8 points
0 comments6 min readLW link
(newsletter.rootsofprogress.org)

Linch’s Top Inkhaven Posts and Reflections

Linch19 Dec 2025 19:40 UTC
38 points
0 comments9 min readLW link
(linch.substack.com)

When Were Things The Best?

Zvi19 Dec 2025 18:00 UTC
62 points
16 comments15 min readLW link
(thezvi.wordpress.com)

Re­sponse to In­tro­spec­tive Aware­ness research

maddi19 Dec 2025 17:23 UTC
6 points
0 comments9 min readLW link

SPAR Spring 2026: 130+ re­search pro­jects now ac­cept­ing applications

agucova19 Dec 2025 14:23 UTC
22 points
0 comments2 min readLW link

Space view

kapedalex19 Dec 2025 14:20 UTC
5 points
0 comments6 min readLW link

Digi­tal Minds in 2025: A Year in Review

19 Dec 2025 14:18 UTC
16 points
0 comments21 min readLW link
(digitalminds.substack.com)

Scratchpad

Karthik Tadepalli19 Dec 2025 14:15 UTC
12 points
0 comments4 min readLW link

AI Safety has a scal­ing problem

beyarkay (Boyd Kane)19 Dec 2025 13:58 UTC
34 points
10 comments4 min readLW link

When Are Con­ceal­ment Fea­tures Learned? And Does the Model Know Who’s Watch­ing?

James Hoffend19 Dec 2025 8:19 UTC
13 points
1 comment6 min readLW link

2025-Era “Re­ward Hack­ing” Does Not Show that Re­ward Is the Op­ti­miza­tion Target

TurnTrout19 Dec 2025 6:09 UTC
49 points
9 comments7 min readLW link
(turntrout.com)

Wuck­les!

Raemon19 Dec 2025 3:08 UTC
64 points
15 comments2 min readLW link

Eval­u­a­tion Aware­ness Scales Pre­dictably in Open-Weights Large Lan­guage Models

Maheep Chaudhary19 Dec 2025 2:47 UTC
21 points
0 comments6 min readLW link

A name for the things that AI com­pa­nies are building

DirectedEvolution19 Dec 2025 2:07 UTC
28 points
9 comments4 min readLW link

I made Geneguessr

Brinedew19 Dec 2025 1:55 UTC
35 points
2 comments1 min readLW link

In defence of the hu­man agency: “Cur­ing Cancer” is the new “Think of the Chil­dren”

Rajmohan H19 Dec 2025 0:03 UTC
27 points
9 comments3 min readLW link

Help keep AI un­der hu­man con­trol: Pal­isade Re­search 2026 fundraiser

18 Dec 2025 23:41 UTC
105 points
66 comments6 min readLW link

OpenAI: Sidestep­ping Eval­u­a­tion Aware­ness and An­ti­ci­pat­ing Misal­ign­ment with Pro­duc­tion Evaluations

18 Dec 2025 22:55 UTC
25 points
1 comment1 min readLW link
(alignment.openai.com)

Scal­able End-to-End Interpretability

jsteinhardt18 Dec 2025 22:37 UTC
120 points
3 comments3 min readLW link

My Trip to NeurIPS 2025

Adam Newgas18 Dec 2025 22:31 UTC
15 points
0 comments4 min readLW link
(www.boristhebrave.com)

Lead­ing by example

martinkunev18 Dec 2025 20:30 UTC
3 points
2 comments3 min readLW link

Ac­ti­va­tion Or­a­cles: Train­ing and Eval­u­at­ing LLMs as Gen­eral-Pur­pose Ac­ti­va­tion Explainers

18 Dec 2025 20:21 UTC
154 points
11 comments8 min readLW link
(arxiv.org)

A Study Of Instinct

LoganStrohl18 Dec 2025 20:19 UTC
30 points
0 comments4 min readLW link

Es­ti­mat­ing The Por­tion of In­come Con­sumed By Essen­tials Between 1985 and 2025

Mars_Will_Be_Ours18 Dec 2025 19:19 UTC
2 points
2 comments3 min readLW link
(shoutinginthedarkforest.substack.com)