In­so­far As I Think LLMs “Don’t Really Un­der­stand Things”, What Do I Mean By That?

johnswentworth8 Nov 2025 23:37 UTC
90 points
15 comments3 min readLW link

Why AC is cheap, but AC re­pair is a luxury

Annapurna8 Nov 2025 23:01 UTC
3 points
0 comments1 min readLW link
(a16z.substack.com)

My­opia Mythology

abramdemski8 Nov 2025 22:22 UTC
38 points
3 comments3 min readLW link

Om­nis­cal­ing to MNIST

cloud8 Nov 2025 19:42 UTC
100 points
3 comments10 min readLW link

Can Models be Eval­u­a­tion Aware Without Ex­plicit Ver­bal­iza­tion?

8 Nov 2025 18:26 UTC
26 points
10 comments8 min readLW link

Cake vs Lack of Cake

Notelrac8 Nov 2025 18:10 UTC
1 point
0 comments2 min readLW link

Un­ex­pected Things that are People

Ben Goldhaber8 Nov 2025 17:12 UTC
209 points
11 comments4 min readLW link

A hu­man­ist cri­tique of tech­nolog­i­cal determinism

8 Nov 2025 15:27 UTC
10 points
0 comments6 min readLW link

Five very good rea­sons to not write down liter­ally ev­ery sin­gle thought you have

ceselder8 Nov 2025 10:22 UTC
18 points
2 comments4 min readLW link

Re­view: Par­si­fal at the SF Opera

Adam Scherlis8 Nov 2025 8:25 UTC
10 points
0 comments6 min readLW link
(adam.scherl.is)

Es­ca­la­tion and per­cep­tion

TsviBT8 Nov 2025 8:12 UTC
69 points
0 comments12 min readLW link

The Snaw

Screwtape8 Nov 2025 6:42 UTC
23 points
5 comments2 min readLW link

Au­gus­tine of Hippo’s Hand­book on Faith, Hope, and Love in Latin (or: Claude as Pan­doc++)

DanielFilan8 Nov 2025 6:31 UTC
8 points
2 comments1 min readLW link
(danielfilan.com)

Com­par­ing Payor & Löb

abramdemski8 Nov 2025 5:40 UTC
54 points
1 comment3 min readLW link

Mourn­ing a life with­out AI

Nikola Jurkovic8 Nov 2025 4:44 UTC
194 points
63 comments6 min readLW link
(nikolajurkovic.substack.com)

Two Times I Was Sur­prised By My Own Values

johnswentworth8 Nov 2025 3:56 UTC
13 points
1 comment3 min readLW link

The solu­tion to akra­sia ap­par­ently isn’t not hav­ing any goals

Dentosal8 Nov 2025 3:53 UTC
1 point
0 comments3 min readLW link

An­thropic & Dario’s dream

Simon Lermen8 Nov 2025 1:19 UTC
55 points
1 comment5 min readLW link

Against “You can just do things”

zroe18 Nov 2025 0:58 UTC
61 points
9 comments3 min readLW link

Agent Foun­da­tions: Paradig­ma­tiz­ing in Math and Science

TristanTrim8 Nov 2025 0:37 UTC
3 points
0 comments9 min readLW link

From Ther­mo­dy­nam­ics to Sora: A Com­pre­hen­sive In­tro­duc­tion to Denois­ing Diffu­sion for Video Generation

phenomanon7 Nov 2025 23:36 UTC
5 points
0 comments15 min readLW link

Pythia

plex7 Nov 2025 23:31 UTC
99 points
31 comments4 min readLW link

Start an AI safety group with the Path­fin­der Fellowship

Topaz7 Nov 2025 21:05 UTC
2 points
0 comments1 min readLW link

AI is not in­evitable.

David Scott Krueger7 Nov 2025 20:31 UTC
29 points
2 comments3 min readLW link
(therealartificialintelligence.substack.com)

An­nounc­ing “Com­pu­ta­tional Func­tion­al­ism De­bate” (so­lic­it­ing paid feed­back): Test your in­tu­itions about consciousness

ChrisPercy7 Nov 2025 20:12 UTC
4 points
0 comments3 min readLW link

The Hawley-Blu­men­thal AI Risk Eval­u­a­tion Act

David Abecassis7 Nov 2025 19:09 UTC
42 points
0 comments2 min readLW link
(techgov.intelligence.org)

Sec­u­lar Sols­tice Roundup 2025

datawitch7 Nov 2025 19:03 UTC
14 points
4 comments1 min readLW link

The Decalogue For Aligned AI.

theophilus tabuke7 Nov 2025 18:47 UTC
1 point
0 comments1 min readLW link

An­a­lyt­i­cal Val­i­da­tion of Bio­mark­ers is Not the Full Story

mnarayan7 Nov 2025 18:39 UTC
9 points
0 comments2 min readLW link
(blog.neurostats.org)

A coun­try of alien idiots in a dat­a­cen­ter: AI progress and pub­lic alarm

Seth Herd7 Nov 2025 16:56 UTC
94 points
17 comments11 min readLW link

Bologna ACX/​LW meetup

Luca Petrolati7 Nov 2025 16:55 UTC
2 points
0 comments1 min readLW link

On Sam Alt­man’s Se­cond Con­ver­sa­tion with Tyler Cowen

Zvi7 Nov 2025 16:40 UTC
15 points
3 comments30 min readLW link
(thezvi.wordpress.com)

Plans to build AGI with nu­clear re­ac­tor-like safety lack ‘sys­tem­atic think­ing,’ say researchers

Mordechai Rorvig7 Nov 2025 16:25 UTC
−1 points
2 comments1 min readLW link
(www.foommagazine.org)

13 Ar­gu­ments About a Tran­si­tion to Neu­ralese AIs

Rauno Arike7 Nov 2025 16:19 UTC
49 points
14 comments10 min readLW link

Open Let­ter to Ohio House Reps

Stephen Martin7 Nov 2025 16:05 UTC
16 points
6 comments3 min readLW link

Two easy digi­tal in­ten­tion­al­ity practices

mingyuan7 Nov 2025 15:11 UTC
38 points
2 comments2 min readLW link
(mingyuan.substack.com)

Is it re­ally para­noia if I’m re­ally Out to Get Me?

Dentosal7 Nov 2025 8:28 UTC
0 points
0 comments3 min readLW link

Did you know you can just buy black­belts?

Screwtape7 Nov 2025 7:47 UTC
22 points
12 comments4 min readLW link

GPTF-8: A to­k­enizer-based char­ac­ter encoding

Adam Scherlis7 Nov 2025 7:47 UTC
7 points
0 comments3 min readLW link
(adam.scherl.is)

Cancer; A Crime Story (and other tales of op­ti­miza­tion gone wrong)

Jonas Hallgren7 Nov 2025 7:09 UTC
19 points
2 comments12 min readLW link

A scheme to credit hack policy gra­di­ent training

Adrià Garriga-alonso7 Nov 2025 6:24 UTC
15 points
0 comments5 min readLW link

[CS2881r][Week 8] When Agents Pre­fer Hack­ing To Failure: Eval­u­at­ing Misal­ign­ment Un­der Pressure

7 Nov 2025 5:45 UTC
2 points
0 comments23 min readLW link

Liber­a­tion Clippy

abramdemski7 Nov 2025 5:21 UTC
17 points
2 comments1 min readLW link

Min­i­miz­ing Loss ≠ Max­i­miz­ing Intelligence

Taylor G. Lunt7 Nov 2025 4:15 UTC
7 points
2 comments9 min readLW link

Sols­tice Sea­son 2025: Ri­tual Roundup & Megamee­tups

Raemon7 Nov 2025 3:58 UTC
58 points
16 comments3 min readLW link

My new non­profit Evitable is hiring.

David Scott Krueger7 Nov 2025 3:39 UTC
74 points
4 comments1 min readLW link

Willpower is ex­haust­ing, use con­tent blockers

mingyuan7 Nov 2025 2:20 UTC
28 points
3 comments4 min readLW link
(mingyuan.substack.com)

A re­view of MSUM’s AI In­no­va­tion Sum­mit: Day Two

Philipreal7 Nov 2025 1:10 UTC
2 points
0 comments6 min readLW link

Bru­tal­ist Prose

Sinclair Chen7 Nov 2025 0:59 UTC
8 points
0 comments3 min readLW link

Can we do use­ful meta-anal­y­sis? Un­jour­nal eval­u­a­tions of “Mean­ingfully re­duc­ing con­sump­tion of meat… is an un­solved prob­lem...”

david reinstein7 Nov 2025 0:40 UTC
2 points
0 comments1 min readLW link
(forum.effectivealtruism.org)