Why your sports car isn’t a racecar (tradeoffs everywhere)

Ruby · 22 Nov 2025 23:23 UTC
29 points
0 comments · 5 min read · LW link

Assorted Thoughts on “Pivoting” to AI

Trevor Hill-Hand · 22 Nov 2025 21:17 UTC
12 points
1 comment · 4 min read · LW link

OpenAI Locks Down San Francisco Offices Following Alleged Threat From Activist

Matrice Jacobine · 22 Nov 2025 19:33 UTC
40 points
0 comments · 4 min read · LW link
(www.wired.com)

Sorry, I still think kidney donation makes no sense for an EA

nicholashalden · 22 Nov 2025 18:10 UTC
6 points
4 comments · 1 min read · LW link
(substack.com)

Automatic alt text generation

TurnTrout · 22 Nov 2025 17:57 UTC
27 points
1 comment · 1 min read · LW link
(turntrout.com)

My frustrations: AI doom

Dentosal · 22 Nov 2025 14:59 UTC
2 points
0 comments · 2 min read · LW link

Introspection in LLMs: A Proposal For How To Think About It, And Test For It

Christopher Ackerman · 22 Nov 2025 14:52 UTC
23 points
4 comments · 7 min read · LW link

AI Red Lines: A Research Agenda

Charbel-Raphaël · 22 Nov 2025 8:41 UTC
30 points
1 comment · 5 min read · LW link

Book Review: Wizard’s Hall

Screwtape · 22 Nov 2025 7:38 UTC
96 points
4 comments · 5 min read · LW link

Be Naughty

habryka · 22 Nov 2025 6:35 UTC
99 points
11 comments · 4 min read · LW link

Market Logic I

abramdemski · 22 Nov 2025 6:01 UTC
36 points
2 comments · 5 min read · LW link

The AI 2027 Report Is Not Backed Up by Evidence

Oscar Davies · 22 Nov 2025 5:23 UTC
−17 points
9 comments · 4 min read · LW link

LLM Systems for Literature-Based Scientific Discovery

Carly Turini · 22 Nov 2025 4:48 UTC
1 point
0 comments · 1 min read · LW link

Animal welfare concerns are dominated by post-ASI futures

RobertM · 22 Nov 2025 4:08 UTC
28 points
1 comment · 4 min read · LW link

Habitual mental motions might explain why people are content to get old and die

Ruby · 22 Nov 2025 2:52 UTC
19 points
1 comment · 7 min read · LW link

D&D.Sci Thanksgiving: the Festival Feast

aphyer · 22 Nov 2025 2:26 UTC
41 points
15 comments · 2 min read · LW link

Diplomacy during AI takeoff

Nikola Jurkovic · 22 Nov 2025 2:12 UTC
18 points
3 comments · 2 min read · LW link
(nikolajurkovic.substack.com)

Abstract advice to researchers tackling the difficult core problems of AGI alignment

TsviBT · 22 Nov 2025 0:53 UTC
130 points
10 comments · 8 min read · LW link

Easy Opportunity to Help Many Animals

Bentham's Bulldog · 21 Nov 2025 23:03 UTC
10 points
0 comments · 1 min read · LW link

Why Not Just Train For Interpretability?

johnswentworth · 21 Nov 2025 22:08 UTC
56 points
12 comments · 4 min read · LW link

Complaining about my inability to focus on uninteresting things

Dentosal · 21 Nov 2025 20:34 UTC
5 points
3 comments · 2 min read · LW link

Models not making it clear when they’re roleplaying seems like a fairly big issue

williawa · 21 Nov 2025 20:23 UTC
16 points
3 comments · 6 min read · LW link

Natural Emergent Misalignment from Reward Hacking

Algon · 21 Nov 2025 20:20 UTC
12 points
0 comments · 3 min read · LW link
(www.anthropic.com)

Natural emergent misalignment from reward hacking in production RL

21 Nov 2025 20:00 UTC
258 points
32 comments · 9 min read · LW link

Eight Heuristics of Anti-Epistemology

Ben Pace · 21 Nov 2025 19:54 UTC
44 points
2 comments · 6 min read · LW link

We won’t solve post-alignment problems by doing research

MichaelDickens · 21 Nov 2025 18:03 UTC
24 points
11 comments · 4 min read · LW link

Can Artificial Intelligence Be Conscious?

Bentham's Bulldog · 21 Nov 2025 16:43 UTC
15 points
5 comments · 7 min read · LW link

Gemini 3: Model Card and Safety Framework Report

Zvi · 21 Nov 2025 16:40 UTC
33 points
0 comments · 11 min read · LW link
(thezvi.wordpress.com)

Lorxus Does Halfhaven: 11/15~11/21

Lorxus · 21 Nov 2025 16:07 UTC
7 points
0 comments · 1 min read · LW link
(tiled-with-pentagons.blogspot.com)

EA Hotel Solstice

plex · 21 Nov 2025 15:13 UTC
8 points
0 comments · 1 min read · LW link

Why Does Empathy Have an Off-Switch?

J Bostock · 21 Nov 2025 14:56 UTC
9 points
1 comment · 7 min read · LW link

What Do We Tell the Humans? Errors, Hallucinations, and Lies in the AI Village

Shoshannah Tekofsky · 21 Nov 2025 14:19 UTC
56 points
0 comments · 9 min read · LW link

URGENT @everyone—help us kill AI preemption (again) before this Friday

21 Nov 2025 12:51 UTC
−1 points
0 comments · 1 min read · LW link

Should I Apply to a 3.5% Acceptance-Rate Fellowship? A Simple EV Calculator

Tobias H · 21 Nov 2025 10:59 UTC
16 points
0 comments · 5 min read · LW link

Towards Humanist Superintelligence

Chris_Leong · 21 Nov 2025 10:22 UTC
17 points
3 comments · 1 min read · LW link
(microsoft.ai)

16 Writing Tips from Inkhaven

dreeves · 21 Nov 2025 7:49 UTC
13 points
1 comment · 2 min read · LW link

Reading My Diary: 10 Years Since CFAR

Ben Pace · 21 Nov 2025 7:27 UTC
71 points
1 comment · 6 min read · LW link

The Worrying Nature of Akrasia

Notelrac · 21 Nov 2025 7:00 UTC
2 points
0 comments · 4 min read · LW link

10 Key Insights from the “Frontier AI Risk Monitoring Platform”

Weibing Wang · 21 Nov 2025 6:07 UTC
3 points
0 comments · 2 min read · LW link

Contra Collisteru: You Get About One Carthage

Screwtape · 21 Nov 2025 5:33 UTC
36 points
2 comments · 5 min read · LW link

Infinitesimally False

21 Nov 2025 4:57 UTC
55 points
16 comments · 12 min read · LW link

Preferences are confusing

RobertM · 21 Nov 2025 3:07 UTC
28 points
1 comment · 2 min read · LW link

Can questions rigidly designate intentions?

Mason Broxham · 21 Nov 2025 2:00 UTC
1 point
0 comments · 5 min read · LW link

Week 3: Adversarial Robustness

Ely Hahami · 21 Nov 2025 1:43 UTC
1 point
0 comments · 3 min read · LW link

Informed Consent as the Sole Criterion for Medical Treatment

Character#2736 · 21 Nov 2025 1:39 UTC
7 points
2 comments · 4 min read · LW link

Suicide Prevention Ought To Be Illegal

Character#2736 · 21 Nov 2025 1:39 UTC
−17 points
17 comments · 6 min read · LW link

How you got RL’d into your idiosyncratic cognition

Ruby · 21 Nov 2025 1:06 UTC
16 points
6 comments · 6 min read · LW link

PSA: For Chronic Infections, Check Teeth

Algon · 20 Nov 2025 23:14 UTC
15 points
2 comments · 1 min read · LW link

[Paper] Output Supervision Can Obfuscate the CoT

20 Nov 2025 22:41 UTC
92 points
3 comments · 5 min read · LW link
(arxiv.org)

The Boring Part of Bell Labs

Elizabeth · 20 Nov 2025 22:40 UTC
133 points
0 comments · 15 min read · LW link
(acesounderglass.com)