[Event] Build­ing What the Fu­ture Needs: A cu­rated con­fer­ence in Ber­lin (Sep 6, 2025) for high-im­pact builders and researchers

Vasilii Kondyrev8 Aug 2025 23:08 UTC
7 points
0 comments2 min readLW link

Me­mory De­cod­ing Jour­nal Club: The den­dritic engram

Devin Ward8 Aug 2025 22:08 UTC
1 point
0 comments1 min readLW link

Mak­ing Sense of Con­scious­ness Part 4: States of Consciousness

sarahconstantin8 Aug 2025 21:21 UTC
8 points
0 comments5 min readLW link
(sarahconstantin.substack.com)

What would a hu­man pre­tend­ing to be an AI say?

Brendan Long8 Aug 2025 18:56 UTC
53 points
18 comments1 min readLW link
(www.brendanlong.com)

Will morally mo­ti­vated ac­tors steer us to­wards a near-best fu­ture?

wdmacaskill8 Aug 2025 18:32 UTC
22 points
0 comments4 min readLW link

How hard to achieve is eu­topia?

wdmacaskill8 Aug 2025 16:16 UTC
22 points
0 comments7 min readLW link

OpenAI’s GPT-OSS Is Already Old News

Zvi8 Aug 2025 12:20 UTC
39 points
4 comments18 min readLW link
(thezvi.wordpress.com)

Ex­tract-and-Eval­u­ate Mon­i­tor­ing Can Sig­nifi­cantly En­hance CoT Mon­i­tor Perfor­mance (Re­search Note)

8 Aug 2025 10:41 UTC
51 points
7 comments10 min readLW link

The Tor­toise and the Lan­guage Model (A Fable After Hofs­tadter)

mwatkins8 Aug 2025 10:39 UTC
54 points
4 comments3 min readLW link

Closed Mouth, Open Oppurtunities

CstineSublime8 Aug 2025 10:32 UTC
6 points
0 comments4 min readLW link

How an­ti­ci­pa­tory cover-ups go wrong

Kaj_Sotala8 Aug 2025 10:26 UTC
295 points
25 comments6 min readLW link

Strate­gic Moder­a­tion Goals (a Plan B to AI al­ign­ment)

Jim Buhler8 Aug 2025 8:08 UTC
2 points
0 comments3 min readLW link

METR’s Eval­u­a­tion of GPT-5

GradientDissenter7 Aug 2025 22:17 UTC
141 points
15 comments20 min readLW link
(metr.github.io)

ChatGPT is the Da­guerreo­type of AI

Alex_Altair7 Aug 2025 22:14 UTC
42 points
2 comments7 min readLW link

Prin­ci­ples of AI Uncontrollability

WillPetillo7 Aug 2025 21:10 UTC
1 point
0 comments7 min readLW link

Third-or­der cog­ni­tion as a model of su­per­in­tel­li­gence (iron­i­cally: Meta® metacog­ni­tion)

soycarts7 Aug 2025 20:56 UTC
2 points
5 comments13 min readLW link

Yes, Ra­tion­al­ism is a Cult

James Camacho7 Aug 2025 20:43 UTC
−14 points
23 comments4 min readLW link

GPT-5 is out

david reinstein7 Aug 2025 20:33 UTC
4 points
0 comments1 min readLW link
(openai.com)

OpenAI Re­leases GPT-5

anaguma7 Aug 2025 18:41 UTC
18 points
0 comments1 min readLW link
(openai.com)

Balanc­ing ex­plo­ra­tion and re­sis­tance to memetic threats af­ter AGI

Eric Neyman7 Aug 2025 18:03 UTC
26 points
5 comments5 min readLW link

state of the machine

thiccythot7 Aug 2025 17:50 UTC
21 points
5 comments6 min readLW link

Chron­i­cles of the Gen­tle Sin­gu­lar­ity: A Short Story

Ihor Kendiukhov7 Aug 2025 13:50 UTC
21 points
0 comments4 min readLW link

AI #128: Four Hours Un­til Prob­a­bly Not The Apocalypse

Zvi7 Aug 2025 13:00 UTC
34 points
5 comments65 min readLW link
(thezvi.wordpress.com)

No One is Really Working

Annapurna7 Aug 2025 11:19 UTC
5 points
9 comments1 min readLW link
(www.humaninvariant.com)

[Question] An­thropic Is Go­ing All In On Abil­ity Without In­tel­li­gence?

Chapin Lenthall-Cleary7 Aug 2025 5:54 UTC
2 points
0 comments2 min readLW link

Civil Ser­vice: a Vic­tim or a Villain?

Martin Sustrik7 Aug 2025 5:50 UTC
67 points
27 comments4 min readLW link
(www.250bpm.com)

AXRP Epi­sode 46 - Tom David­son on AI-en­abled Coups

DanielFilan7 Aug 2025 5:10 UTC
11 points
0 comments68 min readLW link

A Cheeky Pint with An­thropic CEO Dario Amodei

WilliamKiely7 Aug 2025 3:21 UTC
10 points
3 comments1 min readLW link

Re­pro­duc­ing Ab­solute Zero

Lucy Wingard7 Aug 2025 3:01 UTC
5 points
1 comment4 min readLW link

In­ter­view with Kel­sey Piper on Self-Cen­sor­ship and the Vibe Shift

Zack_M_Davis7 Aug 2025 2:51 UTC
57 points
1 comment15 min readLW link
(unremediatedgender.space)

Forbes: Fear Of Su­per In­tel­li­gent AI Is Driv­ing Har­vard And MIT Stu­dents To Drop Out

Nikola Jurkovic7 Aug 2025 2:02 UTC
19 points
0 comments1 min readLW link
(www.forbes.com)

Open weights != Open source

martinkunev7 Aug 2025 1:04 UTC
0 points
8 comments3 min readLW link

No, Ra­tion­al­ism Is Not a Cult

Liam Robins7 Aug 2025 0:39 UTC
22 points
18 comments10 min readLW link
(thelimestack.substack.com)

Cri­tiquing the Dun­ning-Kruger Effect

Jennifer Young7 Aug 2025 0:36 UTC
0 points
0 comments1 min readLW link

Re: re­cent An­thropic safety research

Eliezer Yudkowsky6 Aug 2025 22:52 UTC
145 points
22 comments5 min readLW link
(x.com)

It’s Owl in the Num­bers: To­ken En­tan­gle­ment in Sublimi­nal Learning

6 Aug 2025 22:18 UTC
38 points
7 comments4 min readLW link

[Question] In­scrutabil­ity was always in­evitable, right?

Steven Byrnes6 Aug 2025 21:57 UTC
99 points
33 comments2 min readLW link

Claude, GPT, and Gem­ini All Strug­gle to Evade Monitors

6 Aug 2025 20:28 UTC
61 points
3 comments5 min readLW link

Opus 4.1 Is An In­cre­men­tal Improvement

Zvi6 Aug 2025 19:50 UTC
46 points
1 comment6 min readLW link
(thezvi.wordpress.com)

My Mis­take, Your Problem

Gordon Seidoh Worley6 Aug 2025 17:41 UTC
9 points
0 comments4 min readLW link
(uncertainupdates.substack.com)

[Question] How use­ful could stolen AI model weights be with­out know­ing the ar­chi­tec­ture and ac­ti­va­tion func­tions?

Jemal Young6 Aug 2025 17:36 UTC
6 points
5 comments1 min readLW link

Statis­ti­cal sug­ges­tions for mech in­terp re­search and beyond

Paul Bogdan6 Aug 2025 12:45 UTC
62 points
4 comments15 min readLW link

In­ves­ti­gat­ing In­ter­nal Rep­re­sen­ta­tions of Cor­rect­ness in SONAR Text Autoencoders

6 Aug 2025 12:13 UTC
5 points
0 comments7 min readLW link

How hard to achieve is eu­topia?

wdmacaskill6 Aug 2025 11:02 UTC
17 points
2 comments7 min readLW link

Love, Lies and Misalignment

Priyanka Bharadwaj6 Aug 2025 9:44 UTC
6 points
1 comment3 min readLW link

My cur­rent guess at the effect of AI au­toma­tion on jobs

sortega6 Aug 2025 8:17 UTC
16 points
6 comments2 min readLW link

Zoom Out: Distri­bu­tions in Se­man­tic Spaces

TristanTrim6 Aug 2025 0:01 UTC
14 points
4 comments4 min readLW link

An opinionated guide to build­ing a good to-do system

bilalchughtai5 Aug 2025 23:00 UTC
23 points
7 comments8 min readLW link
(bilalchughtai.co.uk)

Good Ideas Aren’t Enough in AI Policy

Andersehen5 Aug 2025 22:38 UTC
12 points
0 comments5 min readLW link

The Problem

5 Aug 2025 21:40 UTC
313 points
218 comments26 min readLW link