All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 5 6 7 8910 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

[Event] Building What the Future Needs: A curated conference in Berlin (Sep 6, 2025) for high-impact builders and researchers

Vasilii Kondyrev8 Aug 2025 23:08 UTC

7 points

0 comments2 min readLW link

Memory Decoding Journal Club: The dendritic engram

Devin Ward8 Aug 2025 22:08 UTC

1 point

0 comments1 min readLW link

Making Sense of Consciousness Part 4: States of Consciousness

sarahconstantin8 Aug 2025 21:21 UTC

8 points

0 comments5 min readLW link

(sarahconstantin.substack.com)

What would a human pretending to be an AI say?

Brendan Long8 Aug 2025 18:56 UTC

54 points

19 comments1 min readLW link

(www.brendanlong.com)

Will morally motivated actors steer us towards a near-best future?

wdmacaskill8 Aug 2025 18:32 UTC

22 points

0 comments4 min readLW link

How hard to achieve is eutopia?

wdmacaskill8 Aug 2025 16:16 UTC

22 points

0 comments7 min readLW link

OpenAI’s GPT-OSS Is Already Old News

Zvi8 Aug 2025 12:20 UTC

40 points

4 comments18 min readLW link

(thezvi.wordpress.com)

Extract-and-Evaluate Monitoring Can Significantly Enhance CoT Monitor Performance (Research Note)

Rauno Arike, RohanS and Shubhorup Biswas

8 Aug 2025 10:41 UTC

52 points

7 comments10 min readLW link

The Tortoise and the Language Model (A Fable After Hofstadter)

mwatkins8 Aug 2025 10:39 UTC

55 points

4 comments3 min readLW link

Closed Mouth, Open Oppurtunities

CstineSublime8 Aug 2025 10:32 UTC

6 points

0 comments4 min readLW link

How anticipatory cover-ups go wrong

Kaj_Sotala8 Aug 2025 10:26 UTC

304 points

25 comments6 min readLW link

Strategic Moderation Goals (a Plan B to AI alignment)

Jim Buhler8 Aug 2025 8:08 UTC

2 points

0 comments3 min readLW link

Preface to “Simulacra and Simulators”

Fiora Starlight8 Aug 2025 7:38 UTC

13 points

0 comments7 min readLW link

METR’s Evaluation of GPT-5

GradientDissenter7 Aug 2025 22:17 UTC

148 points

15 comments20 min readLW link

(metr.github.io)

ChatGPT is the Daguerreotype of AI

Alex_Altair7 Aug 2025 22:14 UTC

42 points

2 comments7 min readLW link

Principles of AI Uncontrollability

WillPetillo7 Aug 2025 21:10 UTC

7 points

0 comments7 min readLW link

Third-order cognition as a model of superintelligence (ironically: Meta® metacognition)

soycarts7 Aug 2025 20:56 UTC

0 points

5 comments14 min readLW link

Yes, Rationalism is a Cult

programjames7 Aug 2025 20:43 UTC

−9 points

23 comments4 min readLW link

GPT-5 is out

david reinstein7 Aug 2025 20:33 UTC

4 points

0 comments1 min readLW link

(openai.com)

OpenAI Releases GPT-5

anaguma7 Aug 2025 18:41 UTC

18 points

0 comments1 min readLW link

(openai.com)

Balancing exploration and resistance to memetic threats after AGI

Eric Neyman7 Aug 2025 18:03 UTC

26 points

5 comments5 min readLW link

state of the machine

thiccythot7 Aug 2025 17:50 UTC

22 points

5 comments6 min readLW link

Chronicles of the Gentle Singularity: A Short Story

Ihor Kendiukhov7 Aug 2025 13:50 UTC

25 points

0 comments4 min readLW link

AI #128: Four Hours Until Probably Not The Apocalypse

Zvi7 Aug 2025 13:00 UTC

37 points

5 comments65 min readLW link

(thezvi.wordpress.com)

No One is Really Working

Annapurna7 Aug 2025 11:19 UTC

6 points

9 comments1 min readLW link

(www.humaninvariant.com)

[Question] Anthropic Is Going All In On Ability Without Intelligence?

Chapin Lenthall-Cleary7 Aug 2025 5:54 UTC

2 points

0 comments2 min readLW link

Civil Service: a Victim or a Villain?

Martin Sustrik7 Aug 2025 5:50 UTC

67 points

27 comments4 min readLW link

(www.250bpm.com)

AXRP Episode 46 - Tom Davidson on AI-enabled Coups

DanielFilan7 Aug 2025 5:10 UTC

17 points

0 comments68 min readLW link

A Cheeky Pint with Anthropic CEO Dario Amodei

WilliamKiely7 Aug 2025 3:21 UTC

10 points

3 comments1 min readLW link

Reproducing Absolute Zero

Lucy Wingard7 Aug 2025 3:01 UTC

5 points

1 comment4 min readLW link

Interview with Kelsey Piper on Self-Censorship and the Vibe Shift

Zack_M_Davis7 Aug 2025 2:51 UTC

57 points

1 comment15 min readLW link

(unremediatedgender.space)

Forbes: Fear Of Super Intelligent AI Is Driving Harvard And MIT Students To Drop Out

Nikola Jurkovic7 Aug 2025 2:02 UTC

19 points

0 comments1 min readLW link

(www.forbes.com)

Open weights != Open source

martinkunev7 Aug 2025 1:04 UTC

2 points

8 comments3 min readLW link

No, Rationalism Is Not a Cult

Liam Robins7 Aug 2025 0:39 UTC

23 points

18 comments10 min readLW link

(thelimestack.substack.com)

Critiquing the Dunning-Kruger Effect

Jennifer Young7 Aug 2025 0:36 UTC

0 points

0 comments1 min readLW link

Re: recent Anthropic safety research

Eliezer Yudkowsky6 Aug 2025 22:52 UTC

157 points

24 comments5 min readLW link

(x.com)

It’s Owl in the Numbers: Token Entanglement in Subliminal Learning

Alex Loftus, Amir Zur, Kerem Şahin, zfying and Hadas Orgad

6 Aug 2025 22:18 UTC

41 points

7 comments4 min readLW link

[Question] Inscrutability was always inevitable, right?

Steven Byrnes6 Aug 2025 21:57 UTC

101 points

33 comments2 min readLW link

Claude, GPT, and Gemini All Struggle to Evade Monitors

Vincent Cheng and Thomas Kwa

6 Aug 2025 20:28 UTC

61 points

3 comments5 min readLW link

Opus 4.1 Is An Incremental Improvement

Zvi6 Aug 2025 19:50 UTC

46 points

1 comment6 min readLW link

(thezvi.wordpress.com)

My Mistake, Your Problem

Gordon Seidoh Worley6 Aug 2025 17:41 UTC

9 points

0 comments4 min readLW link

(uncertainupdates.substack.com)

[Question] How useful could stolen AI model weights be without knowing the architecture and activation functions?

Jemal Young6 Aug 2025 17:36 UTC

6 points

5 comments1 min readLW link

Statistical suggestions for mech interp research and beyond

Paul B6 Aug 2025 12:45 UTC

65 points

4 comments15 min readLW link

Investigating Internal Representations of Correctness in SONAR Text Autoencoders

Samuel Nellessen and antonghawthorne

6 Aug 2025 12:13 UTC

5 points

0 comments7 min readLW link

How hard to achieve is eutopia?

wdmacaskill6 Aug 2025 11:02 UTC

17 points

2 comments7 min readLW link

Love, Lies and Misalignment

Priyanka Bharadwaj6 Aug 2025 9:44 UTC

6 points

1 comment3 min readLW link

My current guess at the effect of AI automation on jobs

sortega6 Aug 2025 8:17 UTC

15 points

6 comments2 min readLW link

Zoom Out: Distributions in Semantic Spaces

TristanTrim6 Aug 2025 0:01 UTC

14 points

4 comments4 min readLW link

An opinionated guide to building a good to-do system

bilalchughtai5 Aug 2025 23:00 UTC

24 points

9 comments8 min readLW link

(bilalchughtai.co.uk)

Good Ideas Aren’t Enough in AI Policy

Andersehen5 Aug 2025 22:38 UTC

12 points

0 comments5 min readLW link