Credit goes to the presenter, not the inventor

Algon · 26 Oct 2025 23:55 UTC
42 points
5 comments · 3 min read · LW link

On Fleshling Safety: A Debate by Klurl and Trapaucius.

Eliezer Yudkowsky · 26 Oct 2025 23:44 UTC
253 points
52 comments · 79 min read · LW link

Results of “Experiment on Bernoulli processes”

joseph_c · 26 Oct 2025 21:47 UTC
9 points
2 comments · 4 min read · LW link

certain exotic neurotransmitters as SMART PILLS: or compounds that increase the capacity for mental work in humans

azergante · 26 Oct 2025 20:51 UTC
4 points
0 comments · 22 min read · LW link
(erowid.org)

Cancer has a surprising amount of detail

Abhishaike Mahajan · 26 Oct 2025 20:33 UTC
127 points
18 comments · 11 min read · LW link
(www.owlposting.com)

Stability of natural latents in information theoretic terms

Aram Ebtekar · 26 Oct 2025 20:33 UTC
35 points
0 comments · 2 min read · LW link

Lessons from Teaching Rationality to EAs in the Netherlands

Shoshannah Tekofsky · 26 Oct 2025 20:03 UTC
20 points
0 comments · 7 min read · LW link
(forum.effectivealtruism.org)

Are We Their Chimps?

soycarts · 26 Oct 2025 16:04 UTC
−7 points
49 comments · 1 min read · LW link

FWIW: What I noticed at a (Goenka) Vipassana retreat

David Gross · 26 Oct 2025 15:10 UTC
38 points
4 comments · 9 min read · LW link

Brightline is Actually Pretty Dangerous

jefftk · 26 Oct 2025 12:51 UTC
53 points
12 comments · 3 min read · LW link
(www.jefftk.com)

Seven-ish Words from My Thought-Language

Lorxus · 26 Oct 2025 4:30 UTC
68 points
13 comments · 4 min read · LW link
(tiled-with-pentagons.blogspot.com)

Remembrancy

Algon · 25 Oct 2025 22:47 UTC
11 points
0 comments · 3 min read · LW link

Pygmalion’s Wafer

Charlie Sanders · 25 Oct 2025 20:17 UTC
8 points
2 comments · 4 min read · LW link
(www.dailymicrofiction.com)

Debating theism

Ivan · 25 Oct 2025 18:35 UTC
−21 points
0 comments · 25 min read · LW link

[Question] Why is OpenAI releasing products like Sora and Atlas?

J Thomas Moros · 25 Oct 2025 17:59 UTC
16 points
10 comments · 1 min read · LW link

Origins and dangers of future AI capability denial

Patrick Spencer · 25 Oct 2025 16:13 UTC
68 points
18 comments · 10 min read · LW link

Do you completely trust that you are completely in the shit? - despair and information -

P. João · 25 Oct 2025 14:42 UTC
−2 points
17 comments · 3 min read · LW link

Assessing Far UVC Positioning

jefftk · 25 Oct 2025 14:00 UTC
20 points
3 comments · 2 min read · LW link
(www.jefftk.com)

Musings on Reported Cost of Compute (Oct 2025)

Vladimir_Nesov · 24 Oct 2025 20:42 UTC
103 points
11 comments · 2 min read · LW link

Regardless of X, you can still just sign superintelligence-statement.org if you agree

Ishual · 24 Oct 2025 20:30 UTC
58 points
0 comments · 3 min read · LW link

The Future of Interpretability is Geometric

sbaumohl · 24 Oct 2025 18:32 UTC
23 points
0 comments · 5 min read · LW link

New Statement Calls For Not Building Superintelligence For Now

Zvi · 24 Oct 2025 17:40 UTC
80 points
3 comments · 7 min read · LW link
(thezvi.wordpress.com)

Notes on “Explaining AI Explainability”

Eleni Angelou · 24 Oct 2025 17:22 UTC
20 points
0 comments · 6 min read · LW link

Can Reasoning Models Obfuscate Reasoning? Stress-Testing Chain-of-Thought Monitorability

24 Oct 2025 17:21 UTC
17 points
1 comment · 5 min read · LW link

I will not sign up for cryonics

Syd Lonreiro_ · 24 Oct 2025 16:56 UTC
−18 points
5 comments · 1 min read · LW link

Dollars in political giving are less fungible than you might think

lincolnquirk · 24 Oct 2025 15:54 UTC
6 points
1 comment · 5 min read · LW link
(lincolnquirk.substack.com)

Can AI Agents with Divergent Interests Learn To Prevent Civilizational Failures?

joao_abrantes · 24 Oct 2025 15:08 UTC
1 point
0 comments · 1 min read · LW link

LW Reacts pack for Discord/Slack/etc

plex · 24 Oct 2025 13:20 UTC
65 points
13 comments · 1 min read · LW link
(drive.google.com)

AI Timelines and Points of no return

Gabriel Alfour · 24 Oct 2025 11:15 UTC
36 points
8 comments · 1 min read · LW link
(cognition.cafe)

Introducing ControlArena: A library for running AI control experiments

Mojmir · 24 Oct 2025 9:51 UTC
13 points
0 comments · 3 min read · LW link
(www.aisi.gov.uk)

Can we steer AI models toward safer actions by making these instrumentally useful?

Francesca Gomez · 24 Oct 2025 9:18 UTC
5 points
0 comments · 2 min read · LW link
(www.wiserhuman.ai)

Plan 1 and Plan 2

Towards_Keeperhood · 24 Oct 2025 8:18 UTC
50 points
22 comments · 3 min read · LW link

Guys I might be an e/acc

Taylor G. Lunt · 24 Oct 2025 3:25 UTC
14 points
29 comments · 4 min read · LW link

How an AI company CEO could quietly take over the world

Alex Kastner · 23 Oct 2025 23:33 UTC
52 points
13 comments · 11 min read · LW link

Worlds Where Iterative Design Succeeds?

Max Harms · 23 Oct 2025 22:14 UTC
23 points
5 comments · 8 min read · LW link

Automated real time monitoring and orchestration of coding agents

23 Oct 2025 22:12 UTC
8 points
0 comments · 2 min read · LW link
(fulcrumresearch.ai)

Reminder: Morality is unsolved

Jesper L. · 23 Oct 2025 21:42 UTC
27 points
45 comments · 3 min read · LW link

The main way I’ve seen people turn ideologically crazy [Linkpost]

Noosphere89 · 23 Oct 2025 20:09 UTC
123 points
22 comments · 8 min read · LW link
(andymasley.substack.com)

Empirical Partial Derivatives

sonicrocketman · 23 Oct 2025 17:54 UTC
8 points
0 comments · 3 min read · LW link
(brianschrader.com)

Building a different kind of personal intelligence

Rebecca Dai · 23 Oct 2025 17:45 UTC
7 points
0 comments · 9 min read · LW link
(rebeccadai.substack.com)

Beliefs about formal methods and AI safety

Quinn · 23 Oct 2025 16:43 UTC
32 points
0 comments · 5 min read · LW link

AI #139: The Overreach Machines

Zvi · 23 Oct 2025 15:30 UTC
35 points
5 comments · 52 min read · LW link
(thezvi.wordpress.com)

Should AI Developers Remove Discussion of AI Misalignment from AI Training Data?

Alek Westover · 23 Oct 2025 15:12 UTC
43 points
3 comments · 9 min read · LW link

SecureBio is Hiring Software Engineers

jefftk · 23 Oct 2025 14:10 UTC
21 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Is terminal lucidity real?

Ariel Zeleznikow-Johnston · 23 Oct 2025 11:40 UTC
20 points
0 comments · 1 min read · LW link
(open.substack.com)

A Concrete Roadmap towards Safety Cases based on Chain-of-Thought Monitoring

Wuschel Schulz · 23 Oct 2025 11:34 UTC
37 points
5 comments · 4 min read · LW link
(arxiv.org)

Differences in Alignment Behaviour between Single-Agent and Multi-Agent AI Systems

23 Oct 2025 11:17 UTC
7 points
3 comments · 5 min read · LW link

LW Psychosis

Annabelle · 23 Oct 2025 8:12 UTC
18 points
10 comments · 3 min read · LW link

Announcing the Futurekind Winter Fellowship 2025/6

Aditya S · 23 Oct 2025 5:40 UTC
1 point
0 comments · 4 min read · LW link

Learning to Interpret Weight Differences in Language Models

avichal · 23 Oct 2025 3:55 UTC
89 points
2 comments · 5 min read · LW link
(arxiv.org)