A Bayesian Explanation of Causal Models

Menotim 27 Oct 2025 23:16 UTC

2 points

0 comments, 25 min read, LW link

Brainstorming Food on the Cheap + Healthy + Convenient + Edible Frontier

Morpheus 27 Oct 2025 23:04 UTC

20 points

3 comments, 4 min read, LW link

Transactional method for non-transactional relationship: Relationship as a Common-pool Resource problem

David H. 27 Oct 2025 22:29 UTC

2 points

0 comments, 7 min read, LW link

[Question] How Important is Inverting LLMs?

Maloew 27 Oct 2025 20:59 UTC

8 points

1 comment, 1 min read, LW link

Asking (Some Of) The Right Questions

Zvi 27 Oct 2025 19:00 UTC

31 points

3 comments, 14 min read, LW link

(thezvi.wordpress.com)

life lessons from trading

thiccythot 27 Oct 2025 16:56 UTC

43 points

3 comments, 4 min read, LW link

Agentic Monitoring for AI Control

LAThomson 27 Oct 2025 16:38 UTC

10 points

0 comments, 9 min read, LW link

Major survey on the HS/TS spectrum and gAyGP

tailcalled 27 Oct 2025 14:31 UTC

24 points

3 comments, 8 min read, LW link

Death of the Author

J Bostock 27 Oct 2025 12:35 UTC

5 points

0 comments, 3 min read, LW link

Exploring the multi-dimensional refusal subspace in reasoning models

Le magicien quantique 27 Oct 2025 9:03 UTC

5 points

2 comments, 4 min read, LW link

AIs should also refuse to work on capabilities research

Davidmanheim 27 Oct 2025 8:42 UTC

171 points

22 comments, 3 min read, LW link

Uncommon Utilitarianism #3: Bounded Utility Functions

Alice Blair 27 Oct 2025 5:06 UTC

16 points

10 comments, 6 min read, LW link

List of lists of project ideas in AI Safety

Veronica Gordi 27 Oct 2025 1:28 UTC

12 points

0 comments, 14 min read, LW link

(www.notion.so)

[Question] How valuable is money-in-market?

Hruss 27 Oct 2025 0:47 UTC

6 points

1 comment, 1 min read, LW link

Credit goes to the presenter, not the inventor

Algon 26 Oct 2025 23:55 UTC

44 points

5 comments, 3 min read, LW link

On Fleshling Safety: A Debate by Klurl and Trapaucius.

Eliezer Yudkowsky 26 Oct 2025 23:44 UTC

258 points

50 comments, 79 min read, LW link

Results of “Experiment on Bernoulli processes”

joseph_c 26 Oct 2025 21:47 UTC

9 points

2 comments, 4 min read, LW link

certain exotic neurotransmitters as SMART PILLS: or compounds that increase the capacity for mental work in humans

azergante 26 Oct 2025 20:51 UTC

4 points

0 comments, 22 min read, LW link

(erowid.org)

Cancer has a surprising amount of detail

Abhishaike Mahajan 26 Oct 2025 20:33 UTC

132 points

18 comments, 11 min read, LW link

(www.owlposting.com)

Stability of natural latents in information theoretic terms

Aram Ebtekar 26 Oct 2025 20:33 UTC

36 points

0 comments, 2 min read, LW link

Lessons from Teaching Rationality to EAs in the Netherlands

Shoshannah Tekofsky 26 Oct 2025 20:03 UTC

20 points

0 comments, 7 min read, LW link

(forum.effectivealtruism.org)

Are We Their Chimps?

soycarts 26 Oct 2025 16:04 UTC

−7 points

49 comments, 1 min read, LW link

FWIW: What I noticed at a (Goenka) Vipassana retreat

David Gross 26 Oct 2025 15:10 UTC

39 points

5 comments, 9 min read, LW link

Brightline is Actually Pretty Dangerous

jefftk 26 Oct 2025 12:51 UTC

55 points

12 comments, 3 min read, LW link

(www.jefftk.com)

Seven-ish Words from My Thought-Language

Lorxus 26 Oct 2025 4:30 UTC

68 points

13 comments, 4 min read, LW link

(tiled-with-pentagons.blogspot.com)

Remembrancy

Algon 25 Oct 2025 22:47 UTC

11 points

0 comments, 3 min read, LW link

Pygmalion’s Wafer

Charlie Sanders 25 Oct 2025 20:17 UTC

8 points

2 comments, 4 min read, LW link

(www.dailymicrofiction.com)

Debating theism

Ivan 25 Oct 2025 18:35 UTC

−21 points

0 comments, 25 min read, LW link

[Question] Why is OpenAI releasing products like Sora and Atlas?

J Thomas Moros 25 Oct 2025 17:59 UTC

16 points

10 comments, 1 min read, LW link

Origins and dangers of future AI capability denial

Patrick Spencer 25 Oct 2025 16:13 UTC

70 points

18 comments, 10 min read, LW link

Do you completely trust that you are completely in the shit? - despair and information -

FireBrito de S. Gabriel 25 Oct 2025 14:42 UTC

−2 points

17 comments, 3 min read, LW link

Assessing Far UVC Positioning

jefftk 25 Oct 2025 14:00 UTC

21 points

3 comments, 2 min read, LW link

(www.jefftk.com)

Musings on Reported Cost of Compute (Oct 2025)

Vladimir_Nesov 24 Oct 2025 20:42 UTC

107 points

11 comments, 2 min read, LW link

Regardless of X, you can still just sign superintelligence-statement.org if you agree

Ishual 24 Oct 2025 20:30 UTC

58 points

0 comments, 3 min read, LW link

The Future of Interpretability is Geometric

sbaumohl 24 Oct 2025 18:32 UTC

27 points

0 comments, 5 min read, LW link

New Statement Calls For Not Building Superintelligence For Now

Zvi 24 Oct 2025 17:40 UTC

80 points

3 comments, 7 min read, LW link

(thezvi.wordpress.com)

Notes on “Explaining AI Explainability”

Eleni Angelou 24 Oct 2025 17:22 UTC

20 points

0 comments, 6 min read, LW link

Can Reasoning Models Obfuscate Reasoning? Stress-Testing Chain-of-Thought Monitorability

Artur Zolkowski and Wen Xing 24 Oct 2025 17:21 UTC

23 points

1 comment, 5 min read, LW link

I will not sign up for cryonics

Syd Lonreiro_ 24 Oct 2025 16:56 UTC

−18 points

5 comments, 1 min read, LW link

Dollars in political giving are less fungible than you might think

lincolnquirk 24 Oct 2025 15:54 UTC

6 points

1 comment, 5 min read, LW link

(lincolnquirk.substack.com)

Can AI Agents with Divergent Interests Learn To Prevent Civilizational Failures?

joao_abrantes 24 Oct 2025 15:08 UTC

1 point

0 comments, 1 min read, LW link

LW Reacts pack for Discord/Slack/etc

plex 24 Oct 2025 13:20 UTC

65 points

16 comments, 1 min read, LW link

(drive.google.com)

AI Timelines and Points of no return

Gabriel Alfour 24 Oct 2025 11:15 UTC

36 points

8 comments, 1 min read, LW link

(cognition.cafe)

Introducing ControlArena: A library for running AI control experiments

Mojmir 24 Oct 2025 9:51 UTC

13 points

0 comments, 3 min read, LW link

(www.aisi.gov.uk)

Can we steer AI models toward safer actions by making these instrumentally useful?

Francesca Gomez 24 Oct 2025 9:18 UTC

5 points

0 comments, 2 min read, LW link

(www.wiserhuman.ai)

Plan 1 and Plan 2

Towards_Keeperhood 24 Oct 2025 8:18 UTC

54 points

22 comments, 3 min read, LW link

Guys I might be an e/acc

Taylor G. Lunt 24 Oct 2025 3:25 UTC

14 points

29 comments, 4 min read, LW link

How an AI company CEO could quietly take over the world

Alex Kastner 23 Oct 2025 23:33 UTC

61 points

13 comments, 11 min read, LW link

Worlds Where Iterative Design Succeeds?

Max Harms 23 Oct 2025 22:14 UTC

23 points

5 comments, 8 min read, LW link

Automated real time monitoring and orchestration of coding agents

zef, kaivu and leni 23 Oct 2025 22:12 UTC

8 points

0 comments, 2 min read, LW link

(fulcrumresearch.ai)