5 Nov 2025 23:45 UTC

6 points

0 comments9 min readLW link

Sentient Futures Summit 2026 Bay Area: Apply to Speak!

jonahmattwoodward5 Nov 2025 23:39 UTC

1 point

0 comments1 min readLW link

Breaking Books: A tool to bring books to the social sphere

Alexandre Variengien5 Nov 2025 22:53 UTC

17 points

1 comment8 min readLW link

(alexandrevariengien.com)

Digital minimalism is out, digital intentionality is in

mingyuan5 Nov 2025 22:01 UTC

32 points

1 comment2 min readLW link

(mingyuan.substack.com)

Anthropic Commits To Model Weight Preservation

Zvi5 Nov 2025 21:30 UTC

84 points

13 comments14 min readLW link

(thezvi.wordpress.com)

Meta-agentic Prisoner’s Dilemmas

TsviBT5 Nov 2025 16:44 UTC

39 points

0 comments5 min readLW link

Living in the Shadow of The Sort

Gordon Seidoh Worley5 Nov 2025 16:31 UTC

23 points

5 comments5 min readLW link

(www.uncertainupdates.com)

Hardening against AI takeover is difficult, but we should try

otto.barten5 Nov 2025 16:25 UTC

11 points

0 comments5 min readLW link

(www.existentialriskobservatory.org)

AI Safety at the Frontier: Paper Highlights of October 2025

gasteigerjo5 Nov 2025 13:39 UTC

7 points

0 comments8 min readLW link

(aisafetyfrontier.substack.com)

New homepage for AI safety resources – AISafety.com redesign

Bryce Robertson, Søren Elverlin and honeybee

5 Nov 2025 10:33 UTC

35 points

2 comments1 min readLW link

An atheist’s guide to prayer

Nathan Young5 Nov 2025 9:51 UTC

18 points

3 comments5 min readLW link

(open.substack.com)

Theory of Change for US Govt Whistleblower Database and Guide

samuelshadrach5 Nov 2025 9:08 UTC

2 points

0 comments14 min readLW link

(samuelshadrach.com)

AGI is building itself

Anonim Anonymous5 Nov 2025 8:52 UTC

−9 points

1 comment1 min readLW link

Suffering is what makes it special

Dentosal5 Nov 2025 8:04 UTC

−2 points

1 comment2 min readLW link

Maxwell’s Demon and the Arrow of Time

Adam Scherlis5 Nov 2025 7:35 UTC

29 points

2 comments6 min readLW link

(adam.scherl.is)

How to be convincing when talking to people about existential threat from AI

Mikhail Samin5 Nov 2025 7:01 UTC

35 points

2 comments5 min readLW link

[FICTION] Sable and Able: A Tale of Two ASIs

Mr Beastly5 Nov 2025 6:18 UTC

−3 points

0 comments18 min readLW link

Why Safety Constraints in LLMs Are Easily Breakable? Knowledge as a Network of Gated Circuits

Aditya Raj5 Nov 2025 5:20 UTC

12 points

0 comments4 min readLW link

Using math to foster acceptance and equality

jackoda5 Nov 2025 5:16 UTC

−1 points

0 comments1 min readLW link

Dario Amodei’s “Machines of Loving Grace” sounds incredibly dangerous, for Humans

Super AGI5 Nov 2025 4:42 UTC

14 points

1 comment1 min readLW link

What are you excited about doing?

mingyuan5 Nov 2025 4:40 UTC

19 points

0 comments2 min readLW link

(mingyuan.substack.com)

Intentionality

abramdemski5 Nov 2025 4:30 UTC

30 points

4 comments2 min readLW link

Food-related things that have made my life a little better

Philipreal5 Nov 2025 3:47 UTC

7 points

1 comment2 min readLW link

Gerrymandering California

Nisan5 Nov 2025 2:46 UTC

14 points

0 comments3 min readLW link

How to survive until AGI

Nikola Jurkovic5 Nov 2025 1:17 UTC

28 points

3 comments3 min readLW link

(nikolajurkovic.substack.com)

Heroic Responsibility

johnswentworth4 Nov 2025 23:26 UTC

80 points

31 comments2 min readLW link

[Linkpost] Competing Motivations: When More Incentives Lead To Less Effort

Gunnar_Zarncke4 Nov 2025 23:02 UTC

11 points

0 comments1 min readLW link

(x.com)

Not Over Or Under Indexed

Screwtape4 Nov 2025 22:54 UTC

41 points

0 comments6 min readLW link

Being “Usefully Concrete”

Raemon4 Nov 2025 22:15 UTC

44 points

4 comments4 min readLW link

Legible vs. Illegible AI Safety Problems

Wei Dai4 Nov 2025 21:39 UTC

393 points

96 comments2 min readLW link

Parsing Validation

Dentosal4 Nov 2025 21:19 UTC

5 points

1 comment3 min readLW link

A/B testing could lead LLMs to retain users instead of helping them

Daniel Paleka4 Nov 2025 19:30 UTC

28 points

0 comments4 min readLW link

(newsletter.danielpaleka.com)

OpenAI: The Battle of the Board: Ilya’s Testimony

Zvi4 Nov 2025 19:30 UTC

44 points

1 comment5 min readLW link

(thezvi.wordpress.com)

Berkeley Secular Solstice Weekend

Raemon4 Nov 2025 18:37 UTC

22 points

18 comments1 min readLW link

Modeling the geopolitics of AI development

Alex Amadori, Gabriel Alfour, Andrea_Miotti and Eva_B

4 Nov 2025 17:31 UTC

46 points

0 comments2 min readLW link

(ai-scenarios.com)

Thoughts by a non-economist on AI and economics

Boaz Barak4 Nov 2025 17:06 UTC

42 points

2 comments14 min readLW link

GDM: Consistency Training Helps Limit Sycophancy and Jailbreaks in Gemini 2.5 Flash

TurnTrout and Rohin Shah

4 Nov 2025 16:25 UTC

53 points

2 comments6 min readLW link

(arxiv.org)

AI Safety Camp 11

Robert Kralisch and Remmelt

4 Nov 2025 14:56 UTC

8 points

0 comments15 min readLW link

Keeping Ants and Spotting Queens

Morpheus4 Nov 2025 13:49 UTC

12 points

0 comments2 min readLW link

Letter to a close friend

Alexandre Variengien4 Nov 2025 13:17 UTC

9 points

0 comments2 min readLW link

(alexandrevariengien.com)

Open-weight training practices and implications for CoT monitorability

Cam and robert mccarthy

4 Nov 2025 10:49 UTC

20 points

0 comments9 min readLW link

Free Learning in Today’s Society: Some Personal Experiences and Reflections

L.M.Sherlock4 Nov 2025 10:30 UTC

30 points

1 comment41 min readLW link

(lmsherlock.substack.com)

A prayer for engaging in conflict

TsviBT4 Nov 2025 8:19 UTC

68 points

0 comments2 min readLW link

Rainbows, fractals, and crumpled paper: Hölder continuity

Adam Scherlis4 Nov 2025 8:01 UTC

10 points

0 comments3 min readLW link

(adam.scherl.is)

Taste of food

Mikhail Samin4 Nov 2025 7:47 UTC

22 points

0 comments3 min readLW link

(mikhailsamin.substack.com)

Retrospective on US govt whistleblower guide and DB

samuelshadrach4 Nov 2025 7:30 UTC

4 points

0 comments2 min readLW link

(samuelshadrach.com)

US Govt Whistleblower Guide

samuelshadrach4 Nov 2025 7:22 UTC

1 point

6 comments7 min readLW link

(samuelshadrach.com)

US Govt Whistleblower Database

samuelshadrach4 Nov 2025 7:20 UTC

6 points

6 comments33 min readLW link

(samuelshadrach.com)

The Mortifying Ordeal of Knowing Thyself

Philipreal4 Nov 2025 5:16 UTC

6 points

0 comments3 min readLW link

Build the life you actually want

mingyuan4 Nov 2025 4:50 UTC

58 points

3 comments3 min readLW link

(mingyuan.substack.com)