All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Heroic Responsibility

johnswentworth4 Nov 2025 23:26 UTC

80 points

31 comments2 min readLW link

[Linkpost] Competing Motivations: When More Incentives Lead To Less Effort

Gunnar_Zarncke4 Nov 2025 23:02 UTC

11 points

0 comments1 min readLW link

(x.com)

Not Over Or Under Indexed

Screwtape4 Nov 2025 22:54 UTC

41 points

0 comments6 min readLW link

Being “Usefully Concrete”

Raemon4 Nov 2025 22:15 UTC

44 points

4 comments4 min readLW link

Legible vs. Illegible AI Safety Problems

Wei Dai4 Nov 2025 21:39 UTC

393 points

96 comments2 min readLW link

Parsing Validation

Dentosal4 Nov 2025 21:19 UTC

5 points

1 comment3 min readLW link

A/B testing could lead LLMs to retain users instead of helping them

Daniel Paleka4 Nov 2025 19:30 UTC

28 points

0 comments4 min readLW link

(newsletter.danielpaleka.com)

OpenAI: The Battle of the Board: Ilya’s Testimony

Zvi4 Nov 2025 19:30 UTC

44 points

1 comment5 min readLW link

(thezvi.wordpress.com)

Berkeley Secular Solstice Weekend

Raemon4 Nov 2025 18:37 UTC

22 points

18 comments1 min readLW link

Modeling the geopolitics of AI development

Alex Amadori, Gabriel Alfour, Andrea_Miotti and Eva_B

4 Nov 2025 17:31 UTC

46 points

0 comments2 min readLW link

(ai-scenarios.com)

Thoughts by a non-economist on AI and economics

Boaz Barak4 Nov 2025 17:06 UTC

42 points

2 comments14 min readLW link

GDM: Consistency Training Helps Limit Sycophancy and Jailbreaks in Gemini 2.5 Flash

TurnTrout and Rohin Shah

4 Nov 2025 16:25 UTC

53 points

2 comments6 min readLW link

(arxiv.org)

AI Safety Camp 11

Robert Kralisch and Remmelt

4 Nov 2025 14:56 UTC

8 points

0 comments15 min readLW link

Keeping Ants and Spotting Queens

Morpheus4 Nov 2025 13:49 UTC

12 points

0 comments2 min readLW link

Letter to a close friend

Alexandre Variengien4 Nov 2025 13:17 UTC

9 points

0 comments2 min readLW link

(alexandrevariengien.com)

Open-weight training practices and implications for CoT monitorability

Cam and robert mccarthy

4 Nov 2025 10:49 UTC

20 points

0 comments9 min readLW link

Free Learning in Today’s Society: Some Personal Experiences and Reflections

L.M.Sherlock4 Nov 2025 10:30 UTC

30 points

1 comment41 min readLW link

(lmsherlock.substack.com)

A prayer for engaging in conflict

TsviBT4 Nov 2025 8:19 UTC

68 points

0 comments2 min readLW link

Rainbows, fractals, and crumpled paper: Hölder continuity

Adam Scherlis4 Nov 2025 8:01 UTC

10 points

0 comments3 min readLW link

(adam.scherl.is)

Taste of food

Mikhail Samin4 Nov 2025 7:47 UTC

22 points

0 comments3 min readLW link

(mikhailsamin.substack.com)

Retrospective on US govt whistleblower guide and DB

samuelshadrach4 Nov 2025 7:30 UTC

4 points

0 comments2 min readLW link

(samuelshadrach.com)

US Govt Whistleblower Guide

samuelshadrach4 Nov 2025 7:22 UTC

1 point

6 comments7 min readLW link

(samuelshadrach.com)

US Govt Whistleblower Database

samuelshadrach4 Nov 2025 7:20 UTC

6 points

6 comments33 min readLW link

(samuelshadrach.com)

The Mortifying Ordeal of Knowing Thyself

Philipreal4 Nov 2025 5:16 UTC

6 points

0 comments3 min readLW link

Build the life you actually want

mingyuan4 Nov 2025 4:50 UTC

58 points

3 comments3 min readLW link

(mingyuan.substack.com)

Research Reflections

abramdemski4 Nov 2025 4:33 UTC

97 points

3 comments3 min readLW link

I ate bear fat with honey and salt flakes, to prove a point

aggliu4 Nov 2025 2:00 UTC

326 points

53 comments5 min readLW link

(signoregalilei.com)

Questions About Outperforming Common Wisdom

Notelrac4 Nov 2025 0:38 UTC

2 points

0 comments2 min readLW link

Parleying with the Principled

Screwtape4 Nov 2025 0:23 UTC

14 points

0 comments8 min readLW link

The Zen Of Maxent As A Generalization Of Bayes Updates

johnswentworth and David Lorell

4 Nov 2025 0:02 UTC

63 points

8 comments7 min readLW link

Sam Altman’s track record of manipulation: some quotes from Karen Hao’s “Empire of AI”

i_am_nuts3 Nov 2025 22:25 UTC

21 points

3 comments5 min readLW link

(iamnuts.substack.com)

Comparative advantage & AI

Simon Lermen3 Nov 2025 21:50 UTC

120 points

28 comments4 min readLW link

Just complaining about LLM sycophancy (filler episode)

Dentosal3 Nov 2025 20:33 UTC

7 points

0 comments3 min readLW link

The Tale of the Top-Tier Intellect

Eliezer Yudkowsky3 Nov 2025 20:21 UTC

123 points

68 comments35 min readLW link

Metaphors for Biology: Sizes

Niko McCarty3 Nov 2025 19:40 UTC

1 point

0 comments7 min readLW link

(press.asimov.com)

AI Safety Unconference, Melbourne 2025

mjkerrison3 Nov 2025 19:36 UTC

2 points

0 comments1 min readLW link

[Question] High-Resistance Systems to Change: Can a Political Strategy Apply to Personal Change?

FireBrito de S. Gabriel3 Nov 2025 19:09 UTC

4 points

0 comments1 min readLW link

Leaving Open Philanthropy, going to Anthropic

Joe Carlsmith3 Nov 2025 17:38 UTC

113 points

30 comments18 min readLW link

Red Heart

PeterMcCluskey3 Nov 2025 17:32 UTC

30 points

0 comments3 min readLW link

(bayesianinvestor.com)

Falling AI Costs and the Proliferation of Offensive Capabilities

Felix Choussat3 Nov 2025 17:32 UTC

15 points

2 comments24 min readLW link

The EU could hold AI capabilities development hostage if they wanted to

beyarkay (Boyd Kane)3 Nov 2025 16:54 UTC

3 points

0 comments1 min readLW link

(boydkane.com)

What’s up with Anthropic predicting AGI by early 2027?

ryan_greenblatt3 Nov 2025 16:45 UTC

162 points

16 comments20 min readLW link

To improve Rationality, create Situations

abstractapplic3 Nov 2025 16:10 UTC

18 points

3 comments3 min readLW link

The Unreasonable Effectiveness of Fiction

Raelifin3 Nov 2025 15:35 UTC

220 points

29 comments8 min readLW link

(raelifin.substack.com)

Crime and Punishment #1

Zvi3 Nov 2025 15:30 UTC

51 points

4 comments45 min readLW link

(thezvi.wordpress.com)

Solving a problem with mindware

Alexandre Variengien3 Nov 2025 15:17 UTC

10 points

0 comments2 min readLW link

(alexandrevariengien.com)

Publishing academic papers on transformative AI is a nightmare

Jakub Growiec3 Nov 2025 13:04 UTC

167 points

10 comments4 min readLW link

Pepperoni and the end of morality

ceselder3 Nov 2025 10:15 UTC

1 point

2 comments2 min readLW link

Trying to understand my own cognitive edge

Wei Dai3 Nov 2025 8:49 UTC

74 points

17 comments4 min readLW link

There’s some chance oral herpes is pretty bad for you?

GradientDissenter3 Nov 2025 6:30 UTC

32 points

4 comments6 min readLW link