All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 111213 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Fairly Breaking Ties Without Fair Coins

Brendan Long11 Nov 2025 21:48 UTC

11 points

10 comments4 min readLW link

(www.brendanlong.com)

Kimi K2 Thinking

Zvi11 Nov 2025 21:10 UTC

47 points

0 comments5 min readLW link

(thezvi.wordpress.com)

Not-A-Book Review: The Attractive Man (Dating Coach Service)

25Hour11 Nov 2025 20:03 UTC

15 points

0 comments1 min readLW link

(lifeimprovementschemes.substack.com)

Don’t Get One-Shotted

Jordan Rubin11 Nov 2025 17:07 UTC

−1 points

2 comments6 min readLW link

(jordanmrubin.substack.com)

Learnings from the Zurich AI Safety Day

MariusWenk, Al-Hussein Saqr and bluedotimpact

11 Nov 2025 17:00 UTC

14 points

0 comments6 min readLW link

Steering Language Models with Weight Arithmetic

Fabien Roger and constanzafierro

11 Nov 2025 16:30 UTC

88 points

6 comments5 min readLW link

Announcing the Society of Teen Scientists

rogersbacon11 Nov 2025 16:08 UTC

8 points

0 comments1 min readLW link

What is Happening in AI Governance?

Alexander Müller and Thomas Vassil Brcic

11 Nov 2025 15:59 UTC

6 points

0 comments5 min readLW link

Human Agency at Stake

Alexander Müller and senyakk

11 Nov 2025 15:57 UTC

8 points

0 comments6 min readLW link

Omniscience one bit at a time: Chapter 3

Dentosal11 Nov 2025 13:34 UTC

2 points

0 comments2 min readLW link

Evolution’s Alignment Solution: Why Burnout Prevents Monsters

Elias_Kunnas11 Nov 2025 13:32 UTC

9 points

0 comments6 min readLW link

Thick practices for AI tools

Alexandre Variengien11 Nov 2025 13:13 UTC

19 points

2 comments20 min readLW link

(alexandrevariengien.com)

The problem of graceful deference

TsviBT11 Nov 2025 8:17 UTC

108 points

41 comments4 min readLW link

See Your Word Count While You Write

dreeves11 Nov 2025 8:02 UTC

7 points

3 comments2 min readLW link

On Stance

Screwtape11 Nov 2025 7:50 UTC

24 points

5 comments6 min readLW link

Breaking the Hedonic Rubber Band

Ben Pace11 Nov 2025 7:00 UTC

20 points

4 comments4 min readLW link

Rejecting “Goodness” Does Not Mean Hammering The Defect Button

johnswentworth11 Nov 2025 6:50 UTC

25 points

6 comments2 min readLW link

Strengthening Red Teams: A Modular Scaffold for Control Evaluations

Chloe Loughridge11 Nov 2025 6:20 UTC

7 points

0 comments1 min readLW link

(alignment.anthropic.com)

On the Normativity of Debate: A Discussion With Said Achmiz

Zack_M_Davis11 Nov 2025 5:49 UTC

21 points

1 comment22 min readLW link

Question the Requirements

habryka11 Nov 2025 5:25 UTC

95 points

12 comments3 min readLW link

France is ready to stand alone

Lucie Philippon11 Nov 2025 5:09 UTC

32 points

6 comments2 min readLW link

(aelerinya.substack.com)

Love is Willingness to do Violence

Eneasz11 Nov 2025 5:09 UTC

16 points

9 comments2 min readLW link

(deathisbad.substack.com)

Don’t cancel out your rewards!

Sneha Bangalore11 Nov 2025 5:04 UTC

1 point

0 comments15 min readLW link

Turning Grey

Taylor G. Lunt11 Nov 2025 4:40 UTC

8 points

0 comments11 min readLW link

The AI bubble covered in the Atlantic

Remmelt11 Nov 2025 4:11 UTC

4 points

0 comments2 min readLW link

(www.theatlantic.com)

A Simple Sing-along Solstice

maia11 Nov 2025 2:49 UTC

34 points

3 comments1 min readLW link

(tigrennatenn.neocities.org)

Universal Basic Income in an AGI Future

Simon Lermen11 Nov 2025 2:26 UTC

21 points

1 comment2 min readLW link

(simonlermen.substack.com)

Ternary plots are underrated

Adam Scherlis11 Nov 2025 2:19 UTC

17 points

1 comment3 min readLW link

(adam.scherl.is)

How likely is dangerous AI in the short term?

Nikola Jurkovic11 Nov 2025 2:14 UTC

26 points

3 comments4 min readLW link

(nikolajurkovic.substack.com)

On model weight preservation: Anthropic’s new initiative

Olle Häggström11 Nov 2025 1:12 UTC

16 points

2 comments1 min readLW link

(haggstrom.substack.com)

Pause from Behind / Losing Heroically

enterthewoods11 Nov 2025 1:12 UTC

0 points

0 comments5 min readLW link

[Linkpost] Galaxy brain resistance

derikk11 Nov 2025 0:43 UTC

4 points

0 comments1 min readLW link

(vitalik.eth.limo)

A pencil is not a pencil is not a pencil

Algon10 Nov 2025 23:59 UTC

18 points

4 comments2 min readLW link

The Open Strategy Dictator Game: An Experiment in Transparent Cooperation

Michael Glass10 Nov 2025 23:26 UTC

13 points

2 comments1 min readLW link

DC/Maryland Secular Solstice

maia10 Nov 2025 23:25 UTC

13 points

2 comments1 min readLW link

What I learned building a language-learning app

depressurize10 Nov 2025 21:04 UTC

5 points

1 comment10 min readLW link

(chadnauseam.com)

Andrej Karpathy on LLM cognitive deficits

Nina Panickssery10 Nov 2025 21:02 UTC

45 points

3 comments5 min readLW link

(www.dwarkesh.com)

Consciousness as a Distributed Ponzi Scheme

abramdemski10 Nov 2025 20:18 UTC

34 points

11 comments4 min readLW link

Maat—Intro Post

TristanTrim10 Nov 2025 20:09 UTC

3 points

0 comments1 min readLW link

Variously Effective Altruism

Zvi10 Nov 2025 19:21 UTC

14 points

3 comments8 min readLW link

(thezvi.wordpress.com)

Why does everything feel so urgent?

mingyuan10 Nov 2025 18:11 UTC

19 points

8 comments3 min readLW link

(mingyuan.substack.com)

Omniscience one bit at a time: Chapter 2

Dentosal10 Nov 2025 15:47 UTC

4 points

0 comments2 min readLW link

Social drives 1: “Sympathy Reward”, from compassion to dehumanization

Steven Byrnes10 Nov 2025 14:53 UTC

36 points

7 comments13 min readLW link

Ontology for AI Cults and Cyborg Egregores

Jan_Kulveit10 Nov 2025 13:19 UTC

68 points

14 comments2 min readLW link

From Vitalik: Galaxy brain resistance

Gabriel Alfour10 Nov 2025 13:06 UTC

115 points

2 comments1 min readLW link

(vitalik.eth.limo)

The jailbreak argument against LLM values

technicalities10 Nov 2025 12:05 UTC

29 points

2 comments6 min readLW link

The grapefruit juice effect

Adam Scherlis10 Nov 2025 8:49 UTC

38 points

1 comment5 min readLW link

(adam.scherl.is)

Against Powerful Text Editors

dreeves10 Nov 2025 8:11 UTC

10 points

11 comments2 min readLW link

MtG Colour Wheel applied to Politics

samuelshadrach10 Nov 2025 5:05 UTC

−5 points

6 comments6 min readLW link

(samuelshadrach.com)

The only important ASI timeline

beyarkay (Boyd Kane)10 Nov 2025 4:53 UTC

2 points

4 comments1 min readLW link

(boydkane.com)