Your Clone Wants to Kill You Because You Assumed Too Much

Algon

15 Nov 2025 23:21 UTC

67 points

10 comments, 2 min read, LW link

Writing Hack: Write It Just Like That

eleweek

15 Nov 2025 22:16 UTC

24 points

0 comments, 3 min read, LW link

(psychotechnology.substack.com)

AI loves octopuses

Sean Herrington

15 Nov 2025 21:59 UTC

32 points

19 comments, 5 min read, LW link

Punctuation & Quotation Conventions

abramdemski

15 Nov 2025 18:13 UTC

21 points

14 comments, 2 min read, LW link

Matrices map between biproducts

jessicata

15 Nov 2025 18:05 UTC

41 points

6 comments, 5 min read, LW link

(unstableontology.com)

Don’t use the phrase “human values”

Nina Panickssery

15 Nov 2025 16:49 UTC

60 points

10 comments, 1 min read, LW link

Halfway there; on desperation management

Dentosal

15 Nov 2025 14:55 UTC

7 points

0 comments, 2 min read, LW link

“Middlemarch” is inane and also one of my favorite books

Ben Pace

15 Nov 2025 7:58 UTC

45 points

1 comment, 11 min read, LW link

Just Another Five Minutes

Screwtape

15 Nov 2025 7:47 UTC

43 points

4 comments, 5 min read, LW link

Same cognitive paints, exceedingly different mental pictures

Ruby

15 Nov 2025 7:13 UTC

17 points

0 comments, 4 min read, LW link

A Love Song to Nicotine

eleweek

15 Nov 2025 6:54 UTC

23 points

4 comments, 5 min read, LW link

(psychotechnology.substack.com)

Increasing returns to effort are common

habryka

15 Nov 2025 6:53 UTC

114 points

6 comments, 7 min read, LW link

Private Latent Notation and AI-Human Alignment

Robert Shuler

15 Nov 2025 5:47 UTC

6 points

1 comment, 6 min read, LW link

On Battle-Short: What, How, and Why Not To

Lorxus

15 Nov 2025 5:27 UTC

4 points

0 comments, 3 min read, LW link

(tiled-with-pentagons.blogspot.com)

The Flaw in the Paperclip Maximizer Thought Experiment

Taylor G. Lunt

15 Nov 2025 4:46 UTC

3 points

0 comments, 2 min read, LW link

“But You’d Like To Feel Companionate Love, Right? … Right?”

johnswentworth

15 Nov 2025 4:28 UTC

70 points

25 comments, 3 min read, LW link

Generation Ship: A Protest Song For PauseAI

LoganStrohl

15 Nov 2025 1:17 UTC

43 points

3 comments, 1 min read, LW link

Will AI systems drift into misalignment?

joshc

15 Nov 2025 1:03 UTC

15 points

3 comments, 15 min read, LW link

Everyday Clean Air

jefftk

15 Nov 2025 1:00 UTC

33 points

5 comments, 2 min read, LW link

(www.jefftk.com)

Some Sun Tsu quotes sound like they’re actually about debates/epistemics

depressurize

15 Nov 2025 0:41 UTC

6 points

2 comments, 1 min read, LW link

What are your impossible problems?

Raemon

15 Nov 2025 0:28 UTC

28 points

24 comments, 1 min read, LW link

Prediction markets for social deduction games

Mikhail Samin

15 Nov 2025 0:18 UTC

10 points

0 comments, 2 min read, LW link

(mikhailsamin.substack.com)

List of great filk songs

Algon

15 Nov 2025 0:17 UTC

26 points

5 comments, 2 min read, LW link

a sketch of how we might go about getting basins of corrigibility from RL

williawa

14 Nov 2025 22:10 UTC

10 points

0 comments, 4 min read, LW link

Lambda Calculus Prior

abramdemski

14 Nov 2025 21:29 UTC

25 points

3 comments, 4 min read, LW link

AI Craziness: Additional Suicide Lawsuits and The Fate of GPT-4o

Zvi

14 Nov 2025 20:20 UTC

45 points

0 comments, 7 min read, LW link

(thezvi.wordpress.com)

Understanding and Controlling LLM Generalization

Daniel Tan

14 Nov 2025 16:58 UTC

43 points

3 comments, 1 min read, LW link

Lorxus Does Halfhaven: 11/08~11/14

Lorxus

14 Nov 2025 13:23 UTC

5 points

0 comments, 2 min read, LW link

(tiled-with-pentagons.blogspot.com)

Finding Balance & Opportunity in the Holiday Flux [free public workshop]

teebarnett

14 Nov 2025 10:53 UTC

2 points

2 comments, 1 min read, LW link

From Anthony: Control Inversion

Gabriel Alfour

14 Nov 2025 9:36 UTC

10 points

0 comments, 1 min read, LW link

(control-inversion.ai)

LLM would have said this better, and without all these typos too

Dentosal

14 Nov 2025 9:33 UTC

8 points

0 comments, 2 min read, LW link

The Charge of the Hobby Horse

TsviBT

14 Nov 2025 8:17 UTC

65 points

46 comments, 5 min read, LW link

The Eightfold Path To Enlightened Disagreement

dreeves

14 Nov 2025 7:57 UTC

9 points

0 comments, 3 min read, LW link

10 Types of LessWrong Post

Ben Pace

14 Nov 2025 7:56 UTC

52 points

2 comments, 4 min read, LW link

Don’t let people buy credit with borrowed funds

habryka

14 Nov 2025 7:51 UTC

111 points

43 comments, 10 min read, LW link

Everyone has a plan until they get lied to the face

Screwtape

14 Nov 2025 7:22 UTC

183 points

33 comments, 7 min read, LW link

Notes on the book “Talent”

Nina Panickssery

14 Nov 2025 5:43 UTC

25 points

1 comment, 15 min read, LW link

(blog.ninapanickssery.com)

[Question] How do you read Less Wrong?

Mitchell_Porter

14 Nov 2025 5:17 UTC

20 points

15 comments, 1 min read, LW link

Thoughts are surprisingly detailed and remarkably autonomous

Ruby

14 Nov 2025 5:00 UTC

24 points

1 comment, 3 min read, LW link

Halfhaven Digest #4

Taylor G. Lunt

14 Nov 2025 4:16 UTC

9 points

0 comments, 2 min read, LW link

AI Corrigibility Debate: Max Harms vs. Jeremy Gillen

Liron, Max Harms and Jeremy Gillen

14 Nov 2025 4:09 UTC

46 points

1 comment, 75 min read, LW link

(doomdebates.com)

Types of systems that could be useful for agent foundations

Alex_Altair

14 Nov 2025 3:54 UTC

46 points

3 comments, 5 min read, LW link

The rare, deadly virus lurking in the Southwest US, and the bigger picture

eukaryote

14 Nov 2025 3:27 UTC

56 points

1 comment, 17 min read, LW link

(eukaryotewritesblog.com)

Tell people as early as possible it’s not going to work out

habryka

14 Nov 2025 2:21 UTC

152 points

17 comments, 2 min read, LW link

Questioning Computationalism

abramdemski

14 Nov 2025 1:30 UTC

22 points

7 comments, 19 min read, LW link

Orient Speed in the 21st Century

Raemon

14 Nov 2025 1:12 UTC

53 points

14 comments, 3 min read, LW link

(thehumanspirit.substack.com)

Evaluation Avoidance: How Humans and AIs Hack Reward by Disabling Evaluation Instead of Gaming Metrics

Johannes C. Mayer

14 Nov 2025 0:39 UTC

19 points

0 comments, 3 min read, LW link

Self-interpretability: LLMs can describe complex internal processes that drive their decisions

Adam Morris and Dillon Plunkett

14 Nov 2025 0:18 UTC

12 points

0 comments, 4 min read, LW link

(Fantasy) → (Planning): A Core Mental Move For Agentic Humans?

johnswentworth

14 Nov 2025 0:13 UTC

70 points

6 comments, 2 min read, LW link

[Question] How does one tell apart results in ethics and decision theory?

StanislavKrym

13 Nov 2025 23:42 UTC

6 points

0 comments, 2 min read, LW link