“Self-esteem” is distortionary

Algon

23 Nov 2025 23:59 UTC

15 points

3 comments

2 min read

LW link

Cyberbuddhist Jargon 1.0

lsusr

23 Nov 2025 23:39 UTC

50 points

21 comments

7 min read

LW link

Finding the uncertainty vector in GPT2-scale transformers

larry-dial

23 Nov 2025 23:34 UTC

9 points

0 comments

10 min read

LW link

Stop Applying And Get To Work

Pauliina and plex

23 Nov 2025 22:50 UTC

221 points

58 comments

2 min read

LW link

Halfhaven Digest #5

Taylor G. Lunt

23 Nov 2025 21:57 UTC

15 points

0 comments

3 min read

LW link

Emotions, Fabricated

Dentosal

23 Nov 2025 21:57 UTC

4 points

0 comments

2 min read

LW link

I’ll be sad to lose the puzzles

Ruby

23 Nov 2025 19:37 UTC

112 points

21 comments

2 min read

LW link

Show Review: Masquerade

johnswentworth

23 Nov 2025 19:20 UTC

41 points

2 comments

3 min read

LW link

AI Sentience and Welfare Misalignment Risk

edgecase64

23 Nov 2025 18:22 UTC

14 points

3 comments

8 min read

LW link

If you cannot be good, at least be bad correctly

beyarkay (Boyd Kane)

23 Nov 2025 17:51 UTC

17 points

1 comment

2 min read

LW link

(boydkane.com)

Please Measure Verification Burden

Quinn

23 Nov 2025 17:25 UTC

17 points

4 comments

4 min read

LW link

Solstice Singalong Watch Party

Linda Linsefors and philh

23 Nov 2025 16:36 UTC

11 points

0 comments

1 min read

LW link

Busking Practice

jefftk

23 Nov 2025 15:20 UTC

16 points

0 comments

1 min read

LW link

(www.jefftk.com)

The Enemy Gets The Last Hit

J Bostock

23 Nov 2025 12:22 UTC

47 points

5 comments

3 min read

LW link

A list of people who could’ve started a nuclear war, but chose not to

Mikhail Samin

23 Nov 2025 9:25 UTC

28 points

5 comments

5 min read

LW link

Traditional Food

lsusr

23 Nov 2025 8:07 UTC

109 points

10 comments

9 min read

LW link

Memories of a British Boarding School #2.5

Ben Pace

23 Nov 2025 7:54 UTC

23 points

2 comments

2 min read

LW link

Dipole Nature

alkjash

23 Nov 2025 7:24 UTC

40 points

2 comments

5 min read

LW link

(radimentary.wordpress.com)

What kind of person is DeepSeek’s founder, Liang Wenfeng? An answer from his old university classmate.

L.M.Sherlock

23 Nov 2025 4:54 UTC

92 points

0 comments

4 min read

LW link

(lmsherlock.substack.com)

Comment on Natural Emergent Misalignment Paper by Anthropic

Simon Lermen

23 Nov 2025 4:21 UTC

21 points

0 comments

4 min read

LW link

How to throw parties

RobertM

23 Nov 2025 3:59 UTC

22 points

0 comments

5 min read

LW link

Stream of Consciousness as a Scaffolding Skill

Screwtape

23 Nov 2025 3:31 UTC

33 points

2 comments

4 min read

LW link

Literacy is Decreasing Among the Intellectual Class

Taylor G. Lunt

23 Nov 2025 3:08 UTC

37 points

29 comments

10 min read

LW link

Market Logic II

abramdemski

23 Nov 2025 1:41 UTC

24 points

3 comments

7 min read

LW link

You can just do things: 5 frames

Algon

23 Nov 2025 0:43 UTC

54 points

3 comments

3 min read

LW link

Easy vs Hard Emotional Vulnerability

johnswentworth

23 Nov 2025 0:15 UTC

34 points

25 comments

2 min read

LW link

Why your sports car isn’t a racecar (tradeoffs everywhere)

Ruby

22 Nov 2025 23:23 UTC

29 points

0 comments

5 min read

LW link

Assorted Thoughts on “Pivoting” to AI

Trevor Hill-Hand

22 Nov 2025 21:17 UTC

12 points

1 comment

4 min read

LW link

OpenAI Locks Down San Francisco Offices Following Alleged Threat From Activist

Matrice Jacobine

22 Nov 2025 19:33 UTC

40 points

0 comments

4 min read

LW link

(www.wired.com)

Sorry, I still think kidney donation makes no sense for an EA

nicholashalden

22 Nov 2025 18:10 UTC

6 points

4 comments

1 min read

LW link

(substack.com)

Automatic alt text generation

TurnTrout

22 Nov 2025 17:57 UTC

27 points

1 comment

1 min read

LW link

(turntrout.com)

My frustrations: AI doom

Dentosal

22 Nov 2025 14:59 UTC

2 points

0 comments

2 min read

LW link

Introspection in LLMs: A Proposal For How To Think About It, And Test For It

Christopher Ackerman

22 Nov 2025 14:52 UTC

23 points

4 comments

7 min read

LW link

AI Red Lines: A Research Agenda

Charbel-Raphaël

22 Nov 2025 8:41 UTC

30 points

1 comment

5 min read

LW link

Book Review: Wizard’s Hall

Screwtape

22 Nov 2025 7:38 UTC

96 points

4 comments

5 min read

LW link

Be Naughty

habryka

22 Nov 2025 6:35 UTC

99 points

11 comments

4 min read

LW link

Market Logic I

abramdemski

22 Nov 2025 6:01 UTC

36 points

2 comments

5 min read

LW link

The AI 2027 Report Is Not Backed Up by Evidence

Oscar Davies

22 Nov 2025 5:23 UTC

−17 points

9 comments

4 min read

LW link

LLM Systems for Literature-Based Scientific Discovery

Carly Turini

22 Nov 2025 4:48 UTC

1 point

0 comments

1 min read

LW link

Animal welfare concerns are dominated by post-ASI futures

RobertM

22 Nov 2025 4:08 UTC

28 points

1 comment

4 min read

LW link

Habitual mental motions might explain why people are content to get old and die

Ruby

22 Nov 2025 2:52 UTC

19 points

1 comment

7 min read

LW link

D&D.Sci Thanksgiving: the Festival Feast

aphyer

22 Nov 2025 2:26 UTC

41 points

15 comments

2 min read

LW link

Diplomacy during AI takeoff

Nikola Jurkovic

22 Nov 2025 2:12 UTC

18 points

3 comments

2 min read

LW link

(nikolajurkovic.substack.com)

Abstract advice to researchers tackling the difficult core problems of AGI alignment

TsviBT

22 Nov 2025 0:53 UTC

130 points

10 comments

8 min read

LW link

Easy Opportunity to Help Many Animals

Bentham's Bulldog

21 Nov 2025 23:03 UTC

10 points

0 comments

1 min read

LW link

Why Not Just Train For Interpretability?

johnswentworth

21 Nov 2025 22:08 UTC

56 points

12 comments

4 min read

LW link

Complaining about my inability to focus on uninteresting things

Dentosal

21 Nov 2025 20:34 UTC

5 points

3 comments

2 min read

LW link

Models not making it clear when they’re roleplaying seems like a fairly big issue

williawa

21 Nov 2025 20:23 UTC

16 points

3 comments

6 min read

LW link

Natural Emergent Misalignment from Reward Hacking

Algon

21 Nov 2025 20:20 UTC

12 points

0 comments

3 min read

LW link

(www.anthropic.com)

Natural emergent misalignment from reward hacking in production RL

evhub, Monte M, Benjamin Wright and Jonathan Uesato

21 Nov 2025 20:00 UTC

258 points

32 comments

9 min read

LW link