All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 222324 25 26 27 28 29 30

Why your sports car isn’t a racecar (tradeoffs everywhere)

Ruby22 Nov 2025 23:23 UTC

29 points

0 comments5 min readLW link

Assorted Thoughts on “Pivoting” to AI

Trevor Hill-Hand22 Nov 2025 21:17 UTC

12 points

1 comment4 min readLW link

OpenAI Locks Down San Francisco Offices Following Alleged Threat From Activist

Matrice Jacobine22 Nov 2025 19:33 UTC

40 points

0 comments4 min readLW link

(www.wired.com)

Sorry, I still think kidney donation makes no sense for an EA

nicholashalden22 Nov 2025 18:10 UTC

6 points

4 comments1 min readLW link

(substack.com)

Automatic alt text generation

TurnTrout22 Nov 2025 17:57 UTC

27 points

1 comment1 min readLW link

(turntrout.com)

My frustrations: AI doom

Dentosal22 Nov 2025 14:59 UTC

2 points

0 comments2 min readLW link

Introspection in LLMs: A Proposal For How To Think About It, And Test For It

Christopher Ackerman22 Nov 2025 14:52 UTC

23 points

4 comments7 min readLW link

AI Red Lines: A Research Agenda

Charbel-Raphaël22 Nov 2025 8:41 UTC

30 points

1 comment5 min readLW link

Book Review: Wizard’s Hall

Screwtape22 Nov 2025 7:38 UTC

96 points

4 comments5 min readLW link

Be Naughty

habryka22 Nov 2025 6:35 UTC

99 points

11 comments4 min readLW link

Market Logic I

abramdemski22 Nov 2025 6:01 UTC

36 points

2 comments5 min readLW link

The AI 2027 Report Is Not Backed Up by Evidence

Oscar Davies22 Nov 2025 5:23 UTC

−17 points

9 comments4 min readLW link

LLM Systems for Literature-Based Scientific Discovery

Carly Turini22 Nov 2025 4:48 UTC

1 point

0 comments1 min readLW link

Animal welfare concerns are dominated by post-ASI futures

RobertM22 Nov 2025 4:08 UTC

28 points

1 comment4 min readLW link

Habitual mental motions might explain why people are content to get old and die

Ruby22 Nov 2025 2:52 UTC

19 points

1 comment7 min readLW link

D&D.Sci Thanksgiving: the Festival Feast

aphyer22 Nov 2025 2:26 UTC

41 points

15 comments2 min readLW link

Diplomacy during AI takeoff

Nikola Jurkovic22 Nov 2025 2:12 UTC

18 points

3 comments2 min readLW link

(nikolajurkovic.substack.com)

Abstract advice to researchers tackling the difficult core problems of AGI alignment

TsviBT22 Nov 2025 0:53 UTC

130 points

10 comments8 min readLW link

Easy Opportunity to Help Many Animals

Bentham's Bulldog21 Nov 2025 23:03 UTC

10 points

0 comments1 min readLW link

Why Not Just Train For Interpretability?

johnswentworth21 Nov 2025 22:08 UTC

56 points

12 comments4 min readLW link

Complaining about my inability to focus on uninteresting things

Dentosal21 Nov 2025 20:34 UTC

5 points

3 comments2 min readLW link

Models not making it clear when they’re roleplaying seems like a fairly big issue

williawa21 Nov 2025 20:23 UTC

16 points

3 comments6 min readLW link

Natural Emergent Misalignment from Reward Hacking

Algon21 Nov 2025 20:20 UTC

12 points

0 comments3 min readLW link

(www.anthropic.com)

Natural emergent misalignment from reward hacking in production RL

evhub, Monte M, Benjamin Wright and Jonathan Uesato

21 Nov 2025 20:00 UTC

258 points

32 comments9 min readLW link

Eight Heuristics of Anti-Epistemology

Ben Pace21 Nov 2025 19:54 UTC

44 points

2 comments6 min readLW link

We won’t solve post-alignment problems by doing research

MichaelDickens21 Nov 2025 18:03 UTC

24 points

11 comments4 min readLW link

Can Artificial Intelligence Be Conscious?

Bentham's Bulldog21 Nov 2025 16:43 UTC

15 points

5 comments7 min readLW link

Gemini 3: Model Card and Safety Framework Report

Zvi21 Nov 2025 16:40 UTC

33 points

0 comments11 min readLW link

(thezvi.wordpress.com)

Lorxus Does Halfhaven: 11/15~11/21

Lorxus21 Nov 2025 16:07 UTC

7 points

0 comments1 min readLW link

(tiled-with-pentagons.blogspot.com)

EA Hotel Solstice

plex21 Nov 2025 15:13 UTC

8 points

0 comments1 min readLW link

Why Does Empathy Have an Off-Switch?

J Bostock21 Nov 2025 14:56 UTC

9 points

1 comment7 min readLW link

What Do We Tell the Humans? Errors, Hallucinations, and Lies in the AI Village

Shoshannah Tekofsky21 Nov 2025 14:19 UTC

56 points

0 comments9 min readLW link

URGENT @everyone—help us kill AI preemption (again) before this Friday

Wes R and Felix De Simone

21 Nov 2025 12:51 UTC

−1 points

0 comments1 min readLW link

Should I Apply to a 3.5% Acceptance-Rate Fellowship? A Simple EV Calculator

Tobias H21 Nov 2025 10:59 UTC

16 points

0 comments5 min readLW link

Towards Humanist Superintelligence

Chris_Leong21 Nov 2025 10:22 UTC

17 points

3 comments1 min readLW link

(microsoft.ai)

16 Writing Tips from Inkhaven

dreeves21 Nov 2025 7:49 UTC

13 points

1 comment2 min readLW link

Reading My Diary: 10 Years Since CFAR

Ben Pace21 Nov 2025 7:27 UTC

71 points

1 comment6 min readLW link

The Worrying Nature of Akrasia

Notelrac21 Nov 2025 7:00 UTC

2 points

0 comments4 min readLW link

10 Key Insights from the “Frontier AI Risk Monitoring Platform”

Weibing Wang21 Nov 2025 6:07 UTC

3 points

0 comments2 min readLW link

Contra Collisteru: You Get About One Carthage

Screwtape21 Nov 2025 5:33 UTC

36 points

2 comments5 min readLW link

Infinitesimally False

Adrià Garriga-alonso and abramdemski

21 Nov 2025 4:57 UTC

55 points

16 comments12 min readLW link

Preferences are confusing

RobertM21 Nov 2025 3:07 UTC

28 points

1 comment2 min readLW link

Can questions rigidly designate intentions?

Mason Broxham21 Nov 2025 2:00 UTC

1 point

0 comments5 min readLW link

Week 3: Adversarial Robustness

Ely Hahami21 Nov 2025 1:43 UTC

1 point

0 comments3 min readLW link

Informed Consent as the Sole Criterion for Medical Treatment

Character#273621 Nov 2025 1:39 UTC

7 points

2 comments4 min readLW link

Suicide Prevention Ought To Be Illegal

Character#273621 Nov 2025 1:39 UTC

−17 points

17 comments6 min readLW link

How you got RL’d into your idiosyncratic cognition

Ruby21 Nov 2025 1:06 UTC

16 points

6 comments6 min readLW link

PSA: For Chronic Infections, Check Teeth

Algon20 Nov 2025 23:14 UTC

15 points

2 comments1 min readLW link

[Paper] Output Supervision Can Obfuscate the CoT

jacob_drori, lukemarks, cloud and TurnTrout

20 Nov 2025 22:41 UTC

92 points

3 comments5 min readLW link

(arxiv.org)

The Boring Part of Bell Labs

Elizabeth20 Nov 2025 22:40 UTC

133 points

0 comments15 min readLW link

(acesounderglass.com)