Mechanize Work’s essay on Unfalsifiable Doom

StanislavKrym 30 Dec 2025 22:57 UTC

10 points

0 comments 15 min read LW link

(www.mechanize.work)

The 7 Types Of Advice (And 3 Common Failure Modes)

Linch 30 Dec 2025 21:55 UTC

27 points

3 comments 7 min read LW link

(inchpin.substack.com)

Don’t Sell Stock to Donate

jefftk 30 Dec 2025 19:50 UTC

113 points

13 comments 2 min read LW link

(www.jefftk.com)

The origin of rot

Abhishaike Mahajan 30 Dec 2025 17:51 UTC

33 points

4 comments 5 min read LW link

(www.owlposting.com)

[Advanced Intro to AI Alignment] 1. Goal-Directed Reasoning and Why It Matters

Towards_Keeperhood 30 Dec 2025 15:48 UTC

12 points

4 comments 10 min read LW link

Dating Roundup #9: Signals and Selection

Zvi 30 Dec 2025 12:40 UTC

38 points

3 comments 13 min read LW link

(thezvi.wordpress.com)

Many can write faster asm than the compiler, yet don’t. Why?

faul_sname 30 Dec 2025 8:40 UTC

77 points

18 comments 4 min read LW link

Exceptionally Gifted Children

John Boyle 30 Dec 2025 6:28 UTC

24 points

3 comments 1 min read LW link

Chromosome identification methods

TsviBT 30 Dec 2025 6:02 UTC

38 points

4 comments 5 min read LW link

CFAR’s todo list re: our workshops

AnnaSalamon 30 Dec 2025 5:16 UTC

63 points

7 comments 3 min read LW link

More details on CFAR’s new workshops

AnnaSalamon 30 Dec 2025 5:12 UTC

61 points

2 comments 4 min read LW link

What’s going on at CFAR? (Updates and Fundraiser)

AnnaSalamon 30 Dec 2025 5:00 UTC

110 points

39 comments 30 min read LW link

End-of year donation taxes 101

GradientDissenter 30 Dec 2025 2:16 UTC

35 points

1 comment 3 min read LW link

Boston Solstice 2025 Retrospective

jefftk 30 Dec 2025 1:10 UTC

13 points

2 comments 5 min read LW link

(www.jefftk.com)

[Question] Does the USG have access to smarter models than the labs’?

jacob_drori 29 Dec 2025 22:51 UTC

9 points

5 comments 1 min read LW link

24% of the US public is now aware of AI xrisk

otto.barten 29 Dec 2025 22:03 UTC

30 points

3 comments 1 min read LW link

Steering RL Training: Benchmarking Interventions Against Reward Hacking

ariaw, Josh Engels and Neel Nanda

29 Dec 2025 21:55 UTC

72 points

11 comments 19 min read LW link

Awareness Jailbreaking: Revealing True Alignment in Evaluation-Aware Models

Maheep Chaudhary 29 Dec 2025 21:29 UTC

11 points

0 comments 4 min read LW link

December 2025 Links

nomagicpill 29 Dec 2025 20:20 UTC

8 points

0 comments 7 min read LW link

(nomagicpill.substack.com)

The Techno-Humanist Manifesto, wrapup and publishing announcement

jasoncrawford 29 Dec 2025 18:51 UTC

13 points

1 comment 1 min read LW link

(newsletter.rootsofprogress.org)

Unpacking Jonah Wilberg’s Goddess of Everything Else

StanislavKrym 29 Dec 2025 18:25 UTC

6 points

2 comments 4 min read LW link

[Book Review] • → 🚹 → •

artdeco 29 Dec 2025 17:50 UTC

26 points

5 comments 3 min read LW link

How To Create A Lsusr Golem

M_Chouchani 29 Dec 2025 17:50 UTC

5 points

0 comments 2 min read LW link

Dating Roundup #8: Tactics

Zvi 29 Dec 2025 16:40 UTC

61 points

2 comments 17 min read LW link

(thezvi.wordpress.com)

Ping pong computation in superposition

Alex Gibson 29 Dec 2025 16:31 UTC

13 points

0 comments 3 min read LW link

The x-risk case for exercise: to have the most impact, the world needs you at your best

KatWoods 29 Dec 2025 15:37 UTC

16 points

1 comment 1 min read LW link

Bot Alexander on Hot Zombies and AI Adolescents

future_detective 29 Dec 2025 14:52 UTC

−8 points

11 comments 25 min read LW link

Defeating Moloch: The view from Evolutionary Game Theory

Jonah Wilberg 29 Dec 2025 14:37 UTC

24 points

3 comments 8 min read LW link

PrincInt (PIBBSS) Opportunities: Summer Fellowship, Postdoc, and Ops Role (Deadlines in January)

DusanDNesic 29 Dec 2025 12:12 UTC

8 points

0 comments 1 min read LW link

The Weakest Model in the Selector

Alice Blair 29 Dec 2025 6:55 UTC

13 points

6 comments 1 min read LW link

Re: “A Brief Rant on the Future of Interaction Design”

Raemon 29 Dec 2025 6:35 UTC

56 points

3 comments 5 min read LW link

(worrydream.com)

Magic Words and Performative Utterances

Screwtape 29 Dec 2025 6:21 UTC

30 points

4 comments 4 min read LW link

The pace of progress, 4 years later

Veedrac 29 Dec 2025 4:16 UTC

25 points

2 comments 6 min read LW link

The CIA Poisoned My Dog: Two Stories About Paranoid Delusions and Damage Control

River 29 Dec 2025 3:59 UTC

125 points

2 comments 5 min read LW link

How to never make a bad decision

Wes R 28 Dec 2025 23:21 UTC

−4 points

0 comments 3 min read LW link

Research agenda for training aligned AIs using concave utility functions following the principles of homeostasis and diminishing returns

Roland Pihlakas 28 Dec 2025 21:53 UTC

14 points

0 comments 8 min read LW link

Training Matching Pursuit SAEs on LLMs

chanind 28 Dec 2025 18:57 UTC

19 points

2 comments 7 min read LW link

Do LLMs Condition Safety Behaviour on Dialect? Preliminary Evidence

Aakash Rana 28 Dec 2025 18:21 UTC

7 points

2 comments 5 min read LW link

Meditations on Suffering

MeditationsOnShrimp 28 Dec 2025 17:39 UTC

−1 points

0 comments 2 min read LW link

November 2025 Links

nomagicpill 28 Dec 2025 15:51 UTC

19 points

2 comments 7 min read LW link

(nomagicpill.substack.com)

Reviews I: Everyone’s Responsibility

nomagicpill 28 Dec 2025 15:48 UTC

2 points

0 comments 4 min read LW link

(nomagicpill.substack.com)

Introspection via localization

Victor Godet 28 Dec 2025 14:26 UTC

36 points

8 comments 3 min read LW link

Crystals in NNs: Technical Companion Piece

Jonas Hallgren 28 Dec 2025 10:44 UTC

24 points

5 comments 15 min read LW link

Have You Tried Thinking About It As Crystals?

Jonas Hallgren 28 Dec 2025 10:44 UTC

77 points

12 comments 10 min read LW link

Alignment Is Not One Problem: A 3D Map of AI Risk

Anurag 28 Dec 2025 8:44 UTC

3 points

0 comments 14 min read LW link

Orpheus’ Basilisk

pulwat 28 Dec 2025 0:43 UTC

22 points

1 comment 2 min read LW link

A Conflict Between AI Alignment and Philosophical Competence

Wei Dai 27 Dec 2025 21:32 UTC

70 points

14 comments 2 min read LW link

Glucose Supplementation for Sustained Stimulant Cognition

Johannes C. Mayer 27 Dec 2025 19:58 UTC

34 points

13 comments 1 min read LW link

A Brief Proof That You Are Every Conscious Thing

Jason R 27 Dec 2025 17:16 UTC

−16 points

15 comments 3 min read LW link

Introducing the XLab AI Security Guide

Zephaniah Roe, jcksanderson and Julian H

27 Dec 2025 16:50 UTC

19 points

1 comment 5 min read LW link