Lorxus Favors: An Experiment in Self-Backed Giftlike Macroeconomics (+ Extra Bits)

Lorxus

12 Nov 2025 23:02 UTC

7 points

0 comments, 8 min read

(tiled-with-pentagons.blogspot.com)

A Timeless Universe Viewed From the Inside

0xA

12 Nov 2025 22:32 UTC

1 point

0 comments, 3 min read

Please, Don’t Roll Your Own Metaethics

Wei Dai

12 Nov 2025 22:17 UTC

153 points

68 comments, 2 min read

A bad review != a bad book

Algon

12 Nov 2025 22:05 UTC

9 points

3 comments, 1 min read

The Pope Offers Wisdom

Zvi

12 Nov 2025 21:50 UTC

51 points

3 comments, 8 min read

(thezvi.wordpress.com)

Why Truth First?

johnswentworth

12 Nov 2025 21:45 UTC

51 points

6 comments, 6 min read

Social drives 2: “Approval Reward”, from norm-enforcement to status-seeking

Steven Byrnes

12 Nov 2025 20:40 UTC

42 points

9 comments, 17 min read

OpenAI Releases GPT 5.1

anaguma

12 Nov 2025 20:33 UTC

13 points

1 comment, 1 min read

(openai.com)

[Question] Is SGD capabilities research positive?

Brendan Long

12 Nov 2025 20:32 UTC

7 points

1 comment, 1 min read

Bitcoin Halvings and the Trisolaran Mistake: When External Actors Masquerade as Natural Laws

Mi

12 Nov 2025 20:30 UTC

12 points

0 comments, 1 min read

Lighthaven-ish Ticket Strategy: Three Pillars of FOMO

JohnofCharleston

12 Nov 2025 20:10 UTC

59 points

0 comments, 5 min read

Personal Account: To the Muck and the Mire

soycarts

12 Nov 2025 19:38 UTC

2 points

0 comments, 1 min read

We live in the luckiest timeline

beyarkay (Boyd Kane)

12 Nov 2025 18:59 UTC

2 points

6 comments, 5 min read

(boydkane.com)

AI for Safety & Science Nodes in Berlin & the Bay Area

Allison Duettmann

12 Nov 2025 18:49 UTC

6 points

0 comments, 2 min read

Reflections on being Sorted

Gordon Seidoh Worley

12 Nov 2025 17:40 UTC

23 points

0 comments, 9 min read

(www.uncertainupdates.com)

Lorxus Does Halfhaven: 11/01~11/07

Lorxus

12 Nov 2025 16:43 UTC

9 points

0 comments, 2 min read

(tiled-with-pentagons.blogspot.com)

Undissolvable Problems: things that still confuse me

Yair Halberstadt

12 Nov 2025 16:30 UTC

26 points

22 comments, 2 min read

Introducing faruvc.org

jefftk

12 Nov 2025 16:00 UTC

47 points

10 comments, 1 min read

(www.jefftk.com)

Warning Aliens About the Dangerous AI We Might Create

James_Miller and avturchin

12 Nov 2025 15:26 UTC

91 points

25 comments, 5 min read

9+ weeks of mentored AI safety research in London – Pivotal Research Fellowship

Tobias H

12 Nov 2025 15:21 UTC

9 points

0 comments, 2 min read

I Read Red Heart and I Heart It

Taylor G. Lunt

12 Nov 2025 14:54 UTC

38 points

16 comments, 2 min read

Miscellaneous observations about board games

Dentosal

12 Nov 2025 12:49 UTC

4 points

0 comments, 2 min read

Why to Commit to a Writing and Publishing Schedule

dreeves

12 Nov 2025 7:35 UTC

10 points

0 comments, 2 min read

5 Things I Learned After 10 Days of Inkhaven

Ben Pace

12 Nov 2025 7:20 UTC

107 points

5 comments, 3 min read

Do not hand off what you cannot pick up

habryka

12 Nov 2025 6:32 UTC

144 points

24 comments, 4 min read

Better than Baseline

Screwtape

12 Nov 2025 6:30 UTC

24 points

1 comment, 4 min read

How human-like do safe AI motivations need to be?

Joe Carlsmith

12 Nov 2025 5:32 UTC

27 points

9 comments, 52 min read

Teleosemantics & Swampman

abramdemski

12 Nov 2025 5:27 UTC

26 points

6 comments, 5 min read

Response to “Taking AI Welfare Seriously”: The Indirect Approach to Moral Patienthood

Juan Cadile

12 Nov 2025 4:43 UTC

12 points

0 comments, 2 min read

How I Learned That I Don’t Feel Companionate Love

johnswentworth

12 Nov 2025 4:18 UTC

115 points

32 comments, 4 min read

Conceptual reasoning dataset v0.1 available (AI for AI safety/AI for philosophy)

Chi Nguyen, Emery Cooper and Caspar Oesterheld

12 Nov 2025 1:12 UTC

19 points

0 comments, 3 min read

Fairly Breaking Ties Without Fair Coins

Brendan Long

11 Nov 2025 21:48 UTC

11 points

10 comments, 4 min read

(www.brendanlong.com)

Kimi K2 Thinking

Zvi

11 Nov 2025 21:10 UTC

47 points

0 comments, 5 min read

(thezvi.wordpress.com)

Not-A-Book Review: The Attractive Man (Dating Coach Service)

25Hour

11 Nov 2025 20:03 UTC

15 points

0 comments, 1 min read

(lifeimprovementschemes.substack.com)

Don’t Get One-Shotted

Jordan Rubin

11 Nov 2025 17:07 UTC

2 points

2 comments, 6 min read

(jordanmrubin.substack.com)

Learnings from the Zurich AI Safety Day

MariusWenk, Al-Hussein Saqr and bluedotimpact

11 Nov 2025 17:00 UTC

13 points

0 comments, 6 min read

Steering Language Models with Weight Arithmetic

Fabien Roger and constanzafierro

11 Nov 2025 16:30 UTC

88 points

6 comments, 5 min read

Announcing the Society of Teen Scientists

rogersbacon

11 Nov 2025 16:08 UTC

8 points

0 comments, 1 min read

What is Happening in AI Governance?

Alexander Müller and Thomas Vassil Brcic

11 Nov 2025 15:59 UTC

6 points

0 comments, 5 min read

Human Agency at Stake

Alexander Müller and senyakk

11 Nov 2025 15:57 UTC

8 points

0 comments, 6 min read

Omniscience one bit at a time: Chapter 3

Dentosal

11 Nov 2025 13:34 UTC

2 points

0 comments, 2 min read

Evolution’s Alignment Solution: Why Burnout Prevents Monsters

Elias_Kunnas

11 Nov 2025 13:32 UTC

9 points

0 comments, 6 min read

Thick practices for AI tools

Alexandre Variengien

11 Nov 2025 13:13 UTC

19 points

2 comments, 20 min read

(alexandrevariengien.com)

The problem of graceful deference

TsviBT

11 Nov 2025 8:17 UTC

108 points

41 comments, 4 min read

See Your Word Count While You Write

dreeves

11 Nov 2025 8:02 UTC

7 points

3 comments, 2 min read

On Stance

Screwtape

11 Nov 2025 7:50 UTC

24 points

5 comments, 6 min read

Breaking the Hedonic Rubber Band

Ben Pace

11 Nov 2025 7:00 UTC

20 points

4 comments, 4 min read

Rejecting “Goodness” Does Not Mean Hammering The Defect Button

johnswentworth

11 Nov 2025 6:50 UTC

25 points

6 comments, 2 min read

Strengthening Red Teams: A Modular Scaffold for Control Evaluations

Chloe Loughridge

11 Nov 2025 6:20 UTC

7 points

0 comments, 1 min read

(alignment.anthropic.com)

On the Normativity of Debate: A Discussion With Said Achmiz

Zack_M_Davis

11 Nov 2025 5:49 UTC

21 points

1 comment, 22 min read