[Question] Colonialism in space: Does a collection of minds have exactly two attractors?

StanislavKrym 27 May 2025 23:35 UTC

7 points

8 comments 1 min read LW link

[Question] What are the best arguments you’ve seen for the Litany of Gendlin?

flowerfeatherfocus 27 May 2025 21:19 UTC

7 points

8 comments 1 min read LW link

What We Learned from Briefing 70+ Lawmakers on the Threat from AI

leticiagarcia 27 May 2025 18:23 UTC

495 points

17 comments 16 min read LW link

(substack.com)

My script for organizing OBNYC meetups

Orioth 27 May 2025 18:14 UTC

3 points

0 comments 4 min read LW link

Untrusted AIs can exploit feedback in control protocols

Mia Hopman, BionicD0LPH1N and Tyler Tracy

27 May 2025 16:41 UTC

30 points

0 comments 16 min read LW link

Requiem for the hopes of a pre-AI world

Mitchell_Porter 27 May 2025 14:47 UTC

97 points

0 comments 3 min read LW link

The Best of All Possible Worlds

Jakub Growiec 27 May 2025 13:16 UTC

11 points

7 comments 49 min read LW link

Dating Roundup #5: Opening Day

Zvi 27 May 2025 13:10 UTC

27 points

8 comments 27 min read LW link

(thezvi.wordpress.com)

Season Recap of the Village: Agents raise $2,000

Shoshannah Tekofsky 27 May 2025 12:34 UTC

135 points

14 comments 6 min read LW link

(theaidigest.org)

Beware the Moral Homophone

ymeskhout 27 May 2025 12:06 UTC

69 points

4 comments 9 min read LW link

(www.ymeskhout.com)

Association taxes are collusion subsidies

KatjaGrace 27 May 2025 6:50 UTC

106 points

7 comments 1 min read LW link

(worldspiritsockpuppet.com)

Creating My Own Winter Solstice Celebration—Southern Hemisphere Edition

joshuamerriam 27 May 2025 2:11 UTC

8 points

0 comments 2 min read LW link

U.S. Government Seeks Input on National AI R&D Strategic Plan—Deadline May 29

mbrooks 27 May 2025 1:57 UTC

17 points

0 comments 1 min read LW link

All Rationalists hate & sabotage Strategy without having any awareness of it.

Oxidize 26 May 2025 22:09 UTC

−27 points

8 comments 7 min read LW link

Personal Ruminations on AI’s Missing Variable Problem

Thehumanproject.ai 26 May 2025 21:11 UTC

1 point

0 comments 3 min read LW link

Poetic Methods II: Rhyme as a Focusing Device

adamShimi 26 May 2025 18:29 UTC

24 points

1 comment 17 min read LW link

(formethods.substack.com)

Is Building Good Note-Taking Software an AGI-Complete Problem?

Thane Ruthenis 26 May 2025 18:26 UTC

27 points

13 comments 7 min read LW link

Principal-Agent Problems and the Structure of Governance

belos 26 May 2025 18:23 UTC

1 point

0 comments 8 min read LW link

(bestofagreatlot.substack.com)

[Question] Does the Universal Geometry of Embeddings paper have big implications for interpretability?

Evan R. Murphy 26 May 2025 18:20 UTC

43 points

6 comments 1 min read LW link

Socratic Persuasion: Giving Opinionated Yet Truth-Seeking Advice

Neel Nanda 26 May 2025 17:38 UTC

61 points

14 comments 21 min read LW link

(www.neelnanda.io)

[Beneath Psychology] Case study on chronic pain: First insights, and the remaining challenge

jimmy 26 May 2025 17:29 UTC

14 points

1 comment 11 min read LW link

An observation on self-play

jonrxu 26 May 2025 17:22 UTC

15 points

1 comment 3 min read LW link

New website analyzing AI companies’ model evals

Zach Stein-Perlman 26 May 2025 16:00 UTC

58 points

0 comments 4 min read LW link

New scorecard evaluating AI companies on safety

Zach Stein-Perlman 26 May 2025 16:00 UTC

72 points

8 comments 1 min read LW link

[Question] Asking for AI Safety Career Advice

infinibot27 26 May 2025 15:26 UTC

3 points

1 comment 1 min read LW link

Nerve Blisters: A Stoic Response

Jonathan Moregård 26 May 2025 15:07 UTC

8 points

2 comments 1 min read LW link

(honestliving.substack.com)

On ‘On Caring’

atharva 26 May 2025 13:39 UTC

9 points

4 comments 3 min read LW link

Claude 4 You: The Quest for Mundane Utility

Zvi 26 May 2025 13:01 UTC

36 points

0 comments 17 min read LW link

(thezvi.wordpress.com)

Formalizing Embeddedness Failures in Universal Artificial Intelligence

Cole Wyeth 26 May 2025 12:36 UTC

39 points

0 comments 1 min read LW link

(arxiv.org)

Techies Wanted: How STEM Backgrounds Can Advance Safe AI Policy

Daniel_Eth 26 May 2025 11:29 UTC

16 points

0 comments 29 min read LW link

D&D.Sci: The Choosing Ones [Answerkey and Ruleset]

abstractapplic 26 May 2025 9:43 UTC

19 points

2 comments 3 min read LW link

The Sundog Alignment Theorem: A Proposal for Embodied Alignment via Indirect Inference

Malice 26 May 2025 7:26 UTC

−9 points

0 comments 3 min read LW link

Superposition Without Compression: Why Entangled Representations Are the Default

James Butterworth 26 May 2025 5:26 UTC

3 points

2 comments 1 min read LW link

(drive.google.com)

Long-form data bottlenecks might stall AI progress for years

Michelle_Ma 26 May 2025 4:36 UTC

21 points

0 comments 13 min read LW link

Example of Splitting a PR

jefftk 26 May 2025 2:20 UTC

28 points

0 comments 2 min read LW link

(www.jefftk.com)

How I’m telling my friends about AI Safety

k64 25 May 2025 22:43 UTC

1 point

7 comments 7 min read LW link

Good Writing

Adam Zerner 25 May 2025 21:52 UTC

11 points

0 comments 2 min read LW link

(paulgraham.com)

Consider buying voting shares

Hruss 25 May 2025 18:01 UTC

2 points

3 comments 1 min read LW link

[Question] Can you donate to AI advocacy?

k64 25 May 2025 17:54 UTC

17 points

4 comments 1 min read LW link

Rant: the extreme wastefulness of high rent prices

Knight Lee 25 May 2025 17:04 UTC

−2 points

0 comments 2 min read LW link

Beyond Democracy: A System Where Citizens Vote with Their Taxes

Brendan Golledge 25 May 2025 17:00 UTC

−1 point

3 comments 7 min read LW link

Claude 4 You: Safety and Alignment

Zvi 25 May 2025 14:00 UTC

86 points

8 comments 63 min read LW link

(thezvi.wordpress.com)

Alignment Proposal: Adversarially Robust Augmentation and Distillation

Cole Wyeth and abramdemski

25 May 2025 12:58 UTC

56 points

47 comments 13 min read LW link

An open job application to AI labs

Hruss 25 May 2025 12:57 UTC

17 points

0 comments 1 min read LW link

Meditations on Doge

Martin Sustrik 25 May 2025 12:00 UTC

131 points

44 comments 9 min read LW link

(250bpm.substack.com)

Case Studies in Simulators and Agents

WillPetillo, Sean Herrington, Spencer Ames, Adebayo Mubarak and Can Narin

25 May 2025 5:40 UTC

15 points

8 comments 6 min read LW link

On safety of being a moral patient of ASI

Yaroslav Granowski 24 May 2025 21:24 UTC

3 points

8 comments 1 min read LW link

We Need a Baseline for LLM-Aided Experiments

J Bostock 24 May 2025 20:52 UTC

11 points

1 comment 1 min read LW link

Lie Detectors. Technical solutions to the cooperation problem.

Window Frame 24 May 2025 20:05 UTC

7 points

0 comments 10 min read LW link

It’s hard to make scheming evals look realistic for LLMs

Igor Ivanov and Danil Kadochnikov

24 May 2025 19:17 UTC

150 points

29 comments 5 min read LW link