[Question] Colonialism in space: Does a collection of minds have exactly two attractors?

StanislavKrym May 27, 2025, 11:35 PM
3 points
5 comments, 1 min read, LW link

[Question] What are the best arguments you’ve seen for the Litany of Gendlin?

flowerfeatherfocus May 27, 2025, 9:19 PM
5 points
3 comments, 1 min read, LW link

What We Learned from Briefing 70+ Lawmakers on the Threat from AI

leticiagarcia May 27, 2025, 6:23 PM
452 points
14 comments, 16 min read, LW link
(substack.com)

My script for organizing OBNYC meetups

Orioth May 27, 2025, 6:14 PM
3 points
0 comments, 4 min read, LW link

Untrusted AIs can exploit feedback in control protocols

May 27, 2025, 4:41 PM
26 points
0 comments, 16 min read, LW link

Requiem for the hopes of a pre-AI world

Mitchell_Porter May 27, 2025, 2:47 PM
68 points
0 comments, 3 min read, LW link

The Best of All Possible Worlds

Jakub Growiec May 27, 2025, 1:16 PM
11 points
7 comments, 49 min read, LW link

Dating Roundup #5: Opening Day

Zvi May 27, 2025, 1:10 PM
26 points
8 comments, 27 min read, LW link
(thezvi.wordpress.com)

Season Recap of the Village: Agents raise $2,000

Shoshannah Tekofsky May 27, 2025, 12:34 PM
126 points
14 comments, 6 min read, LW link
(theaidigest.org)

Beware the Moral Homophone

ymeskhout May 27, 2025, 12:06 PM
63 points
4 comments, 9 min read, LW link
(www.ymeskhout.com)

Association taxes are collusion subsidies

KatjaGrace May 27, 2025, 6:50 AM
102 points
7 comments, 1 min read, LW link
(worldspiritsockpuppet.com)

Creating My Own Winter Solstice Celebration—Southern Hemisphere Edition

joshuamerriam May 27, 2025, 2:11 AM
5 points
0 comments, 2 min read, LW link

U.S. Government Seeks Input on National AI R&D Strategic Plan—Deadline May 29

mbrooks May 27, 2025, 1:57 AM
17 points
0 comments, 1 min read, LW link

All Rationalists hate & sabotage Strategy without having any awareness of it.

Oxidize May 26, 2025, 10:09 PM
−27 points
8 comments, 7 min read, LW link

Personal Ruminations on AI’s Missing Variable Problem

Thehumanproject.ai May 26, 2025, 9:11 PM
1 point
0 comments, 3 min read, LW link

Poetic Methods II: Rhyme as a Focusing Device

adamShimi May 26, 2025, 6:29 PM
24 points
1 comment, 17 min read, LW link
(formethods.substack.com)

Is Building Good Note-Taking Software an AGI-Complete Problem?

Thane Ruthenis May 26, 2025, 6:26 PM
25 points
13 comments, 7 min read, LW link

Principal-Agent Problems and the Structure of Governance

belos May 26, 2025, 6:23 PM
1 point
0 comments, 8 min read, LW link
(bestofagreatlot.substack.com)

[Question] Does the Universal Geometry of Embeddings paper have big implications for interpretability?

Evan R. Murphy May 26, 2025, 6:20 PM
42 points
3 comments, 1 min read, LW link

Socratic Persuasion: Giving Opinionated Yet Truth-Seeking Advice

Neel Nanda May 26, 2025, 5:38 PM
56 points
13 comments, 21 min read, LW link
(www.neelnanda.io)

[Beneath Psychology] Case study on chronic pain: First insights, and the remaining challenge

jimmy May 26, 2025, 5:29 PM
8 points
0 comments, 11 min read, LW link

An observation on self-play

jonrxu May 26, 2025, 5:22 PM
14 points
1 comment, 3 min read, LW link

New website analyzing AI companies’ model evals

Zach Stein-Perlman May 26, 2025, 4:00 PM
58 points
0 comments, 4 min read, LW link

New scorecard evaluating AI companies on safety

Zach Stein-Perlman May 26, 2025, 4:00 PM
72 points
8 comments, 1 min read, LW link

[Question] Asking for AI Safety Career Advice

infinibot27 May 26, 2025, 3:26 PM
3 points
1 comment, 1 min read, LW link

Nerve Blisters: A Stoic Response

Jonathan Moregård May 26, 2025, 3:07 PM
8 points
2 comments, 1 min read, LW link
(honestliving.substack.com)

On ‘On Caring’

atharva May 26, 2025, 1:39 PM
8 points
4 comments, 3 min read, LW link

Claude 4 You: The Quest for Mundane Utility

Zvi May 26, 2025, 1:01 PM
36 points
0 comments, 17 min read, LW link
(thezvi.wordpress.com)

Formalizing Embeddedness Failures in Universal Artificial Intelligence

Cole Wyeth May 26, 2025, 12:36 PM
39 points
0 comments, 1 min read, LW link
(arxiv.org)

Techies Wanted: How STEM Backgrounds Can Advance Safe AI Policy

Daniel_Eth May 26, 2025, 11:29 AM
16 points
0 comments, 29 min read, LW link

D&D.Sci: The Choosing Ones [Answerkey and Ruleset]

abstractapplic May 26, 2025, 9:43 AM
19 points
2 comments, 3 min read, LW link

The Sundog Alignment Theorem: A Proposal for Embodied Alignment via Indirect Inference

Malice May 26, 2025, 7:26 AM
−9 points
0 comments, 3 min read, LW link

Superposition Without Compression: Why Entangled Representations Are the Default

James Butterworth May 26, 2025, 5:26 AM
3 points
2 comments, 1 min read, LW link
(drive.google.com)

Seeking Feedback: Toy Model of Deceptive Alignment (Game Theory)

Alex Boche May 26, 2025, 5:23 AM
5 points
4 comments, 5 min read, LW link

Long-form data bottlenecks might stall AI progress for years

Michelle_Ma May 26, 2025, 4:36 AM
19 points
0 comments, 13 min read, LW link

Example of Splitting a PR

jefftk May 26, 2025, 2:20 AM
28 points
0 comments, 2 min read, LW link
(www.jefftk.com)

How I’m telling my friends about AI Safety

k64 May 25, 2025, 10:43 PM
1 point
7 comments, 7 min read, LW link

Good Writing

Adam Zerner May 25, 2025, 9:52 PM
11 points
0 comments, 2 min read, LW link
(paulgraham.com)

Consider buying voting shares

Hruss May 25, 2025, 6:01 PM
2 points
3 comments, 1 min read, LW link

[Question] Can you donate to AI advocacy?

k64 May 25, 2025, 5:54 PM
17 points
4 comments, 1 min read, LW link

Rant: the extreme wastefulness of high rent prices

Knight Lee May 25, 2025, 5:04 PM
−2 points
0 comments, 2 min read, LW link

Beyond Democracy: A System Where Citizens Vote with Their Taxes

Brendan Golledge May 25, 2025, 5:00 PM
−1 points
3 comments, 7 min read, LW link

Claude 4 You: Safety and Alignment

Zvi 25 May 2025 14:00 UTC
86 points
8 comments, 63 min read, LW link
(thezvi.wordpress.com)

Alignment Proposal: Adversarially Robust Augmentation and Distillation

25 May 2025 12:58 UTC
54 points
47 comments, 13 min read, LW link

An open job application to AI labs

Hruss 25 May 2025 12:57 UTC
15 points
0 comments, 1 min read, LW link

Meditations on Doge

Martin Sustrik 25 May 2025 12:00 UTC
129 points
44 comments, 9 min read, LW link
(250bpm.substack.com)

Case Studies in Simulators and Agents

25 May 2025 5:40 UTC
11 points
8 comments, 6 min read, LW link

On safety of being a moral patient of ASI

Yaroslav Granowski 24 May 2025 21:24 UTC
3 points
8 comments, 1 min read, LW link

We Need a Baseline for LLM-Aided Experiments

J Bostock 24 May 2025 20:52 UTC
11 points
1 comment, 1 min read, LW link

Lie Detectors. Technical solutions to the cooperation problem.

Window Frame 24 May 2025 20:05 UTC
6 points
0 comments, 10 min read, LW link