[Question] Colonialism in space: Does a collection of minds have exactly two attractors?

StanislavKrym · 27 May 2025 23:35 UTC
5 points
8 comments · 1 min read · LW link

[Question] What are the best arguments you’ve seen for the Litany of Gendlin?

flowerfeatherfocus · 27 May 2025 21:19 UTC
7 points
8 comments · 1 min read · LW link

What We Learned from Briefing 70+ Lawmakers on the Threat from AI

leticiagarcia · 27 May 2025 18:23 UTC
487 points
17 comments · 16 min read · LW link
(substack.com)

My script for organizing OBNYC meetups

Orioth · 27 May 2025 18:14 UTC
3 points
0 comments · 4 min read · LW link

Untrusted AIs can exploit feedback in control protocols

27 May 2025 16:41 UTC
30 points
0 comments · 16 min read · LW link

Requiem for the hopes of a pre-AI world

Mitchell_Porter · 27 May 2025 14:47 UTC
73 points
0 comments · 3 min read · LW link

The Best of All Possible Worlds

Jakub Growiec · 27 May 2025 13:16 UTC
11 points
7 comments · 49 min read · LW link

Dating Roundup #5: Opening Day

Zvi · 27 May 2025 13:10 UTC
27 points
8 comments · 27 min read · LW link
(thezvi.wordpress.com)

Season Recap of the Village: Agents raise $2,000

Shoshannah Tekofsky · 27 May 2025 12:34 UTC
135 points
14 comments · 6 min read · LW link
(theaidigest.org)

Beware the Moral Homophone

ymeskhout · 27 May 2025 12:06 UTC
64 points
4 comments · 9 min read · LW link
(www.ymeskhout.com)

Association taxes are collusion subsidies

KatjaGrace · 27 May 2025 6:50 UTC
105 points
7 comments · 1 min read · LW link
(worldspiritsockpuppet.com)

Creating My Own Winter Solstice Celebration—Southern Hemisphere Edition

joshuamerriam · 27 May 2025 2:11 UTC
7 points
0 comments · 2 min read · LW link

U.S. Government Seeks Input on National AI R&D Strategic Plan—Deadline May 29

mbrooks · 27 May 2025 1:57 UTC
17 points
0 comments · 1 min read · LW link

All Rationalists hate & sabotage Strategy without having any awareness of it.

Oxidize · 26 May 2025 22:09 UTC
−27 points
8 comments · 7 min read · LW link

Personal Ruminations on AI’s Missing Variable Problem

Thehumanproject.ai · 26 May 2025 21:11 UTC
1 point
0 comments · 3 min read · LW link

Poetic Methods II: Rhyme as a Focusing Device

adamShimi · 26 May 2025 18:29 UTC
24 points
1 comment · 17 min read · LW link
(formethods.substack.com)

Is Building Good Note-Taking Software an AGI-Complete Problem?

Thane Ruthenis · 26 May 2025 18:26 UTC
26 points
13 comments · 7 min read · LW link

Principal-Agent Problems and the Structure of Governance

belos · 26 May 2025 18:23 UTC
1 point
0 comments · 8 min read · LW link
(bestofagreatlot.substack.com)

[Question] Does the Universal Geometry of Embeddings paper have big implications for interpretability?

Evan R. Murphy · 26 May 2025 18:20 UTC
43 points
6 comments · 1 min read · LW link

Socratic Persuasion: Giving Opinionated Yet Truth-Seeking Advice

Neel Nanda · 26 May 2025 17:38 UTC
61 points
14 comments · 21 min read · LW link
(www.neelnanda.io)

[Beneath Psychology] Case study on chronic pain: First insights, and the remaining challenge

jimmy · 26 May 2025 17:29 UTC
12 points
0 comments · 11 min read · LW link

An observation on self-play

jonrxu · 26 May 2025 17:22 UTC
15 points
1 comment · 3 min read · LW link

New website analyzing AI companies’ model evals

Zach Stein-Perlman · 26 May 2025 16:00 UTC
58 points
0 comments · 4 min read · LW link

New scorecard evaluating AI companies on safety

Zach Stein-Perlman · 26 May 2025 16:00 UTC
72 points
8 comments · 1 min read · LW link

[Question] Asking for AI Safety Career Advice

infinibot27 · 26 May 2025 15:26 UTC
3 points
1 comment · 1 min read · LW link

Nerve Blisters: A Stoic Response

Jonathan Moregård · 26 May 2025 15:07 UTC
8 points
2 comments · 1 min read · LW link
(honestliving.substack.com)

On ‘On Caring’

atharva · 26 May 2025 13:39 UTC
9 points
4 comments · 3 min read · LW link

Claude 4 You: The Quest for Mundane Utility

Zvi · 26 May 2025 13:01 UTC
36 points
0 comments · 17 min read · LW link
(thezvi.wordpress.com)

Formalizing Embeddedness Failures in Universal Artificial Intelligence

Cole Wyeth · 26 May 2025 12:36 UTC
39 points
0 comments · 1 min read · LW link
(arxiv.org)

Techies Wanted: How STEM Backgrounds Can Advance Safe AI Policy

Daniel_Eth · 26 May 2025 11:29 UTC
16 points
0 comments · 29 min read · LW link

D&D.Sci: The Choosing Ones [Answerkey and Ruleset]

abstractapplic · 26 May 2025 9:43 UTC
19 points
2 comments · 3 min read · LW link

The Sundog Alignment Theorem: A Proposal for Embodied Alignment via Indirect Inference

Malice · 26 May 2025 7:26 UTC
−9 points
0 comments · 3 min read · LW link

Superposition Without Compression: Why Entangled Representations Are the Default

James Butterworth · 26 May 2025 5:26 UTC
3 points
2 comments · 1 min read · LW link
(drive.google.com)

Seeking Feedback: Toy Model of Deceptive Alignment (Game Theory)

Alex Boche · 26 May 2025 5:23 UTC
5 points
6 comments · 5 min read · LW link

Long-form data bottlenecks might stall AI progress for years

Michelle_Ma · 26 May 2025 4:36 UTC
21 points
0 comments · 13 min read · LW link

Example of Splitting a PR

jefftk · 26 May 2025 2:20 UTC
28 points
0 comments · 2 min read · LW link
(www.jefftk.com)

How I’m telling my friends about AI Safety

k64 · 25 May 2025 22:43 UTC
1 point
7 comments · 7 min read · LW link

Good Writing

Adam Zerner · 25 May 2025 21:52 UTC
11 points
0 comments · 2 min read · LW link
(paulgraham.com)

Consider buying voting shares

Hruss · 25 May 2025 18:01 UTC
2 points
3 comments · 1 min read · LW link

[Question] Can you donate to AI advocacy?

k64 · 25 May 2025 17:54 UTC
17 points
4 comments · 1 min read · LW link

Rant: the extreme wastefulness of high rent prices

Knight Lee · 25 May 2025 17:04 UTC
−2 points
0 comments · 2 min read · LW link

Beyond Democracy: A System Where Citizens Vote with Their Taxes

Brendan Golledge · 25 May 2025 17:00 UTC
−1 points
3 comments · 7 min read · LW link

Claude 4 You: Safety and Alignment

Zvi · 25 May 2025 14:00 UTC
86 points
8 comments · 63 min read · LW link
(thezvi.wordpress.com)

Alignment Proposal: Adversarially Robust Augmentation and Distillation

25 May 2025 12:58 UTC
56 points
47 comments · 13 min read · LW link

An open job application to AI labs

Hruss · 25 May 2025 12:57 UTC
17 points
0 comments · 1 min read · LW link

Meditations on Doge

Martin Sustrik · 25 May 2025 12:00 UTC
131 points
44 comments · 9 min read · LW link
(250bpm.substack.com)

Case Studies in Simulators and Agents

25 May 2025 5:40 UTC
12 points
8 comments · 6 min read · LW link

On safety of being a moral patient of ASI

Yaroslav Granowski · 24 May 2025 21:24 UTC
3 points
8 comments · 1 min read · LW link

We Need a Baseline for LLM-Aided Experiments

J Bostock · 24 May 2025 20:52 UTC
11 points
1 comment · 1 min read · LW link

Lie Detectors. Technical solutions to the cooperation problem.

Window Frame · 24 May 2025 20:05 UTC
6 points
0 comments · 10 min read · LW link