Ab­strac­tion Boundaries and Bub­bles of Legibility

Adam Chlipala2 Jun 2026 23:54 UTC
1 point
0 comments9 min readLW link

Should AI Safety Re­searchers Ex­per­i­ment with Au­to­mated Research

Ephraiem Sarabamoun2 Jun 2026 23:18 UTC
1 point
0 comments1 min readLW link

My fa­vorite de­pic­tion of utopia

Caleb Biddulph2 Jun 2026 23:15 UTC
189 points
20 comments33 min readLW link
(docs.google.com)

The Ori­gin of Uncertainty

Gordon Seidoh Worley2 Jun 2026 18:20 UTC
13 points
2 comments2 min readLW link
(www.uncertainupdates.com)

LURE: Align­ment Eval­u­a­tions to Re­duce Eval­u­a­tion Awareness

2 Jun 2026 18:20 UTC
26 points
5 comments5 min readLW link

Why Even Ex­perts Don’t Know What to Do About AI Risk

2 Jun 2026 17:31 UTC
78 points
22 comments2 min readLW link

Where does the race to au­to­mate AI re­search end?

Simon Lermen2 Jun 2026 17:21 UTC
16 points
0 comments1 min readLW link
(simonlermen.substack.com)

A Town Without Children

SeñorDingDong2 Jun 2026 16:35 UTC
35 points
7 comments4 min readLW link

An­nounc­ing the ARC White-Box Es­ti­ma­tion Challenge

2 Jun 2026 16:20 UTC
165 points
15 comments3 min readLW link
(www.alignment.org)

Agent Foun­da­tions Re­minds Me of Con­ti­nen­tal Philosophy

IanWS2 Jun 2026 14:34 UTC
106 points
15 comments5 min readLW link
(write.ianwsperber.com)

Claude Opus 4.8: Ca­pa­bil­ities and Reactions

Zvi2 Jun 2026 14:10 UTC
38 points
2 comments31 min readLW link
(thezvi.wordpress.com)

Why we’re launch­ing the Fron­tier Biodefense Fellowship

Tobias H2 Jun 2026 9:06 UTC
8 points
0 comments4 min readLW link

Wood Screws and the Meth­ods of Rationality

quanticle2 Jun 2026 7:49 UTC
12 points
7 comments4 min readLW link

Tak­ing the Train­ing Wheels Off: Align­ing LLMs with­out Personas

Matthew Khoriaty2 Jun 2026 6:29 UTC
23 points
16 comments3 min readLW link

Com­pute Ver­ifi­ca­tion on Short Timelines

skunnavakkam2 Jun 2026 3:31 UTC
13 points
0 comments2 min readLW link

Test­ing Best-Effort Solar

jefftk2 Jun 2026 3:00 UTC
16 points
0 comments2 min readLW link
(www.jefftk.com)

May 2026 Links

nomagicpill2 Jun 2026 1:42 UTC
8 points
0 comments4 min readLW link

% Bureaucracy

PossiblyElaine2 Jun 2026 0:36 UTC
11 points
1 comment5 min readLW link
(possiblyelaine.substack.com)

Tech I’m skep­ti­cal of and why

harsimony1 Jun 2026 22:54 UTC
46 points
24 comments24 min readLW link
(splittinginfinity.substack.com)

Cri­tique of cur­rent AI safety bug bounty programs

clickyquack1 Jun 2026 21:26 UTC
7 points
0 comments7 min readLW link

[Linkpost] Pre­fix­ing names with ‘se­cure_’ makes agents write more se­cure code

Jack1 Jun 2026 21:20 UTC
14 points
1 comment1 min readLW link
(antimemeticai.com)

Can LLMs even teach? Ex­plor­ing the Teacher Axis

Vidya Ganga1 Jun 2026 21:16 UTC
15 points
0 comments15 min readLW link

Mon­i­tor­ing com­puter-use agents in OSWorld-control

Chris Harig1 Jun 2026 21:08 UTC
8 points
0 comments5 min readLW link

Align­ment to What?

Hagen B1 Jun 2026 21:08 UTC
2 points
6 comments12 min readLW link

Pop­pe­ri­ans, Bayesi­ans and Ramseyians

Ramseyian1 Jun 2026 21:06 UTC
13 points
0 comments4 min readLW link

Sys­tems Dy­nam­ics Model for Paus­ing AI

1 Jun 2026 20:58 UTC
13 points
0 comments7 min readLW link

Com­pan­ions aren’t Coaches: AIs’ Effect on So­cial Skills

Matt Vincent1 Jun 2026 19:30 UTC
15 points
2 comments5 min readLW link

“Con­ta­gious Hum­ming” to Silence a Room

JohnofCharleston1 Jun 2026 19:08 UTC
90 points
20 comments2 min readLW link

Dis­solv­ing the Deep Learn­ing Sam­ple Effi­ciency Gap

Samuel Knoche1 Jun 2026 18:44 UTC
124 points
24 comments17 min readLW link
(theraptureofthenerds.substack.com)

We Need Breadth-First AI Safety Plans

MichaelDickens1 Jun 2026 17:36 UTC
35 points
2 comments4 min readLW link

C-Zom­bie Uploads Down­load­ing Them­selves & Believ­ing You’re A C-Zombie

Mati_Roy1 Jun 2026 16:49 UTC
5 points
35 comments2 min readLW link
(matiroy.substack.com)

NYT: Se­na­tor San­ders Pro­poses Gov’t Take 50% Own­er­ship of AI Labs

Julian Bradshaw1 Jun 2026 16:13 UTC
47 points
11 comments1 min readLW link
(www.nytimes.com)

Opus 4.8 Part 2: Model Welfare

Zvi1 Jun 2026 15:11 UTC
55 points
1 comment25 min readLW link
(thezvi.wordpress.com)

The re­mark­able story of AIGS Canada

Wyatt Tessari L'Allié1 Jun 2026 14:07 UTC
12 points
0 comments10 min readLW link

Su­per­in­tel­li­gence of the gaps

vals tutor1 Jun 2026 13:00 UTC
5 points
11 comments1 min readLW link

Lean, not backpressure

kqr1 Jun 2026 7:57 UTC
18 points
1 comment1 min readLW link
(entropicthoughts.com)

Some hu­mans are both male and fe­male, and can (but shouldn’t) have chil­dren with themselves

HedonicEscalator1 Jun 2026 1:51 UTC
68 points
14 comments6 min readLW link
(hedonicescalator.substack.com)

My re­ac­tions to “I un­der­es­ti­mated AI ca­pa­bil­ities (again)”

Troy Tian1 Jun 2026 0:58 UTC
3 points
0 comments1 min readLW link

Lizard­men are Not Con­stant—A In­tro­duc­tory Primer to Think­ing about Sur­vey Data

DanielW1 Jun 2026 0:28 UTC
21 points
3 comments9 min readLW link

“This Hy­po­thet­i­cal is Un­re­al­is­tic” is not a Valid Ob­jec­tion

Hide1 Jun 2026 0:02 UTC
0 points
9 comments4 min readLW link
(hidefromit.substack.com)

NLA Thought Anchors

Realmbird31 May 2026 23:38 UTC
10 points
3 comments4 min readLW link

Lighthaven East—A Fea­si­bil­ity Study

JohnofCharleston31 May 2026 22:53 UTC
218 points
46 comments20 min readLW link

Bar­ri­ers to a Pros­per­ous Future

AJ Weeks31 May 2026 21:34 UTC
8 points
0 comments6 min readLW link
(ajweeks.com)

Notes on axes of vari­a­tion in third-party risk assessment

Buck31 May 2026 20:48 UTC
38 points
2 comments10 min readLW link

The main im­pact from au­to­mated AI pro­duc­tion: con­cen­tra­tion of power?

Oliver Sourbut31 May 2026 20:42 UTC
20 points
2 comments7 min readLW link
(www.oliversourbut.net)

A Song About No

jefftk31 May 2026 20:40 UTC
14 points
1 comment1 min readLW link
(www.jefftk.com)

Fi­nan­cial Costs of an AI Pause?

PeterMcCluskey31 May 2026 18:55 UTC
66 points
10 comments6 min readLW link
(bayesianinvestor.com)

Links #2: 2026/​05 Part 2

papetoast31 May 2026 13:41 UTC
8 points
0 comments20 min readLW link

Outrun­ning your headlights

mattshu041031 May 2026 10:42 UTC
41 points
3 comments3 min readLW link

Brain­ing World Models; Pre­dict­ing La­tent Struc­ture via EEG

Raghul Chandramouli31 May 2026 10:41 UTC
1 point
0 comments5 min readLW link
(brain-jepa)