Emma Baker on ADHD

koratkar14 May 2026 23:29 UTC
8 points
2 comments3 min readLW link
(emma00baker.substack.com)

De­sign­ing AI fac­tual claims for “easy ver­ifi­ca­tion”

Raemon14 May 2026 23:23 UTC
33 points
17 comments2 min readLW link

Au­to­mated Align­ment is Harder Than You Think

14 May 2026 22:01 UTC
143 points
7 comments3 min readLW link
(arxiv.org)

2B scor­ing model flags out-of-do­main mis­al­ign­ment, sug­gest­ing spe­cial­ist judges have po­ten­tial for audits

burnssa14 May 2026 20:00 UTC
8 points
0 comments6 min readLW link

The safe-to-dan­ger­ous shift is a fun­da­men­tal prob­lem for eval re­al­ism; but also for mea­sur­ing awareness

14 May 2026 17:05 UTC
59 points
3 comments3 min readLW link

AI #168: Not Lead­ing the Future

Zvi14 May 2026 14:10 UTC
38 points
2 comments45 min readLW link
(thezvi.wordpress.com)

Why En­sur­ing Flour­ish­ing Is Not About Alignment

ofpetro14 May 2026 6:24 UTC
5 points
6 comments35 min readLW link

In­ter­ven­ing on Sparse, An­chored Concepts

Sandy Fraser14 May 2026 4:35 UTC
24 points
3 comments10 min readLW link

Al­gorith­mic Perfection

zw514 May 2026 3:44 UTC
5 points
1 comment2 min readLW link

Models find­ing soft­ware vuln­er­a­bil­ities is not the pri­mary source of cy­ber­se­cu­rity risk

lc14 May 2026 3:39 UTC
310 points
24 comments2 min readLW link

Claude is Now Align­ment-Pretrained

RogerDearnaley13 May 2026 23:19 UTC
87 points
9 comments1 min readLW link
(www.anthropic.com)

MATS Au­tumn 2026 Fel­low­ship Ap­pli­ca­tions Now Open—Ap­ply by June 7

13 May 2026 21:40 UTC
21 points
0 comments2 min readLW link

Build­ing Connections

13 May 2026 20:27 UTC
8 points
0 comments5 min readLW link

A lack of in­tro­spec­tive abil­ity is not a lack of cor­rigi­bil­ity

lc13 May 2026 20:23 UTC
26 points
3 comments1 min readLW link

Cy­ber Lack of Se­cu­rity and AI Governance

Zvi13 May 2026 20:20 UTC
41 points
1 comment16 min readLW link
(thezvi.wordpress.com)

Stick­i­ness in AI Be­hav­ioral Design

James_T13 May 2026 19:55 UTC
10 points
0 comments14 min readLW link
(www.forethought.org)

Pre­dict­ing Rare LLM Failures with 30× Fewer Rollouts

13 May 2026 17:53 UTC
55 points
3 comments5 min readLW link

Most “in­ner work” looks like en­ter­tain­ment.

Chris Lakin13 May 2026 17:51 UTC
48 points
10 comments2 min readLW link

A Re­search Agenda for Se­cret Loyalties

13 May 2026 17:34 UTC
35 points
3 comments3 min readLW link

Apollo Up­date May 2026

Marius Hobbhahn13 May 2026 16:43 UTC
48 points
0 comments1 min readLW link
(www.apolloresearch.ai)

The case for fine-grained track­ing of com­pute for AI

13 May 2026 16:00 UTC
36 points
17 comments9 min readLW link
(forum.effectivealtruism.org)

Vibe Ex­cel and the Fu­ture of White-Col­lar Work

ykevinzhang13 May 2026 15:39 UTC
13 points
5 comments6 min readLW link

“Com­mu­nity or­ga­nizer” is a dou­ble oxymoron

jchan13 May 2026 15:10 UTC
5 points
13 comments5 min readLW link

Vot­ers are sur­pris­ingly open to talk­ing about AI risk

less_raichu13 May 2026 14:08 UTC
117 points
11 comments3 min readLW link

Civ­i­liza­tion as a tower of holes

Joe Rogero13 May 2026 13:48 UTC
24 points
3 comments4 min readLW link
(subatomicarticles.com)

Ap­pli­ca­tions Open for Im­pact Ac­cel­er­a­tor Program

High Impact Professionals13 May 2026 8:35 UTC
6 points
0 comments1 min readLW link

Epistemic Im­mun­ode­pres­sion in the Age of AI

Tuyen Tran13 May 2026 5:49 UTC
15 points
5 comments2 min readLW link

Lorxus Does Bud­get Inkhaven Again: 4/​29, 4/​30, High­lights, Postmortem

Lorxus13 May 2026 1:37 UTC
15 points
0 comments3 min readLW link
(tiled-with-pentagons.blogspot.com)

Guessti­mate For Pre­dic­tion Mar­ket Returns

DirectedEvolution12 May 2026 23:13 UTC
10 points
0 comments1 min readLW link

Prob­a­bil­is­tic, Re­for­ma­tive Justice

Leo Schmidt-Traub12 May 2026 22:41 UTC
2 points
0 comments3 min readLW link

This is a Dat­ing Ad

Xger3112 May 2026 22:37 UTC
−17 points
6 comments3 min readLW link

Re­in­force­ment Learn­ing, Agency and Taste

epicurus12 May 2026 18:22 UTC
7 points
0 comments9 min readLW link

Child­hood and Ed­u­ca­tion #18: Do The Math

Zvi12 May 2026 18:20 UTC
56 points
11 comments13 min readLW link
(thezvi.wordpress.com)

The Owned Ones

Eliezer Yudkowsky12 May 2026 17:56 UTC
369 points
51 comments6 min readLW link

Sig­nal­ing and Per­verse Adop­tion of Ex­pen­sive AI

Adam Chlipala12 May 2026 14:34 UTC
−21 points
2 comments8 min readLW link

On Hav­ing Good Hot Takes

Celer12 May 2026 14:20 UTC
9 points
2 comments8 min readLW link
(keller.substack.com)

Op­ti­mi­sa­tion: Selec­tive ver­sus Predictive

Raymond Douglas12 May 2026 14:03 UTC
117 points
15 comments3 min readLW link

The Lies and Fal­la­cies of the Buyer and Seller

Hide12 May 2026 11:26 UTC
28 points
18 comments16 min readLW link
(hidefromit.substack.com)

When should an AI in­ci­dent trig­ger an in­ter­na­tional re­sponse? Cri­te­ria for in­ter­na­tional es­ca­la­tion and im­pli­ca­tions for the de­sign of AI in­ci­dent frameworks

12 May 2026 8:52 UTC
13 points
0 comments4 min readLW link

Ver­bal­ised eval­u­a­tion aware­ness in lan­guage mod­els has lit­tle effect on their behaviour

12 May 2026 5:36 UTC
19 points
1 comment6 min readLW link

The ter­rible weight of see­ing the board

philosophybear12 May 2026 5:13 UTC
1 point
8 comments9 min readLW link

Fibonacci Struc­ture in Har­monic Series Partitions

Avyukth Nilajagi12 May 2026 4:26 UTC
5 points
1 comment2 min readLW link

Hedg­ing global oil sup­ply shocks?

Nicholas Kross12 May 2026 1:37 UTC
14 points
2 comments1 min readLW link

Our ex­pe­rience of the first re­search in a pro­ject in­cu­ba­tor: much more than you wanted to know

11 May 2026 20:28 UTC
7 points
0 comments10 min readLW link

I don’t have ques­tions: how a good Jewish boy turns atheist

Semi-Pseudonymous11 May 2026 20:11 UTC
22 points
4 comments6 min readLW link

Fore­sight In­sti­tute Work­shop (Ber­lin): Boot­strap­ping Re­search Agents — Hands-On for Scientists

morisil11 May 2026 20:11 UTC
1 point
0 comments1 min readLW link

Ex­pe­rience Re­port: ML4Good AI Gover­nance Boot­camp,Lyon,May 2026

Rohit Mehdiratta11 May 2026 20:05 UTC
0 points
0 comments3 min readLW link

[Aca­demic ques­tion­naire] Hu­man rea­son­ing in so­cial de­duc­tion games vs. LLM rea­son­ing.

atuin11 May 2026 20:01 UTC
1 point
0 comments1 min readLW link

Where are all the De­ci­sion Mar­kets?

alexjaniak11 May 2026 19:48 UTC
13 points
3 comments3 min readLW link

RFDiffu­sion3: A Brief Exploration

michaelwaves11 May 2026 19:26 UTC
3 points
0 comments5 min readLW link