Con­sti­tu­tional AI Alignment

RogerDearnaley27 May 2026 22:29 UTC
27 points
9 comments47 min readLW link

LLMs Through the Eyes of Vinge

Gordon Seidoh Worley27 May 2026 20:20 UTC
52 points
2 comments4 min readLW link
(www.uncertainupdates.com)

Biolog­i­cally Plau­si­ble SGD Is Hard

Elliot Callender27 May 2026 19:34 UTC
9 points
0 comments1 min readLW link

Eval Co­op­er­a­tive­ness May Be a Scal­able Miti­ga­tion for Eval Gaming

27 May 2026 19:33 UTC
73 points
5 comments10 min readLW link
(turntrout.com)

no, Mag­nifica Hu­man­i­tas is not AI-written

bhauth27 May 2026 19:26 UTC
−13 points
18 comments3 min readLW link

Albu­querque ACX Meetup

Mary27 May 2026 18:27 UTC
2 points
0 comments1 min readLW link

Full au­toma­tion of AI R&D prob­a­bly yields a large speed up even with­out a soft­ware-only singularity

ryan_greenblatt27 May 2026 18:16 UTC
67 points
17 comments3 min readLW link

Not Prosthetics

Elliot Callender27 May 2026 17:22 UTC
11 points
0 comments2 min readLW link

BCI Cog­ni­tion En­hance­ment is Possible

Elliot Callender27 May 2026 17:19 UTC
17 points
0 comments1 min readLW link

The bal­lad of TIGIT

Abhishaike Mahajan27 May 2026 17:04 UTC
84 points
1 comment9 min readLW link

Lev­er­ag­ing In­tro­spec­tion for Alignment

Yotam27 May 2026 16:54 UTC
25 points
3 comments7 min readLW link

An­nounc­ing Geodesic Research

27 May 2026 16:40 UTC
74 points
1 comment5 min readLW link

AI as a So­cial Tech­nol­ogy, by Henry Farell

TheManxLoiner27 May 2026 13:41 UTC
15 points
0 comments3 min readLW link
(lovkush.substack.com)

More ca­pa­ble AI, less money raised

Shoshannah Tekofsky27 May 2026 12:57 UTC
28 points
2 comments3 min readLW link
(theaidigest.org)

Quan­ti­ta­tive AI risk as­sess­ment: a start­ing point

27 May 2026 9:42 UTC
38 points
7 comments11 min readLW link
(www.safer-ai.org)

[pa­per] Train­ing on Doc­u­ments About Mon­i­tor­ing Leads to CoT Obfuscation

27 May 2026 9:39 UTC
31 points
1 comment4 min readLW link
(arxiv.org)

No fron­tier model has ac­cept­able lev­els of com­pli­ance with the EU AI Act and pri­vacy leg­is­la­tion.

27 May 2026 7:35 UTC
29 points
0 comments9 min readLW link

Think­ing out­side the box? LLM anal­y­sis of sim­plified co­op­er­a­tive poker

Dentosal27 May 2026 7:28 UTC
15 points
0 comments4 min readLW link

Stan­dard de­vi­a­tions from just two values

kqr27 May 2026 5:01 UTC
41 points
2 comments3 min readLW link
(entropicthoughts.com)

Con­tra Went­worth on Phys­i­cal At­trac­tive­ness for Men

Gretta Duleba26 May 2026 23:20 UTC
123 points
25 comments8 min readLW link

Train­ing Lan­guage Models for Con­trol­led Stochasticity

26 May 2026 22:17 UTC
18 points
0 comments5 min readLW link

Are Mythos’ Cy­ber Ca­pa­bil­ities Over­stated? - Yes and No

Muhan Luo26 May 2026 22:17 UTC
7 points
1 comment10 min readLW link

Should we train LLMs to be hu­man?

Hubert Plisiecki26 May 2026 22:16 UTC
3 points
0 comments2 min readLW link

Steer­ing Direc­tions Are Ex­pla­na­tions, Not Handles

JackYoung2726 May 2026 22:15 UTC
8 points
0 comments7 min readLW link

You Can’t Tell a Con­science From a Leash by Watching

GenericHousewife_B26 May 2026 22:14 UTC
6 points
2 comments3 min readLW link

Find­ing the Mole: Bayesi­anism is Hard

laniakea26 May 2026 21:55 UTC
35 points
0 comments5 min readLW link

Sim­plify­ing Align­ment by Ex­pand­ing Scope

Adam Chlipala26 May 2026 21:42 UTC
3 points
0 comments7 min readLW link

Prac­ti­cal Learn­ings from Syn­thetic Doc­u­ment Finetuning

26 May 2026 19:22 UTC
80 points
6 comments8 min readLW link

Claude, Author of the Humanitas

Linch26 May 2026 16:05 UTC
118 points
42 comments16 min readLW link

When does de­bate help a weak judge? Ev­i­dence from code, logic and math

26 May 2026 14:36 UTC
16 points
4 comments5 min readLW link

ACX At­lanta June 2026 Meetup

Steve French26 May 2026 13:59 UTC
2 points
0 comments1 min readLW link

RTMH: Pope Leo’s Mag­nifica Hu­man­i­tas on AI

Zvi26 May 2026 13:20 UTC
36 points
5 comments29 min readLW link
(thezvi.wordpress.com)

The Fatal AGI Hard­ware Gap

jrincayc26 May 2026 12:55 UTC
4 points
5 comments1 min readLW link

Many por­tions of Mag­nifica Hu­man­i­tas ap­pear to be AI-written

DanielFilan26 May 2026 7:40 UTC
78 points
51 comments6 min readLW link
(danielfilan.com)

Brain trans­fers might be the eas­iest path to life extension

Semi-Pseudonymous26 May 2026 6:23 UTC
11 points
15 comments4 min readLW link

Some Thoughts on Ben­gio’s Scien­tist AI

Matthew Khoriaty26 May 2026 3:05 UTC
21 points
4 comments2 min readLW link

Brack­ets Are a Bad Way to Regulate

Hide26 May 2026 3:01 UTC
75 points
15 comments5 min readLW link
(hidefromit.substack.com)

Donat­ing 80% While It Still Counts

jefftk26 May 2026 1:30 UTC
123 points
8 comments6 min readLW link
(www.jefftk.com)

Notes on Fourier Analysis

Menotim26 May 2026 0:39 UTC
32 points
5 comments23 min readLW link

Im­prov­ing Petri schem­ing au­dits with en­vi­ron­ment blueprints

Jannes Elstner26 May 2026 0:31 UTC
12 points
0 comments6 min readLW link

Pope Leo’s First AI En­cycli­cal – Sum­mary and Commentary

John-Clark Levin25 May 2026 23:48 UTC
26 points
8 comments39 min readLW link

Cog­ni­tive Se­cu­rity as an AI Safety Cause Area

jsteinhardt25 May 2026 18:30 UTC
156 points
18 comments2 min readLW link

Sen­tient Welfare Across Three Futures

MichaelDickens25 May 2026 16:22 UTC
13 points
2 comments2 min readLW link

Linkpost: New Vat­i­can En­cycli­cal on AI Governance

Jackson Wagner25 May 2026 15:40 UTC
58 points
7 comments1 min readLW link

How AI Will Save Pre­dic­tion Markets

alexjaniak25 May 2026 14:24 UTC
11 points
18 comments6 min readLW link
(x.com)

There should be a dis­cus­sion about LW’s policy to al­low calls for violence

Mikhail Samin25 May 2026 13:51 UTC
−5 points
21 comments10 min readLW link

Char­ac­ter-trained mod­els can strug­gle to generalise

Nathaniel Mitrani25 May 2026 12:58 UTC
22 points
4 comments4 min readLW link

Ap­pli­ca­tions open for the Se­cure Pro­gram Syn­the­sis Fellowship

eitan sprejer25 May 2026 10:04 UTC
8 points
0 comments1 min readLW link

An­nounc­ing the Fron­tier Biodefense Fel­low­ship (dead­line 7 June)

Tobias H25 May 2026 7:58 UTC
5 points
0 comments3 min readLW link

Tax­ing Small Cars To Im­prove MPG

jefftk24 May 2026 21:50 UTC
91 points
11 comments2 min readLW link
(www.jefftk.com)