NLA Thought Anchors

Realmbird31 May 2026 23:38 UTC
10 points
3 comments4 min readLW link

Lighthaven East—A Fea­si­bil­ity Study

JohnofCharleston31 May 2026 22:53 UTC
218 points
46 comments20 min readLW link

Bar­ri­ers to a Pros­per­ous Future

AJ Weeks31 May 2026 21:34 UTC
8 points
0 comments6 min readLW link
(ajweeks.com)

Notes on axes of vari­a­tion in third-party risk assessment

Buck31 May 2026 20:48 UTC
38 points
2 comments10 min readLW link

The main im­pact from au­to­mated AI pro­duc­tion: con­cen­tra­tion of power?

Oliver Sourbut31 May 2026 20:42 UTC
20 points
2 comments7 min readLW link
(www.oliversourbut.net)

A Song About No

jefftk31 May 2026 20:40 UTC
14 points
1 comment1 min readLW link
(www.jefftk.com)

Fi­nan­cial Costs of an AI Pause?

PeterMcCluskey31 May 2026 18:55 UTC
66 points
10 comments6 min readLW link
(bayesianinvestor.com)

Links #2: 2026/​05 Part 2

papetoast31 May 2026 13:41 UTC
8 points
0 comments20 min readLW link

Outrun­ning your headlights

mattshu041031 May 2026 10:42 UTC
41 points
3 comments3 min readLW link

Brain­ing World Models; Pre­dict­ing La­tent Struc­ture via EEG

Raghul Chandramouli31 May 2026 10:41 UTC
1 point
0 comments5 min readLW link
(brain-jepa)

Ab­sten­tion Geom­e­try: Knowl­edge and Be­havi­our Are Dis­so­cia­ble in Llama 3.1 8B

AdeOlu31 May 2026 10:41 UTC
1 point
0 comments9 min readLW link

Food, wa­ter and power from thin desert air

Bruce Middleton31 May 2026 8:28 UTC
19 points
2 comments1 min readLW link

Why AI safety re­searchers should con­sider a con­tract re­search man­ager position

Mikhail Mironov31 May 2026 8:27 UTC
7 points
0 comments3 min readLW link

Vi­su­al­ize Cycli­cal Struc­ture in Llama Model

Talib Mirza31 May 2026 8:27 UTC
3 points
0 comments2 min readLW link

Fea­tures of SAEs are uni­ver­sal—but only up to an un­known ran­dom rotation

Jordan McCann31 May 2026 8:27 UTC
9 points
0 comments10 min readLW link

Agri­cul­ture needs an­other revolution

Dinesh Natesan31 May 2026 8:26 UTC
1 point
4 comments3 min readLW link

Why I think evals are pretty im­por­tant and most worth work­ing on (for me)

Troy Tian31 May 2026 8:26 UTC
7 points
4 comments1 min readLW link

Fun­da­men­tal Uncer­tainty: Alter­nate Frame­work and Poin­t­wise Reduction

StanislavKrym31 May 2026 3:47 UTC
12 points
3 comments8 min readLW link

Tween Con­tra Dance

jefftk31 May 2026 2:00 UTC
20 points
0 comments2 min readLW link
(www.jefftk.com)

Ensem­ble mon­i­tor­ing for AI con­trol: di­verse sig­nals out­weigh more compute

31 May 2026 1:21 UTC
12 points
0 comments7 min readLW link

How’s it go­ing? Re­in­force­ment learn­ing in lan­guage mod­els re­cruits a func­tional welfare axis

andyqhan30 May 2026 23:14 UTC
29 points
1 comment5 min readLW link

AI is a Me­teor. Don’t Be a Dinosaur.

Boaz Barak30 May 2026 19:50 UTC
−2 points
7 comments1 min readLW link

An at­tempted syn­the­sis on prob­a­bil­ities and infinities

David Matolcsi30 May 2026 19:24 UTC
10 points
0 comments27 min readLW link

Com­ment on “Ban­ning Said Ach­miz”

Zack_M_Davis30 May 2026 17:33 UTC
67 points
90 comments50 min readLW link

A For­mula for Fun

Ihor Kendiukhov30 May 2026 13:01 UTC
11 points
3 comments8 min readLW link

Open Thread Sum­mer 2026

habryka30 May 2026 5:00 UTC
28 points
11 comments1 min readLW link

An­nounc­ing: Iliad’s Fall 2026 Programs

30 May 2026 4:37 UTC
64 points
7 comments1 min readLW link

Bloomberg ter­mi­nals for the rest of us

aiechrl30 May 2026 3:13 UTC
34 points
0 comments20 min readLW link

AI as Biol­ogy’s Digi­tal Microscope

Darin Tsui30 May 2026 3:11 UTC
10 points
0 comments3 min readLW link

Ablat­ing In­duc­tion Heads Leads to an in­crease in Lo­cal Repetition

Arjun Rao30 May 2026 3:11 UTC
8 points
0 comments5 min readLW link

Sys­tem Prompts vs. Part­ner Adap­ta­tion in LLMs (or, when LLMs know you’re an adult but keep talk­ing like you’re seven)

hi_im_yasha30 May 2026 3:07 UTC
4 points
0 comments7 min readLW link

Belief man­i­folds, and how to steer along them

Will Mayner30 May 2026 3:05 UTC
8 points
0 comments16 min readLW link
(willmayner.com)

New RFP on ex­treme power concentration

bengs30 May 2026 3:04 UTC
9 points
0 comments1 min readLW link

What If We Will Stop De­stroy­ing Peo­ple Be­cause Medicine Is Not Ready Yet?

Andrey Panferov30 May 2026 3:02 UTC
1 point
2 comments6 min readLW link

Why tun­ing fails: The AI has no self

Michael Trifonov30 May 2026 3:01 UTC
6 points
2 comments12 min readLW link

Wall-Mounted Far-UVC

jefftk30 May 2026 2:20 UTC
18 points
2 comments1 min readLW link
(www.jefftk.com)

A new ap­proach to in­ter­pretabil­ity: round-trip neu­ral net­work com­pila­tion-decompilation

Emma Leonhart29 May 2026 22:23 UTC
9 points
0 comments3 min readLW link

Claude Opus 4.8: The Sys­tem Card

Zvi29 May 2026 20:50 UTC
64 points
1 comment23 min readLW link
(thezvi.wordpress.com)

Test­ing Gem­ini mod­els for schem­ing tendencies

29 May 2026 19:24 UTC
47 points
8 comments6 min readLW link
(deepmindsafetyresearch.medium.com)

How much should we worry about se­cretly loyal AIs?

Dave Banerjee29 May 2026 19:14 UTC
13 points
1 comment13 min readLW link
(www.the-substrate.net)

Data you could have ob­served but didn’t

Gretta Duleba29 May 2026 18:20 UTC
66 points
3 comments1 min readLW link

Is Progress Inevitable?

frmsaul29 May 2026 17:40 UTC
0 points
5 comments4 min readLW link

Retry­ing vs Re­sam­pling in AI Control

29 May 2026 17:02 UTC
67 points
4 comments9 min readLW link
(blog.redwoodresearch.org)

When Are Two Net­works the Same? Ten­sor Similar­ity for Mechanis­tic Interpretability

29 May 2026 15:53 UTC
36 points
3 comments4 min readLW link

It takes a village to sup­port a marriage

Shoshannah Tekofsky29 May 2026 15:16 UTC
21 points
5 comments2 min readLW link
(shoshanigans.substack.com)

AI Re­searchers, Ask Your­self Th­ese 6 Ques­tions to Strengthen Your Mo­ral Muscles

Max Tegmark29 May 2026 15:07 UTC
40 points
13 comments7 min readLW link

Maybe we should pre­train on syn­thetic data about good-but-re­ward-hack­ing AIs

Elliott Thornley (EJT)29 May 2026 14:50 UTC
12 points
4 comments3 min readLW link

Han­ni­bal Mis­tral: the Mis­tral fam­ily has a prob­lem with per­sona-con­di­tioned elicitation

vigji29 May 2026 12:16 UTC
21 points
0 comments7 min readLW link

Devel­op­men­tal Cog­ni­tive In­ter­pretabil­ity: A Re­search Agenda for Model­ling Gen­er­al­i­sa­tion and Pre­dict­ing Agent Behaviour

29 May 2026 9:56 UTC
67 points
0 comments7 min readLW link

Re­la­tional Con­scious­ness and AGI.

PaddyC29 May 2026 6:49 UTC
−11 points
8 comments1 min readLW link