LessOnline 2026

nomagicpill · 9 Jun 2026 23:24 UTC
3 points
0 comments · 5 min read · LW link
(nomagicpill.substack.com)

“Programmer Science Fiction: My case for a new sub-genre”, Sam T. Oates 2026

gwern · 9 Jun 2026 23:23 UTC
47 points
10 comments · 1 min read · LW link
(stoates.substack.com)

The Disutility of FDT: on Utility Functions and Voting, Insights from Behavioral Economics and Decision Theory

DanielW · 9 Jun 2026 23:13 UTC
5 points
3 comments · 8 min read · LW link

Three Labs With a Plan and A Memorandum

Zvi · 9 Jun 2026 22:40 UTC
45 points
0 comments · 12 min read · LW link
(thezvi.wordpress.com)

Harmfulness Directions in OLMo

9 Jun 2026 22:31 UTC
20 points
0 comments · 11 min read · LW link

“Self-Control” Is A (Neurological) Type Error

Elliot Callender · 9 Jun 2026 21:34 UTC
−6 points
0 comments · 1 min read · LW link

Towards a Formal Scientific Epistemology

Richard_Ngo · 9 Jun 2026 20:31 UTC
75 points
9 comments · 7 min read · LW link
(www.mindthefuture.info)

Some Interesting Papers on RLVR

CarolusRenniusVitellius · 9 Jun 2026 19:00 UTC
20 points
5 comments · 4 min read · LW link

A Mike’s-Eye View of ARC’s Research

Mikewins · 9 Jun 2026 18:30 UTC
64 points
1 comment · 11 min read · LW link
(www.alignment.org)

An LLM Flagged My Paper About LLMs Flagging Things.

Failfinder70 · 9 Jun 2026 18:00 UTC
5 points
0 comments · 2 min read · LW link

The Skeptic, the Bayesian, Empiricism and Claims to Know:

DanielW · 9 Jun 2026 17:52 UTC
4 points
4 comments · 4 min read · LW link

Claude Fable 5 and Mythos 5 [Linkpost]

fluxxrider · 9 Jun 2026 17:19 UTC
42 points
10 comments · 1 min read · LW link

5 Things I Learned About People From Doing Stand-Up Comedy

Luise Woehlke · 9 Jun 2026 15:52 UTC
−4 points
5 comments · 2 min read · LW link
(open.substack.com)

The Machines Lack Honour

Raymond Douglas · 9 Jun 2026 15:30 UTC
169 points
21 comments · 12 min read · LW link

High Dynamic Range DIY Air Testing

jefftk · 9 Jun 2026 15:00 UTC
13 points
0 comments · 4 min read · LW link
(www.jefftk.com)

AI Super PAC tracker

Mikhail Samin · 9 Jun 2026 14:57 UTC
26 points
0 comments · 1 min read · LW link
(electhumans.com)

[Linkpost] Evals for “SPI-incompatible” behavior & reasoning: Guide to initial research

Anthony DiGiovanni · 9 Jun 2026 13:44 UTC
23 points
0 comments · 1 min read · LW link
(docs.google.com)

Subversion-Resistance for Free from Formal Verification

Adam Chlipala · 9 Jun 2026 12:01 UTC
7 points
0 comments · 7 min read · LW link

LLMs and almost good code

kqr · 9 Jun 2026 7:21 UTC
33 points
9 comments · 3 min read · LW link
(entropicthoughts.com)

On Slop

Jan · 9 Jun 2026 1:08 UTC
32 points
4 comments · 7 min read · LW link
(universalprior.substack.com)

How to build a cancer vaccine, and whether they will work this time

Abhishaike Mahajan · 8 Jun 2026 20:45 UTC
58 points
9 comments · 25 min read · LW link
(www.owlposting.com)

Efficient tradeoffs and the safety-usefulness tradeoff model

Buck · 8 Jun 2026 20:28 UTC
42 points
1 comment · 8 min read · LW link

Accelerated Skill Learning via Dream Engineering and Biofeedback

Elliot Callender · 8 Jun 2026 20:08 UTC
5 points
2 comments · 3 min read · LW link

How valuable are weak AI safety regulations?

MichaelDickens · 8 Jun 2026 18:24 UTC
28 points
0 comments · 6 min read · LW link

How to reduce capability degradation from off-model SFT

8 Jun 2026 16:24 UTC
21 points
0 comments · 3 min read · LW link

The Next Swan: Frank Ramsey, Variable Hypotheticals, and the Bet on Induction

Ramseyian · 8 Jun 2026 12:01 UTC
4 points
0 comments · 18 min read · LW link

Coverage-driven alignment—What ‘Teaching Claude Why’ can borrow from AV verification

Yoav Hollander · 8 Jun 2026 11:42 UTC
16 points
4 comments · 14 min read · LW link
(blog.foretellix.com)

Bun’s Migration from Zig to Rust as a Potential Case Study for Gradual Disempowerment

Sayhan Yalvaçer · 8 Jun 2026 7:06 UTC
96 points
8 comments · 3 min read · LW link

Contra Dance at LessOnline

jefftk · 8 Jun 2026 5:50 UTC
23 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Honking is good

PossiblyElaine · 8 Jun 2026 4:36 UTC
9 points
7 comments · 4 min read · LW link
(open.substack.com)

The CIA believes everything

volpe · 8 Jun 2026 0:43 UTC
22 points
10 comments · 2 min read · LW link
(volpe.envs.net)

How do people stop spiraling about Roko’s Basilisk & acausal extortion?

anon202 · 8 Jun 2026 0:39 UTC
9 points
6 comments · 1 min read · LW link

Contextual Identity Laundering: How Claude’s Image Refusal Can Be Routed Through Web Search

Failfinder70 · 8 Jun 2026 0:39 UTC
7 points
2 comments · 9 min read · LW link

Mental causation is not load-bearing

jessicata · 7 Jun 2026 20:43 UTC
38 points
4 comments · 10 min read · LW link

How Far Apart Does a Model Think Its Tokens Are?

Brendan Long · 7 Jun 2026 20:20 UTC
47 points
9 comments · 10 min read · LW link
(www.brendanlong.com)

Autopilot Thinking

XelaP · 7 Jun 2026 20:20 UTC
10 points
4 comments · 6 min read · LW link

Secret Loyalties Likely Raise Remote-Influenceability

Kaustubh Kislay · 7 Jun 2026 17:51 UTC
13 points
0 comments · 6 min read · LW link

From One Piece to One Pace - Vision and mission in coordination of agents

a unemployed pastor- de S Brito · 7 Jun 2026 17:07 UTC
2 points
0 comments · 4 min read · LW link

Neglected Basics of AI Alignment

Quirinus_Quirrell · 7 Jun 2026 9:02 UTC
28 points
2 comments · 6 min read · LW link

The Hats of LessOnline

AprilSR · 7 Jun 2026 8:57 UTC
15 points
2 comments · 3 min read · LW link
(aprilsr.substack.com)

Can activation verbalizers surface an internal chain of thought?

7 Jun 2026 4:24 UTC
122 points
0 comments · 16 min read · LW link

Frontier Models Still Lag Behind Humans at Robust Belief-State Tracking

Lukas Frei · 6 Jun 2026 23:54 UTC
13 points
6 comments · 5 min read · LW link

Coming Around To Political Donations

jefftk · 6 Jun 2026 21:30 UTC
59 points
8 comments · 2 min read · LW link
(www.jefftk.com)

Analysis of Metastable States in the Transformer Activation Space

Zach Baker · 6 Jun 2026 21:30 UTC
10 points
0 comments · 20 min read · LW link

The Diamond Lemma

Isaac Newton · 6 Jun 2026 21:15 UTC
21 points
0 comments · 7 min read · LW link
(archimedeanmonoid.substack.com)

Iliad is Hiring

Peter Jean · 6 Jun 2026 21:08 UTC
13 points
0 comments · 1 min read · LW link

Against Corrigibility

peralice · 6 Jun 2026 20:28 UTC
66 points
17 comments · 12 min read · LW link

The Residual Stream Has a Geometry of Time

Fodenthal · 6 Jun 2026 19:57 UTC
23 points
0 comments · 8 min read · LW link

Exponential Solitude

PeterMaui · 6 Jun 2026 19:49 UTC
5 points
1 comment · 9 min read · LW link

Freud heard a rumor that Science existed, and had a wonderful dream

Bruce Middleton · 6 Jun 2026 14:47 UTC
8 points
8 comments · 6 min read · LW link