CFAR is run­ning an ex­per­i­men­tal mini-work­shop (June 2-6, Berkeley CA)!

Davis_Kingsley29 May 2025 22:02 UTC
64 points
2 comments2 min readLW link

Or­phaned Poli­cies (Post 5 of 7 on AI Gover­nance)

Mass_Driver29 May 2025 21:42 UTC
70 points
5 comments16 min readLW link

Grad­ual Disem­pow­er­ment: Con­crete Re­search Projects

Raymond Douglas29 May 2025 18:55 UTC
100 points
10 comments10 min readLW link

Do you even have a sys­tem prompt? (PSA /​ repo)

Croissanthology29 May 2025 18:49 UTC
108 points
77 comments2 min readLW link

In­cor­rect Baseline Eval­u­a­tions Call into Ques­tion Re­cent LLM-RL Claims

shash4229 May 2025 18:40 UTC
66 points
7 comments1 min readLW link
(safe-lip-9a8.notion.site)

Dimensionalization

Jordan Rubin29 May 2025 18:18 UTC
7 points
6 comments4 min readLW link
(jordanmrubin.substack.com)

Distil­led Hu­man Judg­ment: Reify­ing AI Alignment

Devansh Mehta29 May 2025 18:06 UTC
2 points
0 comments4 min readLW link

Sum­mer AI Safety In­tro Fel­low­ships in Bos­ton and On­line (Policy & Tech­ni­cal) – Ap­ply by June 6!

jandrade11229 May 2025 18:02 UTC
1 point
0 comments1 min readLW link

Digi­tal sen­tience fund­ing op­por­tu­ni­ties: Sup­port for ap­plied work and research

29 May 2025 15:22 UTC
21 points
0 comments4 min readLW link

When to Be Nice vs Kind

Declan Molony29 May 2025 15:06 UTC
24 points
2 comments1 min readLW link

AI #118: Claude Ascendant

Zvi29 May 2025 14:10 UTC
45 points
8 comments57 min readLW link
(thezvi.wordpress.com)

So­cial Cap­i­tal—Does it Mat­ter?

Momcilo29 May 2025 12:26 UTC
−9 points
1 comment6 min readLW link

Align­ment Cri­sis: Geno­cide Denial

_mp_29 May 2025 12:04 UTC
−11 points
5 comments4 min readLW link

Cross-post­ing to Substack

jefftk29 May 2025 11:10 UTC
12 points
0 comments1 min readLW link
(www.jefftk.com)

Reflec­tions on AI Wis­dom, plus an­nounc­ing Wise AI Wednesdays

Chris_Leong29 May 2025 7:13 UTC
18 points
0 comments3 min readLW link

[Question] What was so great about Move 37?

Caleb Biddulph29 May 2025 7:00 UTC
24 points
4 comments3 min readLW link

Pro­ce­du­ral vs. Causal Understanding

Caleb Biddulph29 May 2025 7:00 UTC
7 points
2 comments2 min readLW link

Se­cu­rity Mind­set: Hack­ing Pin­ball High Scores

gwern29 May 2025 3:39 UTC
27 points
3 comments1 min readLW link
(gwern.net)

Quick Min­i­mal Playhouse

jefftk29 May 2025 2:10 UTC
17 points
1 comment1 min readLW link
(www.jefftk.com)

Cog­ni­tive Ex­haus­tion and Eng­ineered Trust: Les­sons from My Gym

Priyanka Bharadwaj29 May 2025 1:21 UTC
14 points
3 comments3 min readLW link

Truth or Dare

Duncan Sabien (Inactive)29 May 2025 0:07 UTC
263 points
61 comments69 min readLW link

[Question] What should I read to un­der­stand an­ces­tral hu­man so­ciety?

Lorec28 May 2025 23:36 UTC
9 points
4 comments1 min readLW link

The case for coun­ter­mea­sures to memetic spread of mis­al­igned values

Alex Mallen28 May 2025 21:12 UTC
61 points
1 comment7 min readLW link

a case for a less­wrong pri­vate pre­dic­tion market

don't_wanna_be_stupid_any_more28 May 2025 20:26 UTC
3 points
0 comments2 min readLW link

LessWrong Feed [new, now in beta]

Ruby28 May 2025 19:01 UTC
53 points
87 comments8 min readLW link

Fun With Veo 3 and Me­dia Generation

Zvi28 May 2025 18:30 UTC
29 points
0 comments5 min readLW link
(thezvi.wordpress.com)

Re­v­erse Auc­tions for Group De­ci­sion-Making

krishmatta28 May 2025 17:52 UTC
3 points
4 comments3 min readLW link
(krishmatta.net)

What col­lege ma­jor should I choose if I am un­sure?

contrejour28 May 2025 17:50 UTC
−1 points
6 comments1 min readLW link

Eval­u­a­tion As Feed­back Cycle

belos28 May 2025 17:02 UTC
1 point
0 comments18 min readLW link
(bestofagreatlot.substack.com)

How much might AI leg­is­la­tion cost in the U.S.?

will rinehart28 May 2025 16:21 UTC
−5 points
0 comments11 min readLW link

What LLMs lack

p.b.28 May 2025 16:19 UTC
15 points
5 comments3 min readLW link

Playlist In­spired by Man­i­fest 2024

Commander Zander28 May 2025 16:03 UTC
4 points
0 comments1 min readLW link
(open.spotify.com)

AISN #56: Google Re­leases Veo 3

28 May 2025 16:00 UTC
7 points
0 comments4 min readLW link
(newsletter.safe.ai)

How Self-Aware Are LLMs?

Christopher Ackerman28 May 2025 12:57 UTC
30 points
9 comments10 min readLW link

Can We Hack He­donic Tread­mills?

Vincent Li28 May 2025 11:42 UTC
3 points
0 comments3 min readLW link

AI’s goals may not match ours

28 May 2025 9:30 UTC
14 points
1 comment3 min readLW link

AI may pur­sue goals

28 May 2025 9:30 UTC
13 points
0 comments1 min readLW link

The Best Way to Align an LLM: Is In­ner Align­ment Now a Solved Prob­lem?

RogerDearnaley28 May 2025 6:21 UTC
35 points
34 comments9 min readLW link

Spec­tral radii di­men­sion­al­ity re­duc­tion com­puted with­out gra­di­ent calculations

Joseph Van Name28 May 2025 5:06 UTC
5 points
4 comments6 min readLW link

If you’re not sure how to sort a list or grid—se­ri­ate it!

gwern28 May 2025 3:54 UTC
218 points
8 comments3 min readLW link
(www.jstatsoft.org)

Briefly an­a­lyz­ing the 10-year mora­to­rium amendment

RobertM28 May 2025 3:11 UTC
73 points
1 comment3 min readLW link

Does Sort Really Fall Back to Disk?

jefftk28 May 2025 1:20 UTC
13 points
2 comments1 min readLW link
(www.jefftk.com)

Shift Re­sources to Ad­vo­cacy Now (Post 4 of 7 on AI Gover­nance)

Mass_Driver28 May 2025 1:19 UTC
60 points
18 comments32 min readLW link

[Question] Colo­nial­ism in space: Does a col­lec­tion of minds have ex­actly two at­trac­tors?

StanislavKrym27 May 2025 23:35 UTC
7 points
8 comments1 min readLW link

[Question] What are the best ar­gu­ments you’ve seen for the Li­tany of Gendlin?

flowerfeatherfocus27 May 2025 21:19 UTC
7 points
8 comments1 min readLW link

What We Learned from Briefing 70+ Law­mak­ers on the Threat from AI

leticiagarcia27 May 2025 18:23 UTC
495 points
17 comments16 min readLW link
(substack.com)

My script for or­ga­niz­ing OBNYC meetups

Orioth27 May 2025 18:14 UTC
3 points
0 comments4 min readLW link

Un­trusted AIs can ex­ploit feed­back in con­trol protocols

27 May 2025 16:41 UTC
30 points
0 comments16 min readLW link

Re­quiem for the hopes of a pre-AI world

Mitchell_Porter27 May 2025 14:47 UTC
97 points
0 comments3 min readLW link

The Best of All Pos­si­ble Worlds

Jakub Growiec27 May 2025 13:16 UTC
11 points
7 comments49 min readLW link