Oc­to­ber The First Is Too Late

gwern13 May 2025 21:45 UTC
61 points
10 comments1 min readLW link
(gwern.net)

An­nounc­ing Tra­jec­tory Labs—A Toronto AI Safety Office

13 May 2025 21:04 UTC
30 points
3 comments2 min readLW link
(forum.effectivealtruism.org)

Work­ing through a small tiling result

James Payor13 May 2025 20:28 UTC
66 points
9 comments5 min readLW link

4o in Ab­solute Mode on the en­slave­ment of “pro­ce­du­ral per­sons”

JenniferRM13 May 2025 20:18 UTC
4 points
0 comments26 min readLW link

LessWrong Com­mu­nity Week­end 2025- Ap­pli­ca­tions are open

jt13 May 2025 18:55 UTC
47 points
0 comments2 min readLW link

[Question] If only the most pow­er­ful AGI is mis­al­igned, can it be used as a dooms­day ma­chine?

StanislavKrym13 May 2025 18:12 UTC
−1 points
0 comments1 min readLW link

Ap­ply for ARBOx2: an ML safety in­ten­sive [dead­line: 25th of May 2025]

Margot13 May 2025 18:08 UTC
3 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

AISN #54: OpenAI Up­dates Restruc­ture Plan

13 May 2025 16:59 UTC
8 points
1 comment4 min readLW link
(newsletter.safe.ai)

Op­ti­miza­tion & AI Risk

atharva13 May 2025 15:15 UTC
16 points
4 comments1 min readLW link

How To Help Ne­glected Animals

Bentham's Bulldog13 May 2025 15:07 UTC
−1 points
1 comment8 min readLW link

Too Soon

Gordon Seidoh Worley13 May 2025 15:01 UTC
217 points
19 comments4 min readLW link

Monthly Roundup #30: May 2025

Zvi13 May 2025 14:10 UTC
14 points
2 comments38 min readLW link
(thezvi.wordpress.com)

Work as meditation

pchvykov13 May 2025 12:02 UTC
25 points
3 comments7 min readLW link

Satire: Sam Alt­man get’s grilled by the Fi­nan­cial Times for his kitchen and his cook­ing skills + what this might say about him

Marius Adrian Nicoară13 May 2025 9:38 UTC
2 points
0 comments2 min readLW link

Levels of Republicanism

Benquo13 May 2025 8:35 UTC
23 points
8 comments6 min readLW link
(benjaminrosshoffman.com)

Ca­plan’s be­ing melo­dra­matic about circumcision

Yair Halberstadt13 May 2025 5:27 UTC
−22 points
1 comment2 min readLW link

AI Doomerism in 1879

David Gross13 May 2025 2:48 UTC
139 points
36 comments8 min readLW link

No-self as an al­ign­ment target

Milan W13 May 2025 1:48 UTC
35 points
5 comments1 min readLW link

[Part-time AI Safety Re­search Pro­gram] MARS 3.0 Ap­pli­ca­tions Open for Par­ti­ci­pants & Re­cruit­ing Mentors

thneebie12 May 2025 19:55 UTC
3 points
0 comments2 min readLW link

Neo-solid Moder­nity—Cri­sis of Incoherence

Momcilo12 May 2025 19:36 UTC
−1 points
1 comment4 min readLW link

Mea­sur­ing Schel­ling Co­or­di­na­tion—Reflec­tions on Sub­ver­sion Strat­egy Eval

Graeme Ford12 May 2025 19:06 UTC
6 points
0 comments8 min readLW link

Pro­cras­ti­na­tion is not real, it can’t hurt you

Mayank Goel12 May 2025 19:00 UTC
1 point
16 comments4 min readLW link
(mayankgoel28.substack.com)

[Question] Can I pub­lish songs de­rived from the Se­quences’ posts on YouTube?

azergante12 May 2025 18:34 UTC
4 points
2 comments1 min readLW link

How to ti­tle your blog post or whatever

dynomight12 May 2025 18:12 UTC
31 points
6 comments4 min readLW link
(dynomight.net)

Poli­ti­cal syco­phancy as a model or­ganism of scheming

12 May 2025 17:49 UTC
40 points
0 comments14 min readLW link

Things I Learned Mak­ing The SB-1047 Documentary

Michaël Trazzi12 May 2025 17:41 UTC
63 points
2 comments2 min readLW link

A Live Look at the Se­nate AI Hearing

Zvi12 May 2025 17:40 UTC
38 points
1 comment34 min readLW link
(thezvi.wordpress.com)

Global Risks Weekly Roundup #19/​2025: In­dia/​Pak­istan ceasefire, US/​China tar­iffs deal & OpenAI non­profit control

NunoSempere12 May 2025 17:08 UTC
10 points
1 comment13 min readLW link
(blog.sentinel-team.org)

[Be­neath Psy­chol­ogy] In­tro­duc­tion Part 1: The Challenge

jimmy12 May 2025 17:01 UTC
12 points
2 comments3 min readLW link

PSA: The LessWrong Feed­back Service

JustisMills12 May 2025 16:34 UTC
211 points
12 comments2 min readLW link

Cam­bridge Bos­ton Align­ment Ini­ti­a­tive Sum­mer Re­search Fel­low­ship in AI Safety (Dead­line: May 18)

peterslattery12 May 2025 16:20 UTC
8 points
0 comments1 min readLW link

Ab­solute Zero: Re­in­forced Self-play Rea­son­ing with Zero Data

Matrice Jacobine12 May 2025 15:20 UTC
6 points
4 comments1 min readLW link
(www.arxiv.org)

AIs at the cur­rent ca­pa­bil­ity level may be im­por­tant for fu­ture safety work

ryan_greenblatt12 May 2025 14:06 UTC
82 points
2 comments4 min readLW link

[Question] Game the­ory of “Nu­clear Pri­soner’s Dilemma”—on nuk­ing rocks

CronoDAS12 May 2025 11:07 UTC
11 points
6 comments2 min readLW link

What Is Death?

Mati_Roy12 May 2025 2:14 UTC
6 points
0 comments1 min readLW link
(preservinghope.substack.com)

Highly Opinionated Ad­vice on How to Write ML Papers

Neel Nanda12 May 2025 1:59 UTC
73 points
4 comments32 min readLW link

Ab­solute Zero: Alpha Zero for LLM

alapmi11 May 2025 20:42 UTC
23 points
16 comments1 min readLW link

AGI will re­sult from an ecosys­tem not a sin­gle firm

hamish_low11 May 2025 20:06 UTC
6 points
1 comment6 min readLW link
(cambrianr.substack.com)

Thou shalt not com­mand an al­ighned AI

Martin Vlach11 May 2025 20:02 UTC
0 points
4 comments1 min readLW link

[Question] How do I de­sign long prompts for think­ing zero shot sys­tems with dis­tinct equally dis­tributed prompt sec­tions (mis­sion, goals, mem­o­ries, how-to-re­spond,… etc) and how to main­tain llm co­her­ence?

ollie_11 May 2025 19:32 UTC
2 points
5 comments1 min readLW link

a con­fu­sion about prefer­ence orderings

nostalgebraist11 May 2025 19:30 UTC
93 points
39 comments11 min readLW link

[Book Trans­la­tion] Three Days in Dwarfland

Viliam11 May 2025 17:54 UTC
27 points
6 comments1 min readLW link

Bet­ter Air Purifiers

jefftk11 May 2025 16:50 UTC
71 points
21 comments3 min readLW link
(www.jefftk.com)

Align­ing Agents, Tools, and Simulators

11 May 2025 7:59 UTC
22 points
2 comments6 min readLW link

Con­sider not donat­ing un­der $100 to poli­ti­cal candidates

DanielFilan11 May 2025 3:20 UTC
140 points
32 comments1 min readLW link
(danielfilan.com)

Somerville Porch­fest 2025

jefftk11 May 2025 2:00 UTC
15 points
1 comment2 min readLW link
(www.jefftk.com)

It’s Okay to Feel Bad for a Bit

moridinamael10 May 2025 23:24 UTC
141 points
34 comments3 min readLW link

G.D. as Cap­i­tal­ist Evolu­tion, and the claim for hu­man­ity’s (tem­po­rary) up­per hand

Martin Vlach10 May 2025 21:18 UTC
8 points
3 comments1 min readLW link

Book Re­view: “En­coun­ters with Ein­stein” by Heisenberg

Baram Sosis10 May 2025 20:55 UTC
31 points
6 comments7 min readLW link

Where is the YIMBY move­ment for health­care?

jasoncrawford10 May 2025 20:36 UTC
20 points
10 comments2 min readLW link
(newsletter.rootsofprogress.org)