Arusha Per­pet­ual Chicken—an un­likely iter­ated game

James Stephen Brown6 Apr 2025 22:56 UTC
15 points
1 comment5 min readLW link
(nonzerosum.games)

How Gay is the Vat­i­can?

rba6 Apr 2025 21:27 UTC
58 points
32 comments7 min readLW link

Aus­tralia’s AI Cross­roads: Elec­tion 2025 Town Hall

Peter Horniak6 Apr 2025 21:17 UTC
1 point
0 comments1 min readLW link

The Lizard­man and the Black Hat Bobcat

Screwtape6 Apr 2025 19:02 UTC
109 points
15 comments9 min readLW link

Would this solve the (outer) al­ign­ment prob­lem, or at least help?

Wes R6 Apr 2025 18:49 UTC
−2 points
1 comment13 min readLW link

[Question] What are the fun­da­men­tal differ­ences be­tween teach­ing the AIs and hu­mans?

StanislavKrym6 Apr 2025 18:17 UTC
3 points
0 comments1 min readLW link

An “Op­ti­mistic” 2027 Timeline

Yitz6 Apr 2025 16:39 UTC
13 points
16 comments9 min readLW link

Thoughts on Creat­ing a Good Language

Towards_Keeperhood6 Apr 2025 15:57 UTC
1 point
2 comments7 min readLW link

The REPHRASE Cir­cuit: How Fine-Tun­ing En­hances LLMs to REPHRASE Text

Karthik Viswanathan6 Apr 2025 15:02 UTC
4 points
0 comments5 min readLW link

[Re­search sprint] Sin­gle-model cross­coder fea­ture ab­la­tion and steering

Thomas Read6 Apr 2025 14:42 UTC
8 points
0 comments12 min readLW link

Fer­rer, Pilar, and Me

Askwho6 Apr 2025 11:22 UTC
21 points
1 comment4 min readLW link
(open.substack.com)

FlexChunk: En­abling 100M×100M Out-of-Core SpMV (~1.8 min, ~1.7 GB RAM) with Near-Lin­ear Scaling

Daniil Strizhov6 Apr 2025 5:27 UTC
1 point
0 comments7 min readLW link

A col­lec­tion of ap­proaches to con­fronting doom, and my thoughts on them

Ruby6 Apr 2025 2:11 UTC
48 points
18 comments12 min readLW link

A Slow Guide to Con­fronting Doom

Ruby6 Apr 2025 2:10 UTC
84 points
20 comments14 min readLW link

[Linkpost] Vi­sual roadmap to strong hu­man germline engineering

TsviBT5 Apr 2025 22:22 UTC
30 points
0 comments1 min readLW link

Google Deep­Mind: An Ap­proach to Tech­ni­cal AGI Safety and Security

Rohin Shah5 Apr 2025 22:00 UTC
73 points
12 comments18 min readLW link
(arxiv.org)

In­tro­duc­tion to Rep­re­sent­ing Sen­tences as Log­i­cal Statements

Towards_Keeperhood5 Apr 2025 20:35 UTC
22 points
9 comments16 min readLW link

Me­mory De­cod­ing Jour­nal Club: A col­lab­o­ra­tion of the Car­bon­copies Foun­da­tion and BPF Aspira­tional Neuroscience

Devin Ward5 Apr 2025 20:27 UTC
1 point
0 comments1 min readLW link

Meta re­leases Llama-4 herd of models

winstonBosan5 Apr 2025 19:51 UTC
14 points
5 comments1 min readLW link

Against podcasts

Adam Zerner5 Apr 2025 19:20 UTC
35 points
19 comments4 min readLW link

What are Re­spon­si­ble Scal­ing Poli­cies (RSPs)?

5 Apr 2025 16:01 UTC
3 points
0 comments1 min readLW link
(aisafety.info)

What does Yann LeCun think about AGI? A sum­mary of his talk, “Math­e­mat­i­cal Ob­sta­cles on the Way to Hu­man-Level AI”

Adam Jones5 Apr 2025 12:21 UTC
13 points
0 comments2 min readLW link

I Have No Mouth but I Must Speak

Jack5 Apr 2025 7:42 UTC
7 points
8 comments8 min readLW link

Pre­dic­tion Mar­kets Are Mediocre

Ape in the coat5 Apr 2025 6:54 UTC
3 points
13 comments3 min readLW link

Among Us: A Sand­box for Agen­tic Deception

5 Apr 2025 6:24 UTC
110 points
7 comments7 min readLW link

Ai Cone of Prob­a­bilties—what aren’t we talk­ing about?

Marzipan5 Apr 2025 5:51 UTC
−10 points
5 comments2 min readLW link

Quar­ter Inch Cables are Devious

jefftk5 Apr 2025 2:40 UTC
13 points
4 comments1 min readLW link
(www.jefftk.com)

Most Ques­tion­able De­tails in ‘AI 2027’

Commander Zander5 Apr 2025 0:32 UTC
34 points
12 comments6 min readLW link

Karel Čapek’s ‘War with the Newts’ 1936 review

Petr 'Margot' Andreev4 Apr 2025 23:12 UTC
−10 points
1 comment1 min readLW link

How much progress ac­tu­ally hap­pens in the­o­ret­i­cal physics?

ChristianKl4 Apr 2025 23:08 UTC
32 points
32 comments1 min readLW link

Self-Repli­ca­tion: AI already can do it

Andrey Seryakov4 Apr 2025 22:37 UTC
5 points
0 comments5 min readLW link

Join Vi­tal­ist Bay: An 8-Week Longevity and Rad­i­cal Life Ex­ten­sion Event Series in Berkeley (April-May 2025)

Vitaran4 Apr 2025 21:03 UTC
6 points
0 comments1 min readLW link

Sleep peace­fully: no hid­den rea­son­ing de­tected in LLMs. Well, at least in small ones.

4 Apr 2025 20:49 UTC
17 points
2 comments7 min readLW link

AI com­pa­nies’ un­mon­i­tored in­ter­nal AI use poses se­ri­ous risks

sjadler4 Apr 2025 18:17 UTC
13 points
2 comments1 min readLW link
(stevenadler.substack.com)

Will com­pute bot­tle­necks pre­vent a soft­ware in­tel­li­gence ex­plo­sion?

Tom Davidson4 Apr 2025 17:41 UTC
77 points
25 comments12 min readLW link

Join Us for the Me­mory De­cod­ing Jour­nal Club!

Devin Ward4 Apr 2025 17:13 UTC
1 point
0 comments1 min readLW link

Align­ment fak­ing CTFs: Ap­ply to my MATS stream

joshc4 Apr 2025 16:29 UTC
61 points
0 comments4 min readLW link

LLM AGI will have mem­ory, and mem­ory changes alignment

Seth Herd4 Apr 2025 14:59 UTC
73 points
15 comments9 min readLW link

A Bunch of Ma­tryoshka SAEs

4 Apr 2025 14:53 UTC
29 points
0 comments8 min readLW link

AI CoT Rea­son­ing Is Often Unfaithful

Zvi4 Apr 2025 14:50 UTC
66 points
4 comments7 min readLW link
(thezvi.wordpress.com)

Med­i­ta­tion and Re­duced Sleep Need

niplav4 Apr 2025 14:42 UTC
36 points
8 comments3 min readLW link

Su­ing OpenAI Won’t Save the Arts

E.G. Blee-Goldman4 Apr 2025 13:42 UTC
2 points
0 comments5 min readLW link

For Policy’s Sake: Why We Must Dist­in­guish AI Safety from AI Se­cu­rity in Reg­u­la­tory Governance

Katalina Hernandez4 Apr 2025 9:16 UTC
6 points
11 comments6 min readLW link

Ex­plain­ing the Joke: Paus­ing is The Way

WillPetillo4 Apr 2025 9:04 UTC
24 points
2 comments10 min readLW link

ACX/​EA Hy­der­abad Meetup

4 Apr 2025 8:12 UTC
3 points
2 comments1 min readLW link

Tools for de­ci­sion-sup­port, de­liber­a­tion, sense-mak­ing, reasoning

David James4 Apr 2025 2:27 UTC
3 points
0 comments1 min readLW link

Cheese­cake Frosting

jefftk4 Apr 2025 2:10 UTC
10 points
9 comments1 min readLW link
(www.jefftk.com)

Chang­ing my mind about Chris­ti­ano’s ma­lign prior argument

Cole Wyeth4 Apr 2025 0:54 UTC
27 points
34 comments7 min readLW link

POTUS Pre­dic­tions Tournament

ChristianWilliams3 Apr 2025 22:48 UTC
15 points
0 comments1 min readLW link
(www.metaculus.com)

“Long” timelines to ad­vanced AI have got­ten crazy short

Matrice Jacobine3 Apr 2025 22:46 UTC
21 points
0 comments1 min readLW link
(helentoner.substack.com)