Book Review: To Explain the World

Algon · 16 Oct 2025 23:00 UTC
23 points
5 comments · 6 min read · LW link

AISN#64: New AGI Definition and Senate Bill Would Establish Liability for AI Harms

16 Oct 2025 18:06 UTC
5 points
1 comment · 5 min read · LW link
(aisafety.substack.com)

Finding Features in Neural Networks with the Empirical NTK

jylin04 · 16 Oct 2025 18:04 UTC
35 points
1 comment · 5 min read · LW link

Learning from the Luddites: Implications for a modern AI labour movement

JanWehner · 16 Oct 2025 17:11 UTC
12 points
0 comments · 8 min read · LW link

Reducing risk from scheming by studying trained-in scheming behavior

ryan_greenblatt · 16 Oct 2025 16:16 UTC
32 points
0 comments · 11 min read · LW link

Job Openings: SWE, PM, and Grants Coordinator to help improve grant-making

Ethan Ashkie · 16 Oct 2025 16:14 UTC
13 points
0 comments · 1 min read · LW link
(survivalandflourishing.com)

AI #138 Part 1: The People Demand Erotic Sycophants

Zvi · 16 Oct 2025 15:41 UTC
25 points
7 comments · 46 min read · LW link
(thezvi.wordpress.com)

Cheap Labour Everywhere

Morpheus · 16 Oct 2025 13:15 UTC
136 points
34 comments · 2 min read · LW link

Quantum immortality and AI risk – the fate of a lonely survivor

avturchin · 16 Oct 2025 11:40 UTC
8 points
0 comments · 1 min read · LW link

The Complex Universe Theory of AI Psychology

Andrew Tomazos · 16 Oct 2025 4:31 UTC
0 points
0 comments · 1 min read · LW link
(www.tomazos.com)

[CS 2881r AI Safety] [Week 5] Content Policies

16 Oct 2025 4:27 UTC
1 point
0 comments · 12 min read · LW link

Halfhaven Digest #2

Taylor G. Lunt · 16 Oct 2025 3:18 UTC
6 points
0 comments · 3 min read · LW link

Fragrance Free Confusion

jefftk · 16 Oct 2025 2:50 UTC
17 points
13 comments · 3 min read · LW link
(www.jefftk.com)

The Three Levels of Agency

Taylor G. Lunt · 16 Oct 2025 2:14 UTC
15 points
1 comment · 5 min read · LW link

Memory Decoding Journal Club: Functional connectomics reveals general wiring rule in mouse visual cortex

Devin Ward · 16 Oct 2025 1:56 UTC
1 point
0 comments · 1 min read · LW link

Electronics Mechanic → AI Safety Researcher: A 30-Month Journey to Model Welfare

probablyjonah · 16 Oct 2025 0:43 UTC
2 points
0 comments · 3 min read · LW link

Some astral energy extraction methods

Algon · 15 Oct 2025 23:22 UTC
24 points
3 comments · 2 min read · LW link

AI-202X-slowdown: can CoT-based AIs become capable of aligning the ASI?

StanislavKrym · 15 Oct 2025 22:46 UTC
18 points
0 comments · 6 min read · LW link

Monthly Roundup #35: October 2025

Zvi · 15 Oct 2025 19:50 UTC
24 points
1 comment · 49 min read · LW link
(thezvi.wordpress.com)

Rogue internal deployments via external APIs

15 Oct 2025 19:34 UTC
34 points
4 comments · 6 min read · LW link

Chemical Telescopes And The Process Of Science

sonicrocketman · 15 Oct 2025 18:05 UTC
5 points
0 comments · 4 min read · LW link
(brianschrader.com)

Updating the name of Open Philanthropy’s AI program

lukeprog · 15 Oct 2025 17:45 UTC
7 points
0 comments · 2 min read · LW link

Open Global Investment: Comparisons and Criticisms

Algon · 15 Oct 2025 17:20 UTC
15 points
0 comments · 4 min read · LW link
(aisafety.info)

We are too comfortable with AI “magic”

Baybar · 15 Oct 2025 17:00 UTC
−2 points
0 comments · 6 min read · LW link

Until the stars burn out? Assessing the stakes of AGI lock-in

MattAlexander · 15 Oct 2025 16:38 UTC
6 points
0 comments · 6 min read · LW link

Are calm introverts (like East Asians) uniquely suited for space travel & Mars missions?

David Sun · 15 Oct 2025 16:19 UTC
−4 points
2 comments · 1 min read · LW link
(davidsun.substack.com)

It will cost you nothing to “bribe” a Utilitarian

Gabriel Alfour · 15 Oct 2025 15:51 UTC
41 points
4 comments · 4 min read · LW link

How I Became a 5x Engineer with Claude Code

Gordon Seidoh Worley · 15 Oct 2025 14:10 UTC
73 points
24 comments · 7 min read · LW link
(www.uncertainupdates.com)

That Mad Olympiad

Tomás B. · 15 Oct 2025 13:45 UTC
178 points
14 comments · 14 min read · LW link

A New Global Risk: Large Comet’s Impact on Sun Could Cause Fires on Earth

avturchin · 15 Oct 2025 13:20 UTC
58 points
6 comments · 2 min read · LW link

Can LLMs Coordinate? A Simple Schelling Point Experiment

Håvard Tveit Ihle · 15 Oct 2025 12:25 UTC
35 points
11 comments · 3 min read · LW link

Humans Are Spiky (In an LLM World)

faul_sname · 15 Oct 2025 8:40 UTC
28 points
5 comments · 1 min read · LW link

Gnashing of Teeth

Martin Sustrik · 15 Oct 2025 6:11 UTC
29 points
0 comments · 4 min read · LW link
(www.250bpm.com)

Communism By Another Name

Charlie Sanders · 15 Oct 2025 2:21 UTC
−5 points
1 comment · 3 min read · LW link
(www.dailymicrofiction.com)

Situational Awareness as a Prompt for LLM Parasitism

Baybar · 15 Oct 2025 1:45 UTC
8 points
6 comments · 19 min read · LW link

Minimal Prompt Induction of Self-Talk in Base LLMs

dwmd · 15 Oct 2025 1:15 UTC
2 points
0 comments · 5 min read · LW link

The sum of its parts: composing AI control protocols

15 Oct 2025 1:11 UTC
12 points
1 comment · 11 min read · LW link

Enhancing Genomic Foundation Model Robustness through Iterative Black-Box Adversarial Training

14 Oct 2025 20:54 UTC
8 points
0 comments · 7 min read · LW link

Postrationality: An Oral History

Gordon Seidoh Worley · 14 Oct 2025 19:18 UTC
13 points
0 comments · 1 min read · LW link

Why your boss isn’t worried about AI

beyarkay · 14 Oct 2025 17:58 UTC
11 points
2 comments · 6 min read · LW link
(boydkane.com)

Humanity AI Commits $500 million to AI and Democracy Protection, AI x Security, and more

peterr · 14 Oct 2025 17:51 UTC
4 points
0 comments · 1 min read · LW link
(www.macfound.org)

Thinking Partners: Building AI-Powered Knowledge Management Systems

Quentin FEUILLADE--MONTIXI · 14 Oct 2025 17:42 UTC
18 points
3 comments · 10 min read · LW link

SS26 Color Stats

sarahconstantin · 14 Oct 2025 17:20 UTC
21 points
2 comments · 6 min read · LW link
(sarahconstantin.substack.com)

The Biochemical Beauty of Retatrutide: How GLP-1s Actually Work

Elizabeth · 14 Oct 2025 16:00 UTC
82 points
3 comments · 7 min read · LW link
(acesounderglass.com)

My views on Lesswrong

samuelshadrach · 14 Oct 2025 15:47 UTC
1 point
0 comments · 4 min read · LW link
(samuelshadrach.com)

Trade Escalation, Supply Chain Vulnerabilities and Rare Earth Metals

Zvi · 14 Oct 2025 15:30 UTC
30 points
0 comments · 9 min read · LW link
(thezvi.wordpress.com)

12 Angry Agents, or: A Plan for AI Empathy

14 Oct 2025 15:24 UTC
21 points
4 comments · 12 min read · LW link

The “Length” of “Horizons”

Adam Scholl · 14 Oct 2025 14:48 UTC
183 points
27 comments · 7 min read · LW link

My Software Setup

Morpheus · 14 Oct 2025 11:56 UTC
16 points
3 comments · 2 min read · LW link
(www.tassiloneubauer.com)

Narcissism, Echoism, and Sovereignism: A 4-D Model of Personality

Dawn Drescher · 14 Oct 2025 11:18 UTC
7 points
6 comments · 14 min read · LW link
(impartial-priorities.org)