Enhancing Genomic Foundation Model Robustness through Iterative Black-Box Adversarial Training

14 Oct 2025 20:54 UTC
8 points
0 comments · 7 min read · LW link

Postrationality: An Oral History

Gordon Seidoh Worley · 14 Oct 2025 19:18 UTC
13 points
0 comments · 1 min read · LW link

Why your boss isn’t worried about AI

beyarkay · 14 Oct 2025 17:58 UTC
11 points
2 comments · 6 min read · LW link
(boydkane.com)

Humanity AI Commits $500 million to AI and Democracy Protection, AI x Security, and more

peterr · 14 Oct 2025 17:51 UTC
4 points
0 comments · 1 min read · LW link
(www.macfound.org)

Thinking Partners: Building AI-Powered Knowledge Management Systems

Quentin FEUILLADE--MONTIXI · 14 Oct 2025 17:42 UTC
18 points
3 comments · 10 min read · LW link

SS26 Color Stats

sarahconstantin · 14 Oct 2025 17:20 UTC
21 points
2 comments · 6 min read · LW link
(sarahconstantin.substack.com)

The Biochemical Beauty of Retatrutide: How GLP-1s Actually Work

Elizabeth · 14 Oct 2025 16:00 UTC
82 points
3 comments · 7 min read · LW link
(acesounderglass.com)

My views on Lesswrong

samuelshadrach · 14 Oct 2025 15:47 UTC
1 point
0 comments · 4 min read · LW link
(samuelshadrach.com)

Trade Escalation, Supply Chain Vulnerabilities and Rare Earth Metals

Zvi · 14 Oct 2025 15:30 UTC
30 points
0 comments · 9 min read · LW link
(thezvi.wordpress.com)

12 Angry Agents, or: A Plan for AI Empathy

14 Oct 2025 15:24 UTC
21 points
4 comments · 12 min read · LW link

The “Length” of “Horizons”

Adam Scholl · 14 Oct 2025 14:48 UTC
183 points
27 comments · 7 min read · LW link

My Software Setup

Morpheus · 14 Oct 2025 11:56 UTC
16 points
3 comments · 2 min read · LW link
(www.tassiloneubauer.com)

Narcissism, Echoism, and Sovereignism: A 4-D Model of Personality

Dawn Drescher · 14 Oct 2025 11:18 UTC
7 points
6 comments · 14 min read · LW link
(impartial-priorities.org)

Current Language Models Struggle to Reason in Ciphered Language

14 Oct 2025 9:08 UTC
78 points
7 comments · 5 min read · LW link

A personal take on why you should work at Forethought (maybe)

Lizka · 14 Oct 2025 8:59 UTC
26 points
2 comments · 9 min read · LW link

Discrete Generation

James Camacho · 14 Oct 2025 1:38 UTC
4 points
3 comments · 3 min read · LW link
(github.com)

Survey Results: Far UVC and Glycol Vapors

jefftk · 14 Oct 2025 1:00 UTC
16 points
0 comments · 4 min read · LW link
(www.jefftk.com)

How AI Manipulates—A Case Study

Adele Lopez · 14 Oct 2025 0:54 UTC
78 points
27 comments · 13 min read · LW link

Recontextualization Mitigates Specification Gaming Without Modifying the Specification

14 Oct 2025 0:53 UTC
125 points
15 comments · 9 min read · LW link

AI Psychosis, with Tim Hua and Adele Lopez

14 Oct 2025 0:27 UTC
14 points
0 comments · 1 min read · LW link

[Question] What is Lesswrong good for?

Algon · 13 Oct 2025 23:30 UTC
51 points
6 comments · 3 min read · LW link

Predictability is Underrated

Algon · 13 Oct 2025 22:40 UTC
20 points
0 comments · 2 min read · LW link

The Mom Test for AI Extinction Scenarios

Taylor G. Lunt · 13 Oct 2025 22:21 UTC
70 points
61 comments · 5 min read · LW link

If Anyone Builds It Everyone Dies, a semi-outsider review

dvd · 13 Oct 2025 22:10 UTC
212 points
67 comments · 15 min read · LW link

Is There a Sound Argument for Generality in AI?

Juan Cadile · 13 Oct 2025 21:49 UTC
5 points
1 comment · 6 min read · LW link

Words make us Dumb #1: The “Point”lessness of Knowledge

Enmai.MCimbu · 13 Oct 2025 19:53 UTC
−9 points
5 comments · 6 min read · LW link

Reasons to sign a statement to ban superintelligence (+ FAQ for those on the fence)

13 Oct 2025 19:00 UTC
83 points
4 comments · 13 min read · LW link

Water Above the Ocean

Celer · 13 Oct 2025 16:00 UTC
15 points
2 comments · 5 min read · LW link
(keller.substack.com)

OpenAI #15: More on OpenAI’s Paranoid Lawfare Against Advocates of SB 53

Zvi · 13 Oct 2025 15:00 UTC
104 points
2 comments · 23 min read · LW link
(thezvi.wordpress.com)

Pause House, Blackpool

Greg C · 13 Oct 2025 11:36 UTC
79 points
2 comments · 1 min read · LW link
(gregcolbourn.substack.com)

Global vs. Local feedback

Max Dalton · 13 Oct 2025 10:33 UTC
8 points
0 comments · 2 min read · LW link
(custodienda.substack.com)

Live Governance: AI tools for coordination without centralisation

mbuch · 13 Oct 2025 8:24 UTC
15 points
0 comments · 12 min read · LW link

[CS 2881r] [Week 6] Recursive Self-Improvement

Joshua Qin · 13 Oct 2025 6:56 UTC
4 points
0 comments · 6 min read · LW link

Sublinear Utility in Population and other Uncommon Utilitarianism

Alice Blair · 13 Oct 2025 6:19 UTC
68 points
15 comments · 7 min read · LW link

RiskiPedia

gavinandresen · 13 Oct 2025 4:26 UTC
17 points
4 comments · 1 min read · LW link

Don’t Mock Yourself

Algon · 12 Oct 2025 22:40 UTC
163 points
18 comments · 2 min read · LW link

Experiment: Test your priors on Bernoulli processes.

joseph_c · 12 Oct 2025 22:09 UTC
20 points
15 comments · 1 min read · LW link

The Problem of Consciousness and AI as an Ethical Subject

Nicolas Villarreal · 12 Oct 2025 18:30 UTC
−5 points
0 comments · 14 min read · LW link

Dr Evil & Realpolitik

James Stephen Brown · 12 Oct 2025 17:30 UTC
16 points
0 comments · 5 min read · LW link
(nonzerosum.games)

How do we know when something is deserving of welfare?

Dom Polsinelli · 12 Oct 2025 16:27 UTC
11 points
7 comments · 4 min read · LW link

The Narcissistic Spectrum

Dawn Drescher · 12 Oct 2025 15:46 UTC
32 points
0 comments · 22 min read · LW link
(impartial-priorities.org)

Non-copyability as a security feature

tailcalled · 12 Oct 2025 9:03 UTC
16 points
4 comments · 1 min read · LW link

International Programme on AI Evaluations

PabloAMC · 12 Oct 2025 7:12 UTC
3 points
0 comments · 2 min read · LW link

The Alignment Problem Isn’t Theoretical

Austin Morrissey · 12 Oct 2025 3:49 UTC
0 points
1 comment · 14 min read · LW link

If a Lioness Could Speak

Taylor G. Lunt · 12 Oct 2025 3:43 UTC
−1 points
0 comments · 2 min read · LW link

Designing for perpetual control

Remmelt · 12 Oct 2025 2:06 UTC
1 point
11 comments · 2 min read · LW link

“Naive Consequentialism” as a Thought-Terminating cliche

Jacob Goldsmith · 12 Oct 2025 0:54 UTC
−3 points
0 comments · 3 min read · LW link

[Question] How long do AI companies have to achieve significant capability gains before funding collapses?

Hide · 11 Oct 2025 23:20 UTC
41 points
8 comments · 1 min read · LW link

I wasn’t confused by Thermodynamics

Algon · 11 Oct 2025 22:20 UTC
26 points
4 comments · 2 min read · LW link

Subscribe to my Inkhaven feed!

Alex_Altair · 11 Oct 2025 20:41 UTC
21 points
3 comments · 2 min read · LW link