The Bel­l­man equa­tion does not ap­ply to bounded rationality

Christopher King26 Jun 2025 23:01 UTC
17 points
2 comments1 min readLW link

Re­cent and fore­casted rates of soft­ware and hard­ware progress

elifland26 Jun 2025 22:37 UTC
46 points
0 comments8 min readLW link

Too Many Defi­ni­tions of Consciousness

Commander Zander26 Jun 2025 22:22 UTC
7 points
2 comments1 min readLW link

May-June 2025 Progress in Guaran­teed Safe AI

Quinn26 Jun 2025 21:30 UTC
8 points
0 comments4 min readLW link
(gsai.substack.com)

How many GPUs are mar­kets ex­pect­ing?

CaseyMilkweed26 Jun 2025 21:17 UTC
6 points
0 comments3 min readLW link
(caseymilkweed.substack.com)

A Guide For LLM-As­sisted Web Research

26 Jun 2025 18:39 UTC
46 points
3 comments7 min readLW link

RLAIF/​RLHF for Public Value Align­ment En­hanc­ing Trans­parency in LLMs

Jada4226 Jun 2025 18:32 UTC
1 point
0 comments2 min readLW link

If Mo­ral Real­ism is true, then the Orthog­o­nal­ity Th­e­sis is false.

Eye You26 Jun 2025 18:31 UTC
6 points
13 comments1 min readLW link

The Cadca Tran­si­tion Map—Nav­i­gat­ing the Path to the ASI Singleton

cadca26 Jun 2025 18:30 UTC
1 point
0 comments10 min readLW link

Get­ting To and From Monism

unication26 Jun 2025 18:28 UTC
−5 points
21 comments3 min readLW link

AI #122: Pay­ing The Mar­ket Price

Zvi26 Jun 2025 18:10 UTC
36 points
2 comments40 min readLW link
(thezvi.wordpress.com)

Love Is­land USA Sea­son 7 Epi­sode 20: What Could The Pro­duc­ers Be Thinking

Zvi26 Jun 2025 17:31 UTC
17 points
14 comments14 min readLW link

The need to rel­a­tivise in de­bate

26 Jun 2025 16:23 UTC
31 points
2 comments5 min readLW link

Prover­bial Corollaries

Jordan Rubin26 Jun 2025 15:25 UTC
11 points
0 comments2 min readLW link
(jordanmrubin.substack.com)

How meta­phys­i­cal be­liefs shape crit­i­cal as­pects of AI development

Jáchym Fibír26 Jun 2025 15:13 UTC
−9 points
8 comments8 min readLW link
(www.phiand.ai)

The In­dus­trial Explosion

26 Jun 2025 14:41 UTC
128 points
70 comments15 min readLW link
(www.forethought.org)

The Ice­berg The­ory of Meaning

Richard Juggins26 Jun 2025 12:13 UTC
10 points
9 comments5 min readLW link

If we get things right, AI could have huge benefits

26 Jun 2025 8:19 UTC
5 points
0 comments1 min readLW link

Ad­vanced AI is a big deal even if we don’t lose control

26 Jun 2025 8:19 UTC
8 points
0 comments2 min readLW link

Defeat may be ir­re­versibly catastrophic

26 Jun 2025 8:19 UTC
5 points
0 comments2 min readLW link

If Not Now, When?

Yair Halberstadt26 Jun 2025 6:10 UTC
31 points
3 comments1 min readLW link

How Much Data From a Se­quenc­ing Run?

jefftk26 Jun 2025 2:30 UTC
13 points
0 comments2 min readLW link
(www.jefftk.com)

The Prac­ti­cal Value of Flawed Models: A Re­sponse to tito­tal’s AI 2027 Critique

Michelle_Ma25 Jun 2025 22:15 UTC
7 points
1 comment6 min readLW link

I Tested LLM Agents on Sim­ple Safety Rules. They Failed in Sur­pris­ing and In­for­ma­tive Ways.

Ram Potham25 Jun 2025 21:39 UTC
9 points
12 comments6 min readLW link

Tech for Thinking

sarahconstantin25 Jun 2025 21:30 UTC
60 points
9 comments7 min readLW link
(sarahconstantin.substack.com)

Me­mory De­cod­ing Jour­nal Club: Sys­tems con­soli­da­tion re­or­ga­nizes hip­pocam­pal en­gram cir­cuitry

Devin Ward25 Jun 2025 21:21 UTC
3 points
0 comments1 min readLW link

Mak­ing Sense of Con­scious­ness Part 1: Per­cep­tual Awareness

sarahconstantin25 Jun 2025 21:10 UTC
19 points
0 comments9 min readLW link
(sarahconstantin.substack.com)

Dou­ble Pod­cast Drop on AI Safety

jacobhaimes25 Jun 2025 20:11 UTC
5 points
0 comments1 min readLW link

Is there a loom­ing Cul­tural Om­ni­cide?

Jared M.25 Jun 2025 18:18 UTC
24 points
7 comments5 min readLW link

A Method­ol­o­gist’s Apology

adamShimi25 Jun 2025 16:52 UTC
13 points
0 comments9 min readLW link
(formethods.substack.com)

Me­la­tonin Self-Ex­per­i­ment Results

silentbob25 Jun 2025 15:58 UTC
60 points
6 comments8 min readLW link

In­ter­stel­lar travel will prob­a­bly doom the long-term future

Jordan Stone25 Jun 2025 15:32 UTC
29 points
6 comments16 min readLW link

Sum­mary of John Halstead’s Book-Length Re­port on Ex­is­ten­tial Risks From Cli­mate Change

Bentham's Bulldog25 Jun 2025 15:14 UTC
44 points
14 comments21 min readLW link

Lurk­ing in the Noise

J Bostock25 Jun 2025 13:36 UTC
37 points
2 comments4 min readLW link

New Paper: Am­bigu­ous On­line Learning

Vanessa Kosoy25 Jun 2025 9:14 UTC
30 points
2 comments1 min readLW link
(arxiv.org)

Emer­gence of Si­mu­la­tors and Agents

25 Jun 2025 6:59 UTC
21 points
0 comments5 min readLW link

Defin­ing Cor­rigible and Use­ful Goals

Rubi J. Hudson25 Jun 2025 3:51 UTC
38 points
2 comments24 min readLW link

Mul­tispecies Me­tage­nomic Calibration

jefftk25 Jun 2025 2:50 UTC
12 points
0 comments1 min readLW link
(www.jefftk.com)

A regime-change power-vac­uum con­jec­ture about group belief

TsviBT24 Jun 2025 23:16 UTC
41 points
16 comments3 min readLW link

Ap­ply to be a men­tor in SPAR!

agucova24 Jun 2025 23:00 UTC
10 points
0 comments1 min readLW link

Machines of Faith­ful Obedience

Boaz Barak24 Jun 2025 22:06 UTC
41 points
19 comments10 min readLW link

Gra­di­ent Des­cent on To­ken In­put Embeddings

KAP24 Jun 2025 20:24 UTC
8 points
0 comments6 min readLW link

A crisis simu­la­tion changed how I think about AI risk

sjadler24 Jun 2025 20:04 UTC
5 points
0 comments2 min readLW link
(open.substack.com)

Towards a the­ory of lo­cal altruism

DMMF24 Jun 2025 19:39 UTC
11 points
1 comment5 min readLW link
(notnottalmud.substack.com)

Why “train­ing against schem­ing” is hard

Marius Hobbhahn24 Jun 2025 19:08 UTC
66 points
2 comments12 min readLW link

An­a­lyz­ing A Cri­tique Of The AI 2027 Timeline Forecasts

Zvi24 Jun 2025 18:50 UTC
76 points
38 comments30 min readLW link
(thezvi.wordpress.com)

What does 10x-ing effec­tive com­pute get you?

ryan_greenblatt24 Jun 2025 18:33 UTC
55 points
10 comments12 min readLW link

My pitch for the AI Village

Daniel Kokotajlo24 Jun 2025 15:00 UTC
178 points
35 comments5 min readLW link

An Anal­ogy for Interpretability

Roman Malov24 Jun 2025 14:56 UTC
13 points
2 comments2 min readLW link

The V&V method—A step to­wards safer AGI

Yoav Hollander24 Jun 2025 13:42 UTC
20 points
1 comment1 min readLW link
(blog.foretellix.com)