How Does A Blind Model See The Earth?

henry11 Aug 2025 19:58 UTC
493 points
40 comments7 min readLW link
(outsidetext.substack.com)

AI In­duced Psy­chosis: A shal­low investigation

Tim Hua26 Aug 2025 20:03 UTC
365 points
46 comments26 min readLW link

Four ways learn­ing Econ makes peo­ple dumber re: fu­ture AI

Steven Byrnes21 Aug 2025 17:52 UTC
357 points
49 comments6 min readLW link
(x.com)

The Problem

5 Aug 2025 21:40 UTC
317 points
217 comments26 min readLW link

How an­ti­ci­pa­tory cover-ups go wrong

Kaj_Sotala8 Aug 2025 10:26 UTC
299 points
25 comments6 min readLW link

Ban­ning Said Ach­miz (and broader thoughts on mod­er­a­tion)

habryka22 Aug 2025 23:02 UTC
250 points
399 comments30 min readLW link

Church Plant­ing: When Ven­ture Cap­i­tal Finds Jesus

Elizabeth16 Aug 2025 19:40 UTC
234 points
23 comments16 min readLW link
(acesounderglass.com)

An epistemic ad­van­tage of work­ing as a moderate

Buck20 Aug 2025 17:47 UTC
215 points
96 comments4 min readLW link

Emo­tions Make Sense

DaystarEld3 Aug 2025 7:03 UTC
211 points
40 comments21 min readLW link
(daystareld.com)

Hyper­bolic model fits METR ca­pa­bil­ities es­ti­mate worse than ex­po­nen­tial model

gjm19 Aug 2025 15:12 UTC
202 points
9 comments4 min readLW link

Will Any Crap Cause Emer­gent Misal­ign­ment?

J Bostock27 Aug 2025 18:20 UTC
195 points
37 comments3 min readLW link

Should you make stone tools?

Alex_Altair14 Aug 2025 0:15 UTC
193 points
48 comments3 min readLW link

Be­fore LLM Psy­chosis, There Was Yes-Man Psychosis

johnswentworth25 Aug 2025 17:47 UTC
188 points
20 comments3 min readLW link

Some­body in­vented a bet­ter bookmark

Alex_Altair14 Aug 2025 17:57 UTC
177 points
23 comments2 min readLW link

Many pre­dic­tion mar­kets would be bet­ter off as batched auctions

William Howard2 Aug 2025 12:04 UTC
173 points
21 comments5 min readLW link
(antidiluvian.substack.com)

Un­der­dog bias rules ev­ery­thing around me

Richard_Ngo17 Aug 2025 19:21 UTC
169 points
54 comments7 min readLW link
(www.mindthefuture.info)

My AGI timeline up­dates from GPT-5 (and 2025 so far)

ryan_greenblatt20 Aug 2025 16:11 UTC
163 points
14 comments4 min readLW link

Open Global In­vest­ment as a Gover­nance Model for AGI

Nick Bostrom27 Aug 2025 17:42 UTC
153 points
48 comments39 min readLW link
(nickbostrom.com)

My In­ter­view With Cade Metz on His Re­port­ing About Lighthaven

Zack_M_Davis17 Aug 2025 2:30 UTC
152 points
15 comments5 min readLW link

Re: re­cent An­thropic safety research

Eliezer Yudkowsky6 Aug 2025 22:52 UTC
150 points
23 comments5 min readLW link
(x.com)

METR’s Eval­u­a­tion of GPT-5

GradientDissenter7 Aug 2025 22:17 UTC
141 points
15 comments20 min readLW link
(metr.github.io)

Train­ing a Re­ward Hacker De­spite Perfect Labels

14 Aug 2025 23:57 UTC
137 points
45 comments4 min readLW link

The Inkhaven Residency

Ben Pace2 Aug 2025 18:51 UTC
137 points
39 comments3 min readLW link

SB-1047 Doc­u­men­tary: The Post-Mortem

Michaël Trazzi1 Aug 2025 21:42 UTC
130 points
0 comments5 min readLW link

Towards Align­ment Au­dit­ing as a Num­bers-Go-Up Science

Sam Marks4 Aug 2025 22:30 UTC
127 points
15 comments6 min readLW link

(∃ Stochas­tic Nat­u­ral La­tent) Im­plies (∃ Deter­minis­tic Nat­u­ral La­tent)

22 Aug 2025 21:46 UTC
126 points
10 comments9 min readLW link

Agent foun­da­tions: not re­ally math, not re­ally science

Alex_Altair17 Aug 2025 5:48 UTC
119 points
29 comments5 min readLW link

The Egyp­tian Mam­luks as case study for AI take-over

Buddenbroke19 Aug 2025 16:46 UTC
113 points
6 comments7 min readLW link

The Bone-Chilling Evil of Fac­tory Farm­ing

Bentham's Bulldog12 Aug 2025 18:02 UTC
111 points
11 comments6 min readLW link

Sum­mary of our Work­shop on Post-AGI Outcomes

29 Aug 2025 17:14 UTC
107 points
3 comments3 min readLW link

Why Lat­ter-day Saints Have Strong Communities

Jeffrey Heninger17 Aug 2025 4:20 UTC
102 points
29 comments9 min readLW link

METR Re­search Up­date: Al­gorith­mic vs. Holis­tic Evaluation

David Rein13 Aug 2025 22:47 UTC
101 points
7 comments1 min readLW link
(metr.org)

Von Neu­mann’s Fal­lacy and You

incident-recipient28 Aug 2025 15:52 UTC
100 points
29 comments4 min readLW link

Yud­kowsky on “Don’t use p(doom)”

Raemon22 Aug 2025 23:44 UTC
100 points
40 comments4 min readLW link

[Question] In­scrutabil­ity was always in­evitable, right?

Steven Byrnes6 Aug 2025 21:57 UTC
99 points
33 comments2 min readLW link

At­tach­ing re­quire­ments to model re­leases has se­ri­ous down­sides (rel­a­tive to a differ­ent dead­line for these re­quire­ments)

ryan_greenblatt27 Aug 2025 17:04 UTC
99 points
2 comments3 min readLW link

Shorter To­kens Are More Likely

Brendan Long24 Aug 2025 0:22 UTC
98 points
19 comments5 min readLW link
(www.brendanlong.com)

Gen­er­al­ized Com­ing Out Of The Closet

johnswentworth12 Aug 2025 21:38 UTC
92 points
59 comments4 min readLW link

Aes­thetic Prefer­ences Can Cause Emer­gent Misalignment

Anders Woodruff26 Aug 2025 18:41 UTC
92 points
17 comments3 min readLW link

A Com­pre­hen­sive Guide to Running

Mr. Keating25 Aug 2025 15:12 UTC
88 points
24 comments16 min readLW link

Steve Petersen seek­ing funding

abramdemski1 Aug 2025 17:03 UTC
87 points
0 comments1 min readLW link

Per­ma­nent Disem­pow­er­ment is the Baseline

Vladimir_Nesov4 Aug 2025 17:43 UTC
87 points
23 comments6 min readLW link

[An­thropic] A hacker used Claude Code to au­to­mate ransomware

bohaska27 Aug 2025 14:57 UTC
86 points
25 comments3 min readLW link
(www.anthropic.com)

The Dark Arts As A Scaf­fold­ing Skill For Rationality

Screwtape1 Aug 2025 17:12 UTC
85 points
25 comments7 min readLW link

Ar­gu­ments About AI Con­scious­ness Seem Highly Mo­ti­vated And At Best Overconfident

Zvi25 Aug 2025 13:20 UTC
84 points
5 comments25 min readLW link
(thezvi.wordpress.com)

Briefly on MAPLE, and the broader community

herschel19 Aug 2025 19:45 UTC
83 points
38 comments6 min readLW link

De­bug­ging for Mid Coders

Raemon16 Aug 2025 22:32 UTC
82 points
41 comments7 min readLW link

The Col­lider Bias The­ory of (Not Quite) Everything

Jack_S16 Aug 2025 16:53 UTC
82 points
3 comments10 min readLW link

Say­ing Goodbye

sapphire3 Aug 2025 23:52 UTC
79 points
75 comments4 min readLW link

My Least Liber­tar­ian Opinion: Ban Ex­clu­sivity Deals*

Brendan Long10 Aug 2025 21:41 UTC
78 points
17 comments2 min readLW link
(www.brendanlong.com)