How Does A Blind Model See The Earth?

henry11 Aug 2025 19:58 UTC
473 points
38 comments7 min readLW link
(outsidetext.substack.com)

Four ways learn­ing Econ makes peo­ple dumber re: fu­ture AI

Steven Byrnes21 Aug 2025 17:52 UTC
359 points
49 comments6 min readLW link
(x.com)

AI In­duced Psy­chosis: A shal­low investigation

Tim Hua26 Aug 2025 20:03 UTC
357 points
43 comments26 min readLW link

The Problem

5 Aug 2025 21:40 UTC
313 points
218 comments26 min readLW link

How an­ti­ci­pa­tory cover-ups go wrong

Kaj_Sotala8 Aug 2025 10:26 UTC
295 points
25 comments6 min readLW link

Ban­ning Said Ach­miz (and broader thoughts on mod­er­a­tion)

habryka22 Aug 2025 23:02 UTC
244 points
395 comments30 min readLW link

Church Plant­ing: When Ven­ture Cap­i­tal Finds Jesus

Elizabeth16 Aug 2025 19:40 UTC
226 points
23 comments16 min readLW link
(acesounderglass.com)

An epistemic ad­van­tage of work­ing as a moderate

Buck20 Aug 2025 17:47 UTC
217 points
96 comments4 min readLW link

Emo­tions Make Sense

DaystarEld3 Aug 2025 7:03 UTC
207 points
40 comments21 min readLW link
(daystareld.com)

Hyper­bolic model fits METR ca­pa­bil­ities es­ti­mate worse than ex­po­nen­tial model

gjm19 Aug 2025 15:12 UTC
201 points
9 comments4 min readLW link

Will Any Crap Cause Emer­gent Misal­ign­ment?

J Bostock27 Aug 2025 18:20 UTC
192 points
37 comments3 min readLW link

Should you make stone tools?

Alex_Altair14 Aug 2025 0:15 UTC
190 points
48 comments3 min readLW link

Be­fore LLM Psy­chosis, There Was Yes-Man Psychosis

johnswentworth25 Aug 2025 17:47 UTC
186 points
20 comments3 min readLW link

Many pre­dic­tion mar­kets would be bet­ter off as batched auctions

William Howard2 Aug 2025 12:04 UTC
173 points
21 comments5 min readLW link
(antidiluvian.substack.com)

Some­body in­vented a bet­ter bookmark

Alex_Altair14 Aug 2025 17:57 UTC
173 points
22 comments2 min readLW link

My AGI timeline up­dates from GPT-5 (and 2025 so far)

ryan_greenblatt20 Aug 2025 16:11 UTC
163 points
14 comments4 min readLW link

Un­der­dog bias rules ev­ery­thing around me

Richard_Ngo17 Aug 2025 19:21 UTC
159 points
53 comments7 min readLW link
(www.mindthefuture.info)

Open Global In­vest­ment as a Gover­nance Model for AGI

Nick Bostrom27 Aug 2025 17:42 UTC
152 points
47 comments39 min readLW link
(nickbostrom.com)

My In­ter­view With Cade Metz on His Re­port­ing About Lighthaven

Zack_M_Davis17 Aug 2025 2:30 UTC
151 points
15 comments5 min readLW link

Re: re­cent An­thropic safety research

Eliezer Yudkowsky6 Aug 2025 22:52 UTC
145 points
22 comments5 min readLW link
(x.com)

METR’s Eval­u­a­tion of GPT-5

GradientDissenter7 Aug 2025 22:17 UTC
141 points
15 comments20 min readLW link
(metr.github.io)

The Inkhaven Residency

Ben Pace2 Aug 2025 18:51 UTC
134 points
35 comments3 min readLW link

Train­ing a Re­ward Hacker De­spite Perfect Labels

14 Aug 2025 23:57 UTC
132 points
45 comments4 min readLW link

SB-1047 Doc­u­men­tary: The Post-Mortem

Michaël Trazzi1 Aug 2025 21:42 UTC
130 points
0 comments5 min readLW link

(∃ Stochas­tic Nat­u­ral La­tent) Im­plies (∃ Deter­minis­tic Nat­u­ral La­tent)

22 Aug 2025 21:46 UTC
126 points
8 comments9 min readLW link

Towards Align­ment Au­dit­ing as a Num­bers-Go-Up Science

Sam Marks4 Aug 2025 22:30 UTC
123 points
15 comments6 min readLW link

Agent foun­da­tions: not re­ally math, not re­ally science

Alex_Altair17 Aug 2025 5:48 UTC
114 points
25 comments5 min readLW link

The Egyp­tian Mam­luks as case study for AI take-over

Buddenbroke19 Aug 2025 16:46 UTC
113 points
6 comments7 min readLW link

The Bone-Chilling Evil of Fac­tory Farm­ing

Bentham's Bulldog12 Aug 2025 18:02 UTC
109 points
11 comments6 min readLW link

Why Lat­ter-day Saints Have Strong Communities

Jeffrey Heninger17 Aug 2025 4:20 UTC
102 points
29 comments9 min readLW link

METR Re­search Up­date: Al­gorith­mic vs. Holis­tic Evaluation

David Rein13 Aug 2025 22:47 UTC
101 points
7 comments1 min readLW link
(metr.org)

[Question] In­scrutabil­ity was always in­evitable, right?

Steven Byrnes6 Aug 2025 21:57 UTC
99 points
33 comments2 min readLW link

At­tach­ing re­quire­ments to model re­leases has se­ri­ous down­sides (rel­a­tive to a differ­ent dead­line for these re­quire­ments)

ryan_greenblatt27 Aug 2025 17:04 UTC
99 points
2 comments3 min readLW link

Von Neu­mann’s Fal­lacy and You

incident-recipient28 Aug 2025 15:52 UTC
98 points
29 comments4 min readLW link

Yud­kowsky on “Don’t use p(doom)”

Raemon22 Aug 2025 23:44 UTC
98 points
39 comments4 min readLW link

Sum­mary of our Work­shop on Post-AGI Outcomes

29 Aug 2025 17:14 UTC
96 points
3 comments3 min readLW link

Gen­er­al­ized Com­ing Out Of The Closet

johnswentworth12 Aug 2025 21:38 UTC
92 points
51 comments4 min readLW link

Aes­thetic Prefer­ences Can Cause Emer­gent Misalignment

Anders Woodruff26 Aug 2025 18:41 UTC
90 points
16 comments3 min readLW link

Steve Petersen seek­ing funding

abramdemski1 Aug 2025 17:03 UTC
87 points
0 comments1 min readLW link

[An­thropic] A hacker used Claude Code to au­to­mate ransomware

bohaska27 Aug 2025 14:57 UTC
86 points
25 comments3 min readLW link
(www.anthropic.com)

A Com­pre­hen­sive Guide to Running

Declan Molony25 Aug 2025 15:12 UTC
85 points
24 comments16 min readLW link

Briefly on MAPLE, and the broader community

herschel19 Aug 2025 19:45 UTC
83 points
38 comments6 min readLW link

De­bug­ging for Mid Coders

Raemon16 Aug 2025 22:32 UTC
82 points
41 comments7 min readLW link

The Col­lider Bias The­ory of (Not Quite) Everything

Jack_S16 Aug 2025 16:53 UTC
82 points
3 comments10 min readLW link

The Dark Arts As A Scaf­fold­ing Skill For Rationality

Screwtape1 Aug 2025 17:12 UTC
82 points
25 comments7 min readLW link

Shorter To­kens Are More Likely

Brendan Long24 Aug 2025 0:22 UTC
81 points
19 comments5 min readLW link
(www.brendanlong.com)

Ar­gu­ments About AI Con­scious­ness Seem Highly Mo­ti­vated And At Best Overconfident

Zvi25 Aug 2025 13:20 UTC
81 points
5 comments25 min readLW link
(thezvi.wordpress.com)

My Least Liber­tar­ian Opinion: Ban Ex­clu­sivity Deals*

Brendan Long10 Aug 2025 21:41 UTC
78 points
17 comments2 min readLW link
(www.brendanlong.com)

On closed-door AI safety research

richbc18 Aug 2025 21:59 UTC
76 points
11 comments15 min readLW link

Mech In­terp Wiki Page and Why You Should Edit Wikipedia

12 Aug 2025 17:28 UTC
75 points
16 comments1 min readLW link