Yud­kowsky on “Don’t use p(doom)”

Raemon22 Aug 2025 23:44 UTC
98 points
39 comments4 min readLW link

Ban­ning Said Ach­miz (and broader thoughts on mod­er­a­tion)

habryka22 Aug 2025 23:02 UTC
244 points
395 comments30 min readLW link

(∃ Stochas­tic Nat­u­ral La­tent) Im­plies (∃ Deter­minis­tic Nat­u­ral La­tent)

22 Aug 2025 21:46 UTC
126 points
8 comments9 min readLW link

One more rea­son for AI ca­pa­ble of in­de­pen­dent moral rea­son­ing: al­ign­ment it­self and cause prioritisation

Michele Campolo22 Aug 2025 15:53 UTC
−3 points
0 comments3 min readLW link

The Bud­dhism & AI Initiative

Chris Scammell22 Aug 2025 15:50 UTC
29 points
2 comments2 min readLW link

Deep­Seek v3.1 Is Not Hav­ing a Moment

Zvi22 Aug 2025 15:50 UTC
40 points
2 comments3 min readLW link
(thezvi.wordpress.com)

Do­ing good… best?

Michele Campolo22 Aug 2025 15:48 UTC
−1 points
6 comments2 min readLW link

With enough knowl­edge, any con­scious agent acts morally

Michele Campolo22 Aug 2025 15:44 UTC
−2 points
9 comments36 min readLW link

If we can ed­u­cate AIs, why not ap­ply that ed­u­ca­tion to peo­ple?

P. João22 Aug 2025 14:04 UTC
5 points
0 comments2 min readLW link

CEO of Microsoft AI’s “Seem­ingly Con­scious AI” Post

Stephen Martin22 Aug 2025 13:58 UTC
64 points
8 comments8 min readLW link

Could we have pre­dicted emer­gent mis­al­ign­ment a pri­ori us­ing un­su­per­vised be­havi­our elic­i­ta­tion?

Daniel Tan22 Aug 2025 13:42 UTC
6 points
0 comments1 min readLW link

An In­tro­duc­tion to Credal Sets and In­fra-Bayes Learnability

Brittany Gelb22 Aug 2025 13:03 UTC
33 points
2 comments13 min readLW link

Le­gal Per­son­hood—Con­tracts (Part 2)

Stephen Martin22 Aug 2025 4:53 UTC
5 points
0 comments2 min readLW link

When Money Be­comes Power

Gabriel Alfour22 Aug 2025 4:14 UTC
61 points
16 comments7 min readLW link
(cognition.cafe)

Proof Sec­tion to an In­tro­duc­tion to Credal Sets and In­fra-Bayes Learnability

Brittany Gelb21 Aug 2025 23:11 UTC
13 points
0 comments10 min readLW link

Re­sam­pling Con­serves Re­dun­dancy (Ap­prox­i­mately)

21 Aug 2025 22:43 UTC
68 points
2 comments6 min readLW link

The anti-frag­ile culture

lincolnquirk21 Aug 2025 21:41 UTC
30 points
1 comment10 min readLW link

A Con­ser­va­tive Vi­sion For AI Alignment

21 Aug 2025 18:14 UTC
25 points
34 comments12 min readLW link

Emer­gent moral­ity in AI weak­ens the Orthog­o­nal­ity Thesis

dawnstrata21 Aug 2025 17:57 UTC
−1 points
3 comments11 min readLW link

Four ways learn­ing Econ makes peo­ple dumber re: fu­ture AI

Steven Byrnes21 Aug 2025 17:52 UTC
360 points
49 comments6 min readLW link
(x.com)

Me­mory De­cod­ing Jour­nal Club: Be­hav­ioral time scale synap­tic plas­tic­ity un­der­lies CA1 place fields

Devin Ward21 Aug 2025 16:13 UTC
1 point
0 comments1 min readLW link

Could one coun­try out­grow the rest of the world?

Tom Davidson21 Aug 2025 15:32 UTC
45 points
23 comments17 min readLW link
(newsletter.forethought.org)

What is “Mean­ing­ness”

21 Aug 2025 14:57 UTC
11 points
0 comments15 min readLW link

AI #130: Talk­ing Past The Sale

Zvi21 Aug 2025 13:50 UTC
37 points
4 comments60 min readLW link
(thezvi.wordpress.com)

Cri­tiques of FDT Often Stem From Con­fu­sion About New­comblike Problems

Heighn21 Aug 2025 13:19 UTC
7 points
19 comments5 min readLW link

Le­gal Per­son­hood—Con­tracts (Part 1)

Stephen Martin21 Aug 2025 5:23 UTC
10 points
0 comments7 min readLW link

Be­ing hon­est with AIs

Lukas Finnveden21 Aug 2025 3:57 UTC
63 points
6 comments17 min readLW link
(blog.redwoodresearch.org)

ACX Fall Meetup 2025 @ Klang Valley, Malaysia

Yi-Yang21 Aug 2025 3:34 UTC
2 points
0 comments1 min readLW link

French Non-Profit Law: As­so­ci­a­tions are as cool as Amer­i­can churches

Lucie Philippon20 Aug 2025 22:02 UTC
40 points
6 comments3 min readLW link

AI Safety Comms Retreat

Vishakha20 Aug 2025 20:54 UTC
3 points
0 comments1 min readLW link

The trou­ble with “en­light­en­ment”

Gordon Seidoh Worley20 Aug 2025 19:00 UTC
15 points
4 comments4 min readLW link
(uncertainupdates.substack.com)

An epistemic ad­van­tage of work­ing as a moderate

Buck20 Aug 2025 17:47 UTC
217 points
96 comments4 min readLW link

My AGI timeline up­dates from GPT-5 (and 2025 so far)

ryan_greenblatt20 Aug 2025 16:11 UTC
163 points
14 comments4 min readLW link

come work on dan­ger­ous ca­pa­bil­ity miti­ga­tions at Anthropic

Dave Orr20 Aug 2025 15:11 UTC
31 points
7 comments1 min readLW link

AI Com­pan­ion Conditions

Zvi20 Aug 2025 15:00 UTC
54 points
2 comments10 min readLW link
(thezvi.wordpress.com)

[Question] What to do with pre-or­der if I live in Rus­sia?

EniScien20 Aug 2025 13:39 UTC
10 points
1 comment2 min readLW link

Coh and the ripped codice, a tale of the horns effect

AdamLacerdo20 Aug 2025 10:10 UTC
−3 points
0 comments6 min readLW link

Lo­cal De­tours On A Nar­row Path: How might AI treaties fail in China?

Jack_S20 Aug 2025 9:09 UTC
21 points
0 comments14 min readLW link
(torchestogether.substack.com)

Le­gal Per­son­hood—Tort Li­a­bil­ity (Part 3)

Stephen Martin20 Aug 2025 6:33 UTC
4 points
0 comments2 min readLW link

A case against suc­ces­sion­ism for Galaxy-brain Gavin

Freddie19 Aug 2025 23:55 UTC
2 points
4 comments4 min readLW link

Distributed Multi-Armed Bandits

AlphaZard19 Aug 2025 23:46 UTC
19 points
2 comments6 min readLW link

Briefly on MAPLE, and the broader community

herschel19 Aug 2025 19:45 UTC
83 points
38 comments6 min readLW link

The Egyp­tian Mam­luks as case study for AI take-over

Buddenbroke19 Aug 2025 16:46 UTC
113 points
6 comments7 min readLW link

Do model eval­u­a­tions fall prey to the Good(er) Reg­u­la­tor The­o­rem?

testingthewaters19 Aug 2025 16:19 UTC
6 points
1 comment2 min readLW link

ACX au­tumn meetup in Madrid

a.olmotitos19 Aug 2025 16:09 UTC
1 point
2 comments1 min readLW link

What’s your AI think­ing?

Shoshannah Tekofsky19 Aug 2025 15:20 UTC
22 points
3 comments8 min readLW link
(theaidigest.org)

Hyper­bolic model fits METR ca­pa­bil­ities es­ti­mate worse than ex­po­nen­tial model

gjm19 Aug 2025 15:12 UTC
201 points
9 comments4 min readLW link

Two new AI fu­tures mapped out: Tool AI and d/​acc (linkpost to EA Fo­rum post)

19 Aug 2025 14:30 UTC
9 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

Monthly Roundup #33: Au­gust 2025

Zvi19 Aug 2025 12:40 UTC
41 points
6 comments57 min readLW link
(thezvi.wordpress.com)

Cap­i­tal and Industry

ykevinzhang19 Aug 2025 12:35 UTC
15 points
2 comments10 min readLW link