LLMs might have sub­jec­tive ex­pe­riences, but no con­cepts for them

No77e2 Jun 2025 21:18 UTC
17 points
5 comments2 min readLW link

In defense of memes (and thought-ter­mi­nat­ing clichés)

Harjas2 Jun 2025 20:18 UTC
11 points
4 comments10 min readLW link

He­donic adap­ta­tion: you should not seeks pleasure

Crazy philosopher2 Jun 2025 19:23 UTC
0 points
6 comments2 min readLW link

Un­faith­ful Rea­son­ing Can Fool Chain-of-Thought Monitoring

2 Jun 2025 19:08 UTC
78 points
17 comments3 min readLW link

Frank Her­bert’s great in­sight into hu­man agency—Muad’Dib the tool?

Nerret2 Jun 2025 18:52 UTC
2 points
1 comment1 min readLW link

Hem­ing­way Case

Martin Sustrik2 Jun 2025 18:50 UTC
19 points
2 comments1 min readLW link
(www.250bpm.com)

[Question] What AI apps are sur­pris­ingly ab­sent given cur­rent ca­pa­bil­ities?

azergante2 Jun 2025 18:46 UTC
4 points
8 comments1 min readLW link

[Be­neath Psy­chol­ogy] Chronic pain challenge part 2: the solution

jimmy2 Jun 2025 17:30 UTC
39 points
3 comments34 min readLW link

The Value Propo­si­tion of Ro­man­tic Relationships

johnswentworth2 Jun 2025 13:51 UTC
208 points
43 comments13 min readLW link

1. The challenge of un­aware­ness for im­par­tial al­tru­ist ac­tion guidance: Introduction

Anthony DiGiovanni2 Jun 2025 8:54 UTC
48 points
6 comments13 min readLW link

‘Wicked’: thoughts

KatjaGrace2 Jun 2025 6:20 UTC
25 points
3 comments3 min readLW link
(worldspiritsockpuppet.com)

Hu­man­ity needs a Ulysses Pact for AI

Lukas N.P. Egger1 Jun 2025 20:56 UTC
1 point
2 comments1 min readLW link

Text Steers Vision

Woody Gan1 Jun 2025 20:30 UTC
5 points
0 comments7 min readLW link

[Question] Pos­si­ble AI reg­u­la­tion emer­gency?

CronoDAS1 Jun 2025 20:30 UTC
19 points
1 comment1 min readLW link

Eliezer Yud­kowsky & Con­nor Leahy | AI Risk, Safety & Align­ment Q&A [4K Re­mas­ter + HQ Au­dio]

Dex Volkov1 Jun 2025 20:20 UTC
−8 points
2 comments1 min readLW link
(www.youtube.com)

Own­er­ship: the prin­ci­ple of “Deprive first, ask ques­tions later”

MillardJMelnyk1 Jun 2025 20:19 UTC
−27 points
22 comments1 min readLW link

Economists should track the speed and mag­ni­tude of AI im­ple­men­ta­tion projects

ParrotRobot1 Jun 2025 20:15 UTC
3 points
0 comments2 min readLW link

Ingroup

JenniferRM1 Jun 2025 19:47 UTC
−3 points
12 comments1 min readLW link

Ap­ply to the AI Se­cu­rity Boot­camp [Aug 4 - Aug 29]

1 Jun 2025 19:47 UTC
27 points
2 comments4 min readLW link

See­ing how well an agen­tic AI cod­ing tool can do com­pared to me us­ing an ac­tual real-world example

Massimog1 Jun 2025 19:24 UTC
32 points
2 comments1 min readLW link
(blog.massimogauthier.com)

Ni­co­tine ad­dic­tion, cloves, and need­ing to take a shit

eyesack1 Jun 2025 19:13 UTC
4 points
1 comment1 min readLW link

2nd Ger­many-wide ACX/​LW event

Fernand01 Jun 2025 13:56 UTC
1 point
0 comments1 min readLW link

An Opinionated Guide to P-Values

amitlevy491 Jun 2025 11:48 UTC
11 points
0 comments8 min readLW link
(ivy0.substack.com)

Le­gal Per­son­hood for Models: Novelli et. al & Mocanu

Stephen Martin1 Jun 2025 8:18 UTC
2 points
0 comments10 min readLW link

Is Es­ca­la­tion Inevitable?

Lennart Wijers31 May 2025 22:10 UTC
5 points
0 comments3 min readLW link

Policy En­tropy, Learn­ing, and Align­ment (Or Maybe Your LLM Needs Ther­apy)

sdeture31 May 2025 22:09 UTC
15 points
6 comments8 min readLW link

The Unseen Hand: AI’s Prob­lem Preemp­tion and the True Fu­ture of Labor

Ben Kassan31 May 2025 22:04 UTC
8 points
0 comments20 min readLW link

The 80/​20 play­book for miti­gat­ing AI schem­ing in 2025

Charbel-Raphaël31 May 2025 21:17 UTC
40 points
2 comments4 min readLW link

Col­lec­tive Ac­tion for AI Safety (June 4, NYC)

Jordan Braunstein31 May 2025 20:27 UTC
1 point
0 comments1 min readLW link

The best ap­proaches for miti­gat­ing “the in­tel­li­gence curse” (or grad­ual dis­em­pow­er­ment); my quick guesses at the best ob­ject-level interventions

ryan_greenblatt31 May 2025 18:20 UTC
78 points
19 comments5 min readLW link

Would It Be Bet­ter to Dispense with Good and Evil?

arusarda31 May 2025 16:40 UTC
−2 points
10 comments6 min readLW link

How Epistemic Col­lapse Looks from Inside

Martin Sustrik31 May 2025 16:30 UTC
9 points
11 comments1 min readLW link
(250bpm.substack.com)

When will AI au­to­mate all men­tal work, and how fast?

31 May 2025 16:18 UTC
10 points
0 comments7 min readLW link
(youtu.be)

Progress links and short notes, 2025-05-31: RPI fel­low­ship dead­line to­mor­row, Edge Es­mer­alda next week, and more

jasoncrawford31 May 2025 15:20 UTC
11 points
0 comments7 min readLW link
(newsletter.rootsofprogress.org)

House Party Dances

jefftk31 May 2025 15:20 UTC
13 points
1 comment1 min readLW link
(www.jefftk.com)

Free Will, Like Prob­a­bil­ity, is About Lo­cal Knowledge

Rob Lucas31 May 2025 14:19 UTC
4 points
6 comments16 min readLW link
(open.substack.com)

The (Unoffi­cial) Ra­tion­al­ity: A-Z Anki Deck

japancolorado31 May 2025 7:01 UTC
30 points
8 comments1 min readLW link

Zochi Pub­lishes A* Paper

mannatvjain31 May 2025 0:00 UTC
12 points
0 comments4 min readLW link
(www.intology.ai)

Me­mory De­cod­ing Jour­nal Club: Struc­ture and func­tion of the hip­pocam­pal CA3 module

Devin Ward30 May 2025 23:59 UTC
1 point
0 comments1 min readLW link

Di­a­betes is Caused by Ox­ida­tive Stress

Lorec30 May 2025 21:03 UTC
11 points
11 comments8 min readLW link

Too Many Me­taphors: A Case for Plain Talk in AI Safety

David Harket30 May 2025 19:29 UTC
1 point
8 comments2 min readLW link

[Question] Could we go an­other route with com­put­ers?

Roman Malov30 May 2025 19:04 UTC
13 points
5 comments1 min readLW link

Aris­totelian Op­ti­miza­tion: The Eco­nomics of Cameralism

Edward Könings30 May 2025 19:02 UTC
−2 points
1 comment13 min readLW link

I repli­cated the An­thropic al­ign­ment fak­ing ex­per­i­ment on other mod­els, and they didn’t fake alignment

30 May 2025 18:57 UTC
35 points
0 comments2 min readLW link

‘GiveWell for AI Safety’: Les­sons learned in a week

Lydia Nottingham30 May 2025 18:38 UTC
41 points
0 comments6 min readLW link

Idea Gen­er­a­tion and Sifting

belos30 May 2025 16:59 UTC
1 point
0 comments20 min readLW link
(bestofagreatlot.substack.com)

50 Ideas for Life I Re­peat­edly Share

DMMF30 May 2025 16:57 UTC
26 points
9 comments15 min readLW link
(notnottalmud.substack.com)

Virtues re­lated to honesty

Orioth30 May 2025 14:11 UTC
11 points
23 comments2 min readLW link

AI 2027 - Rogue Repli­ca­tion Timeline

Alvin Ånestrand30 May 2025 13:46 UTC
41 points
3 comments7 min readLW link
(forecastingaifutures.substack.com)

Let­ting Kids Be Kids

Zvi30 May 2025 10:50 UTC
86 points
15 comments20 min readLW link
(thezvi.wordpress.com)