GPT-1 was a comedic genius

anaguma 22 Sep 2025 22:19 UTC
5 points
3 comments · 4 min read · LW link

D&D.Sci: Serial Healers [Evaluation & Ruleset]

abstractapplic 22 Sep 2025 20:02 UTC
40 points
7 comments · 4 min read · LW link

Research Agenda: Synthesizing Standalone World-Models

Thane Ruthenis 22 Sep 2025 19:06 UTC
69 points
28 comments · 11 min read · LW link

Global Call for AI Red Lines—Signed by Nobel Laureates, Former Heads of State, and 200+ Prominent Figures

Charbel-Raphaël 22 Sep 2025 18:22 UTC
333 points
27 comments · 6 min read · LW link

H1-B And The $100k Fee

Zvi 22 Sep 2025 18:10 UTC
30 points
1 comment · 17 min read · LW link
(thezvi.wordpress.com)

Why I don’t believe Superalignment will work

Simon Lermen 22 Sep 2025 17:10 UTC
44 points
6 comments · 5 min read · LW link

Video and transcript of talk on giving AIs safe motivations

Joe Carlsmith 22 Sep 2025 16:43 UTC
12 points
0 comments · 50 min read · LW link

Rejecting Violence as an AI Safety Strategy

James_Miller 22 Sep 2025 16:34 UTC
58 points
5 comments · 3 min read · LW link

Focus transparency on risk reports, not safety cases

ryan_greenblatt 22 Sep 2025 15:27 UTC
47 points
3 comments · 6 min read · LW link

The world’s first frontier AI regulation is surprisingly thoughtful: the EU’s Code of Practice

MKodama 22 Sep 2025 15:23 UTC
75 points
0 comments · 15 min read · LW link

Some of the ways the IABIED plan can backfire

mishka 22 Sep 2025 15:02 UTC
19 points
16 comments · 2 min read · LW link

Relating to AI, Relating to Ourselves

22 Sep 2025 8:18 UTC
2 points
1 comment · 2 min read · LW link

Warmth, Light, Flame

Alice Blair 22 Sep 2025 4:19 UTC
37 points
0 comments · 4 min read · LW link

This is a review of the reviews

Recurrented 22 Sep 2025 3:11 UTC
184 points
57 comments · 2 min read · LW link

Incommensurability

Christopher James Hart 22 Sep 2025 2:21 UTC
26 points
6 comments · 1 min read · LW link

You Can’t Really Bet on Doom

Jack_S 21 Sep 2025 23:27 UTC
8 points
1 comment · 7 min read · LW link
(torchestogether.substack.com)

The Only Red Line

Jason Reid 21 Sep 2025 22:40 UTC
13 points
1 comment · 1 min read · LW link

Do LLMs Change Their Minds About Their Users… and Know It?

Ishaan Sinha 21 Sep 2025 22:38 UTC
10 points
2 comments · 14 min read · LW link

Metacrisis as a Framework for AI Governance

Jonah Wilberg 21 Sep 2025 21:30 UTC
20 points
0 comments · 8 min read · LW link

Is there not legitimate disagreement about this premise of IABI,ED?

enfascination 21 Sep 2025 20:47 UTC
5 points
7 comments · 1 min read · LW link

Evals in the Age of Jarvis

Dinkar Juyal 21 Sep 2025 19:27 UTC
3 points
2 comments · 3 min read · LW link

[Question] Could China Unilaterally Cause an AI Pause?

Maloew 21 Sep 2025 18:37 UTC
22 points
2 comments · 1 min read · LW link

What do people mean when they say that something will become more like a utility maximizer?

Nina Panickssery 21 Sep 2025 16:03 UTC
40 points
7 comments · 2 min read · LW link

And Yet, Defend your Thoughts from AI Writing

Michael Samoilov 21 Sep 2025 15:52 UTC
60 points
17 comments · 6 min read · LW link
(open.substack.com)

A parable of realism and relativism

kwang 21 Sep 2025 14:47 UTC
−7 points
2 comments · 2 min read · LW link
(kevw.substack.com)

ACX/LW October Paris Meetup

Lucie Philippon 21 Sep 2025 11:37 UTC
5 points
0 comments · 1 min read · LW link

Day #8 Hunger Strike, Protest Against Superintelligent AI

samuelshadrach 21 Sep 2025 5:58 UTC
13 points
4 comments · 2 min read · LW link

FTX, Golden Geese, and The Widow’s Mite

Elizabeth 20 Sep 2025 18:30 UTC
21 points
1 comment · 7 min read · LW link
(acesounderglass.com)

The Case for a Pro-AI-Safety Political Party in the US

Oliver Kuperman 20 Sep 2025 16:35 UTC
11 points
2 comments · 21 min read · LW link

Contra Collier on IABIED

Max Harms 20 Sep 2025 15:55 UTC
227 points
51 comments · 20 min read · LW link

Astralcodexten IRB history error

Paul Crowley 20 Sep 2025 15:28 UTC
36 points
0 comments · 2 min read · LW link

“Pastor Selfie,” How would you teach what you learn about rationality from scratch to the general public?

P. João 20 Sep 2025 12:43 UTC
1 point
1 comment · 1 min read · LW link

The Problem with Defining an “AGI Ban” by Outcome (a lawyer’s take).

Katalina Hernandez 20 Sep 2025 11:01 UTC
239 points
63 comments · 5 min read · LW link

The title is reasonable

Raemon 20 Sep 2025 8:59 UTC
194 points
128 comments · 18 min read · LW link

An Economic Model of Modern Dating

gladman 20 Sep 2025 2:17 UTC
4 points
0 comments · 4 min read · LW link

Announcing “The Real AI”: a blog

David Scott Krueger (formerly: capybaralet) 20 Sep 2025 1:27 UTC
32 points
1 comment · 2 min read · LW link
(therealartificialintelligence.substack.com)

Extending Inspect Framework: Integrating Weights & Biases

20 Sep 2025 1:10 UTC
2 points
0 comments · 3 min read · LW link

[Question] Looking for a ray of hope in IABIED

Rich Mansfield 20 Sep 2025 0:53 UTC
11 points
3 comments · 1 min read · LW link

Memory Decoding Journal Club: Distinct synaptic plasticity rules operate across dendritic compartments in vivo during learning

Devin Ward 20 Sep 2025 0:50 UTC
1 point
0 comments · 1 min read · LW link

Beliefs and JavaScript types

Adam Zerner 20 Sep 2025 0:48 UTC
10 points
6 comments · 6 min read · LW link

AI Lobbying is Not Normal

Algon 20 Sep 2025 0:23 UTC
131 points
11 comments · 3 min read · LW link
(x.com)

Beware LLMs’ pathological guardrailing

lc 19 Sep 2025 20:55 UTC
20 points
1 comment · 1 min read · LW link

Safety researchers should take a public stance

19 Sep 2025 18:55 UTC
230 points
65 comments · 8 min read · LW link

Day 16 Hunger Strike—Guido Reichstader Interviewed

samuelshadrach 19 Sep 2025 17:30 UTC
9 points
0 comments · 1 min read · LW link

Prospects for studying actual schemers

19 Sep 2025 14:11 UTC
40 points
0 comments · 58 min read · LW link

Book Review: If Anyone Builds It, Everyone Dies

Zvi 19 Sep 2025 11:30 UTC
61 points
3 comments · 31 min read · LW link
(thezvi.wordpress.com)

How people politically confront the Modern Eldritch

19 Sep 2025 10:18 UTC
5 points
0 comments · 14 min read · LW link
(cognition.cafe)

My Minor AI Safety Research Projects (Q3 2025)

Adam Newgas 19 Sep 2025 9:53 UTC
6 points
1 comment · 2 min read · LW link

Book Review: If Anyone Builds It, Everyone Dies

Nina Panickssery 19 Sep 2025 4:50 UTC
41 points
1 comment · 11 min read · LW link
(blog.ninapanickssery.com)

Memory Decoding Journal Club: Distinct synaptic plasticity rules operate across dendritic compartments in vivo during learning

Devin Ward 19 Sep 2025 4:17 UTC
3 points
0 comments · 1 min read · LW link