All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 282930

Yet Another IABIED Review

PeterMcCluskey28 Sep 2025 21:36 UTC

15 points

0 comments7 min readLW link

(bayesianinvestor.com)

A non-review of “If Anyone Builds It, Everyone Dies”

Boaz Barak28 Sep 2025 17:34 UTC

124 points

51 comments4 min readLW link

Transgender Sticker Fallacy

ymeskhout28 Sep 2025 16:54 UTC

123 points

28 comments7 min readLW link

(www.ymeskhout.com)

Solving the problem of needing to give a talk

Kaj_Sotala28 Sep 2025 15:34 UTC

60 points

3 comments8 min readLW link

Lessons from organizing a technical AI safety bootcamp

Vili Kohonen and Dmitrii Gusev

28 Sep 2025 13:48 UTC

16 points

3 comments16 min readLW link

The Risk of Human Disconnection

Priyanka Bharadwaj28 Sep 2025 2:14 UTC

5 points

0 comments3 min readLW link

A Reply to MacAskill on “If Anyone Builds It, Everyone Dies”

Rob Bensinger27 Sep 2025 23:03 UTC

55 points

21 comments17 min readLW link

The Sensible Way Forward for AI Alignment

Davey Morse27 Sep 2025 21:00 UTC

−3 points

0 comments3 min readLW link

Book Review: The System

Julius27 Sep 2025 20:49 UTC

14 points

2 comments16 min readLW link

(thegreymatter.substack.com)

Learnings from AI safety course so far

Boaz Barak27 Sep 2025 18:17 UTC

111 points

6 comments3 min readLW link

My Weirdest Experience Wasn’t

Bridgett Kay27 Sep 2025 18:01 UTC

31 points

4 comments3 min readLW link

(dxmrevealed.wordpress.com)

Making sense of parameter-space decomposition

Malmesbury27 Sep 2025 17:37 UTC

51 points

0 comments19 min readLW link

AI Safety Field Growth Analysis 2025

Stephen McAleese27 Sep 2025 17:03 UTC

30 points

13 comments3 min readLW link

2025 Petrov day speech

nick lacombe27 Sep 2025 15:07 UTC

9 points

0 comments1 min readLW link

(nikthink.net)

LLMs Suck at Deep Thinking Part 3 - Trying to Prove It (fixed)

Taylor G. Lunt27 Sep 2025 14:54 UTC

17 points

6 comments15 min readLW link

Our Beloved Monsters

Tomás B.27 Sep 2025 13:25 UTC

71 points

4 comments11 min readLW link

Ranking the endgames of AI development

Sean Herrington27 Sep 2025 11:47 UTC

20 points

4 comments5 min readLW link

An N=1 observational study on interpretability of Natural General Intelligence (NGI)

dr_s27 Sep 2025 9:28 UTC

12 points

3 comments6 min readLW link

Day #14 Hunger Strike, on livestream, In protest of Superintelligent AI

samuelshadrach27 Sep 2025 9:16 UTC

2 points

0 comments2 min readLW link

[CS 2881r] [Week 3] Adversarial Robustness, Jailbreaks, Prompt Injection, Security

egeckr27 Sep 2025 1:31 UTC

3 points

0 comments26 min readLW link

Narrative Structure And The Principle Of Least Action

sonicrocketman27 Sep 2025 1:31 UTC

1 point

1 comment3 min readLW link

(brianschrader.com)

Exploring belief states in LLM chains of thought

emanuelr27 Sep 2025 1:09 UTC

4 points

2 comments7 min readLW link

Rehearsing the Future: Tabletop Exercises for Risks, and Readiness

bhishma and AliPat

27 Sep 2025 0:50 UTC

17 points

0 comments3 min readLW link

AI Safety Isn’t So Unique

Baram Sosis27 Sep 2025 0:36 UTC

11 points

1 comment9 min readLW link

Anthropic Economic Index report

anaguma26 Sep 2025 23:49 UTC

4 points

0 comments4 min readLW link

(www.anthropic.com)

Someone Will Build It

entirelyalive26 Sep 2025 23:39 UTC

−1 points

0 comments12 min readLW link

Reasons to sell frontier lab equity to donate now rather than later

Daniel_Eth, Ethan Perez and ryan_greenblatt

26 Sep 2025 23:07 UTC

245 points

34 comments12 min readLW link

Comparative Analysis of Black Box Methods for Detecting Evaluation Awareness in LLMs

Igor Ivanov26 Sep 2025 21:56 UTC

17 points

0 comments14 min readLW link

Mechanism design of yet another median world

Greenless Mirror26 Sep 2025 21:51 UTC

4 points

2 comments10 min readLW link

Metaculus is Hiring a Head of Consulting Services

ChristianWilliams26 Sep 2025 21:43 UTC

7 points

0 comments2 min readLW link

(apply.workable.com)

The Illustrated Petrov Day Ceremony

Raemon26 Sep 2025 21:01 UTC

93 points

11 comments2 min readLW link

Experiments with Futarchy

Ben S.26 Sep 2025 18:27 UTC

5 points

0 comments7 min readLW link

(news.manifold.markets)

Human in the Loop: on Losing Control of Autonomous Systems

Nostradamus_226 Sep 2025 18:27 UTC

3 points

0 comments9 min readLW link

(terminalvel0city.substack.com)

Synthesizing Standalone World-Models, Part 4: Metaphysical Justifications

Thane Ruthenis26 Sep 2025 18:00 UTC

23 points

9 comments4 min readLW link

On keeping chains of thought monitorable

Oscar26 Sep 2025 16:30 UTC

10 points

0 comments3 min readLW link

IABIED Misc. Discussion Thread

WilliamKiely26 Sep 2025 16:22 UTC

5 points

5 comments1 min readLW link

Economics Roundup #6

Zvi26 Sep 2025 14:10 UTC

19 points

5 comments15 min readLW link

(thezvi.wordpress.com)

The AI Village in Numbers

Shoshannah Tekofsky26 Sep 2025 13:40 UTC

6 points

0 comments4 min readLW link

(theaidigest.org)

What Happened After My Rat Group Backed Kamala Harris

Blake26 Sep 2025 12:39 UTC

39 points

3 comments1 min readLW link

[Question] Feedback request: Is the time right for an AI Safety stack exchange?

lennie26 Sep 2025 9:14 UTC

22 points

0 comments4 min readLW link

Constrained Belief Updates and Geometric Structures in Transformer Representations for the RRXOR Process

bgradowhite26 Sep 2025 1:25 UTC

6 points

0 comments11 min readLW link

[CS 2881r AI Safety] [Week 2] Modern LLM Training

jusyc26 Sep 2025 1:25 UTC

3 points

0 comments4 min readLW link

CFAR update, and New CFAR workshops

AnnaSalamon25 Sep 2025 21:12 UTC

201 points

54 comments8 min readLW link

Making Sense of Consciousness Part 5: Consciousness and the Self

sarahconstantin25 Sep 2025 21:10 UTC

13 points

0 comments10 min readLW link

(sarahconstantin.substack.com)

Widening AI Safety’s talent pipeline by meeting people where they are

Ruben Castaing, yanni kyriacos, Nelson Gardner-Challis and danwil

25 Sep 2025 20:50 UTC

33 points

3 comments8 min readLW link

Synthesizing Standalone World-Models, Part 3: Dataset-Assembly

Thane Ruthenis25 Sep 2025 19:21 UTC

13 points

2 comments2 min readLW link

Why you should eat meat—even if you hate factory farming

KatWoods25 Sep 2025 15:39 UTC

314 points

96 comments10 min readLW link

What GPT-oss Leaks About OpenAI’s Training Data

Lennart Finke25 Sep 2025 15:33 UTC

27 points

8 comments6 min readLW link

The real AI deploys itself

David Scott Krueger (formerly: capybaralet)25 Sep 2025 14:11 UTC

76 points

8 comments3 min readLW link

(therealartificialintelligence.substack.com)

AI #135: OpenAI Shows Us The Money

Zvi25 Sep 2025 13:40 UTC

26 points

2 comments44 min readLW link

(thezvi.wordpress.com)