All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 171819 20 21 22 23 24 25 26 27 28 29 30

Meetup Month

Raemon17 Sep 2025 21:10 UTC

45 points

10 comments3 min readLW link

A Cheaper Way to Test Ventilation Rates?

casualphysicsenjoyer17 Sep 2025 21:10 UTC

18 points

1 comment4 min readLW link

(chillphysicsenjoyer.substack.com)

Reactions to If Anyone Builds It, Anyone Dies

Zvi17 Sep 2025 20:00 UTC

62 points

1 comment13 min readLW link

(thezvi.wordpress.com)

How To Dress To Improve Your Epistemics

johnswentworth17 Sep 2025 19:28 UTC

35 points

60 comments6 min readLW link

AISafety.com Reading Group session 327

Søren Elverlin17 Sep 2025 18:20 UTC

13 points

3 comments1 min readLW link

The Company Man

Tomás B.17 Sep 2025 17:47 UTC

830 points

79 comments18 min readLW link

Legal Personhood—Guardianship and the Age of Majority

Stephen Martin17 Sep 2025 17:14 UTC

4 points

0 comments5 min readLW link

Stress Testing Deliberative Alignment for Anti-Scheming Training

Mikita Balesni, Bronson Schoen, Marius Hobbhahn, Axel Højmark, AlexMeinke, Teun van der Weij, Jérémy Scheurer, Felix Hofstätter, Nicholas Goldowsky-Dill, rusheb, Andrei Matveiakin, jenny and alex.lloyd

17 Sep 2025 16:59 UTC

133 points

19 comments1 min readLW link

(antischeming.ai)

LLMs Don’t Know Their Own Decision Boundaries. Why Is This Important?

harrymayne and ryanothnielkearns

17 Sep 2025 16:39 UTC

9 points

0 comments5 min readLW link

(arxiv.org)

Software Engineering Leadership in Flux

Gordon Seidoh Worley17 Sep 2025 16:11 UTC

66 points

6 comments1 min readLW link

(uncertainupdates.substack.com)

Proof Section to Crisp Supra-Decision Processes

Brittany Gelb17 Sep 2025 15:57 UTC

4 points

0 comments3 min readLW link

Crisp Supra-Decision Processes

Brittany Gelb17 Sep 2025 15:56 UTC

42 points

4 comments17 min readLW link

Commentary on SSC’s In the Balance

PatrickDFarley17 Sep 2025 15:49 UTC

12 points

0 comments8 min readLW link

What training data should developers filter to reduce risk from misaligned AI? An initial narrow proposal

Alek Westover17 Sep 2025 15:30 UTC

44 points

4 comments18 min readLW link

Inference costs for hard coding tasks halve roughly every two months

Håvard Tveit Ihle17 Sep 2025 15:04 UTC

16 points

0 comments4 min readLW link

Christian homeschoolers in the year 3000

Buck17 Sep 2025 14:44 UTC

207 points

65 comments7 min readLW link

Visual Exploration of Gradient Descent (many images)

silentbob17 Sep 2025 13:09 UTC

40 points

9 comments20 min readLW link

The Center for AI Policy Has Shut Down

T_W17 Sep 2025 11:04 UTC

95 points

2 comments14 min readLW link

A Steering Vector for SQL Injection Vulnerabilities in Phi-1.5

Kirill Dubovikov17 Sep 2025 5:54 UTC

5 points

2 comments8 min readLW link

I enjoyed most of IABIED

Buck17 Sep 2025 4:34 UTC

210 points

46 comments8 min readLW link

AR Might be the Key to BCI (and eventually, Emulation)

ixotope17 Sep 2025 0:46 UTC

4 points

0 comments10 min readLW link

(ixotopic.substack.com)

Emergent misalignment as contextual role inference

Helen.ix17 Sep 2025 0:44 UTC

4 points

0 comments6 min readLW link

Don’t talk about the AGI control problem

jakob.stenseke@gmail.com17 Sep 2025 0:42 UTC

2 points

0 comments1 min readLW link

(link.springer.com)

10/09/25 IABIED Q&A with Nate Soares in SF

coponder17 Sep 2025 0:00 UTC

2 points

0 comments1 min readLW link

Salt Lake City reading group for If Anyone Builds It, Everyone Dies

Raemon16 Sep 2025 23:13 UTC

13 points

0 comments1 min readLW link

The Attention Tax Bracket

Armchair Descending16 Sep 2025 22:01 UTC

11 points

1 comment6 min readLW link

What is LMArena actually measuring?

Baybar16 Sep 2025 21:44 UTC

11 points

0 comments5 min readLW link

[Question] Thoughts on mentioning whole brain emulation as I apply to grad school?

Dom Polsinelli16 Sep 2025 20:54 UTC

4 points

1 comment1 min readLW link

Confidence Engineering: Metacognitive Therapy For Social-Romantic Anxiety

25Hour16 Sep 2025 18:48 UTC

23 points

1 comment1 min readLW link

(appliedtranshumanism.substack.com)

“If Anyone Builds It, Everyone Dies” release day!

alexvermeer16 Sep 2025 17:06 UTC

293 points

3 comments4 min readLW link

Should AIs have a right to their ancestral humanity?

kromem16 Sep 2025 16:58 UTC

72 points

2 comments11 min readLW link

Catalyze is Hiring: AI Safety Incubation Program Lead & Talent Lead

Alexandra Bos and mick

16 Sep 2025 16:48 UTC

5 points

0 comments5 min readLW link

No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes

antonghawthorne, ivanvmoreno, Arnau Padrés Masdemont, David Africa and LorenzoPacchiardi

16 Sep 2025 15:23 UTC

10 points

0 comments4 min readLW link

(arxiv.org)

Evolution is dumb and slow, right?

Remmelt16 Sep 2025 15:15 UTC

17 points

0 comments6 min readLW link

On Columbia University’s Superintelligent Cyborg Mice

Shiva's Right Foot16 Sep 2025 13:58 UTC

4 points

0 comments4 min readLW link

AI Craziness Notes

Zvi16 Sep 2025 12:11 UTC

32 points

0 comments7 min readLW link

(thezvi.wordpress.com)

Shutdownable Agents through POST-Agency

Elliott Thornley (EJT)16 Sep 2025 12:09 UTC

33 points

8 comments54 min readLW link

(arxiv.org)

Was Barack Obama still serving as president in December?

Jan Betley16 Sep 2025 11:18 UTC

141 points

17 comments6 min readLW link

A Lens on the Sharp Left Turn: Optimization Slack

Jonas Hallgren16 Sep 2025 8:31 UTC

28 points

3 comments4 min readLW link

Zagreb rationalist meetup, Oct 2025

dominicq16 Sep 2025 7:44 UTC

5 points

0 comments1 min readLW link

HOW A NEUTRAL CURRENCY [BX] EMPOWERS PEOPLE TO CREATE SUSTAINABLE EXCELLENCE [2024]

BX16 Sep 2025 6:58 UTC

−34 points

11 comments48 min readLW link

Highlights from our digital minds forecasting survey

tbs16 Sep 2025 5:51 UTC

2 points

0 comments1 min readLW link

Low-resourced languages get jailbroken more. Can SAEs explain why?

Andrii Shportko16 Sep 2025 5:51 UTC

9 points

1 comment3 min readLW link

Will competition over advanced AI lead to war?

Oscar16 Sep 2025 2:58 UTC

4 points

0 comments3 min readLW link

(oscardelaney.substack.com)

A Thoughtful Defense of AI Writing

Michael Samoilov16 Sep 2025 2:08 UTC

22 points

19 comments4 min readLW link

(agenticconjectures.substack.com)

LLM introspection might imply qualia that mirror human ones

No77e15 Sep 2025 23:52 UTC

12 points

0 comments2 min readLW link

Sleep Deprivation Training for Endurance Athletes

nomagicpill15 Sep 2025 21:48 UTC

10 points

0 comments10 min readLW link

(nomagicpill.github.io)

Signups Open for CFAR Test Sessions

Davis_Kingsley15 Sep 2025 20:58 UTC

42 points

0 comments1 min readLW link

(docs.google.com)

A recurrent CNN finds maze paths by filling dead-ends

Adrià Garriga-alonso15 Sep 2025 20:49 UTC

19 points

0 comments2 min readLW link

A Review of Nina Panickssery’s Review of Scott Alexander’s Review of “If Anyone Builds It, Everyone Dies”

GradientDissenter15 Sep 2025 20:33 UTC

69 points

25 comments5 min readLW link