All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 678 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Arusha Perpetual Chicken—an unlikely iterated game

James Stephen Brown6 Apr 2025 22:56 UTC

15 points

1 comment5 min readLW link

(nonzerosum.games)

How Gay is the Vatican?

rba6 Apr 2025 21:27 UTC

63 points

34 comments7 min readLW link

Australia’s AI Crossroads: Election 2025 Town Hall

Peter Horniak6 Apr 2025 21:17 UTC

1 point

0 comments1 min readLW link

The Lizardman and the Black Hat Bobcat

Screwtape6 Apr 2025 19:02 UTC

113 points

15 comments9 min readLW link

Would this solve the (outer) alignment problem, or at least help?

Wes R6 Apr 2025 18:49 UTC

−2 points

1 comment13 min readLW link

[Question] What are the fundamental differences between teaching the AIs and humans?

StanislavKrym6 Apr 2025 18:17 UTC

3 points

0 comments1 min readLW link

An “Optimistic” 2027 Timeline

Yitz6 Apr 2025 16:39 UTC

13 points

16 comments9 min readLW link

Thoughts on Creating a Good Language

Towards_Keeperhood6 Apr 2025 15:57 UTC

1 point

2 comments7 min readLW link

The REPHRASE Circuit: How Fine-Tuning Enhances LLMs to REPHRASE Text

Karthik Viswanathan6 Apr 2025 15:02 UTC

4 points

0 comments5 min readLW link

[Research sprint] Single-model crosscoder feature ablation and steering

Thomas Read6 Apr 2025 14:42 UTC

11 points

0 comments12 min readLW link

Ferrer, Pilar, and Me

Askwho6 Apr 2025 11:22 UTC

23 points

1 comment4 min readLW link

(open.substack.com)

FlexChunk: Enabling 100M×100M Out-of-Core SpMV (~1.8 min, ~1.7 GB RAM) with Near-Linear Scaling

Daniil Strizhov6 Apr 2025 5:27 UTC

1 point

0 comments7 min readLW link

A collection of approaches to confronting doom, and my thoughts on them

Ruby6 Apr 2025 2:11 UTC

48 points

18 comments12 min readLW link

A Slow Guide to Confronting Doom

Ruby6 Apr 2025 2:10 UTC

87 points

20 comments14 min readLW link

[Linkpost] Visual roadmap to strong human germline engineering

TsviBT5 Apr 2025 22:22 UTC

30 points

0 comments1 min readLW link

Google DeepMind: An Approach to Technical AGI Safety and Security

Rohin Shah5 Apr 2025 22:00 UTC

75 points

12 comments18 min readLW link

(arxiv.org)

Introduction to Representing Sentences as Logical Statements

Towards_Keeperhood5 Apr 2025 20:35 UTC

33 points

10 comments16 min readLW link

Memory Decoding Journal Club: A collaboration of the Carboncopies Foundation and BPF Aspirational Neuroscience

Devin Ward5 Apr 2025 20:27 UTC

1 point

0 comments1 min readLW link

Meta releases Llama-4 herd of models

winstonBosan5 Apr 2025 19:51 UTC

14 points

5 comments1 min readLW link

Against podcasts

Biff Wiff5 Apr 2025 19:20 UTC

39 points

19 comments4 min readLW link

What are Responsible Scaling Policies (RSPs)?

Vishakha and Algon

5 Apr 2025 16:01 UTC

3 points

0 comments1 min readLW link

(aisafety.info)

What does Yann LeCun think about AGI? A summary of his talk, “Mathematical Obstacles on the Way to Human-Level AI”

Adam Jones5 Apr 2025 12:21 UTC

16 points

0 comments2 min readLW link

I Have No Mouth but I Must Speak

Jack5 Apr 2025 7:42 UTC

7 points

8 comments8 min readLW link

Prediction Markets Are Mediocre

Ape in the coat5 Apr 2025 6:54 UTC

3 points

13 comments3 min readLW link

Among Us: A Sandbox for Agentic Deception

7vik and Adrià Garriga-alonso

5 Apr 2025 6:24 UTC

114 points

7 comments7 min readLW link

Ai Cone of Probabilties—what aren’t we talking about?

Marzipan5 Apr 2025 5:51 UTC

−10 points

5 comments2 min readLW link

Quarter Inch Cables are Devious

jefftk5 Apr 2025 2:40 UTC

13 points

4 comments1 min readLW link

(www.jefftk.com)

Most Questionable Details in ‘AI 2027’

Commander Zander5 Apr 2025 0:32 UTC

35 points

12 comments6 min readLW link

Karel Čapek’s ‘War with the Newts’ 1936 review

Petr 'Margot' Andreev4 Apr 2025 23:12 UTC

−10 points

1 comment1 min readLW link

How much progress actually happens in theoretical physics?

ChristianKl4 Apr 2025 23:08 UTC

32 points

33 comments1 min readLW link

Self-Replication: AI already can do it

Andrey Seryakov4 Apr 2025 22:37 UTC

13 points

0 comments5 min readLW link

Join Vitalist Bay: An 8-Week Longevity and Radical Life Extension Event Series in Berkeley (April-May 2025)

Vitaran4 Apr 2025 21:03 UTC

6 points

0 comments1 min readLW link

Sleep peacefully: no hidden reasoning detected in LLMs. Well, at least in small ones.

Ilia Shirokov and Ilya Nachevsky

4 Apr 2025 20:49 UTC

17 points

4 comments7 min readLW link

AI companies’ unmonitored internal AI use poses serious risks

sjadler4 Apr 2025 18:17 UTC

13 points

2 comments1 min readLW link

(stevenadler.substack.com)

Will compute bottlenecks prevent a software intelligence explosion?

Tom Davidson4 Apr 2025 17:41 UTC

78 points

25 comments12 min readLW link

Join Us for the Memory Decoding Journal Club!

Devin Ward4 Apr 2025 17:13 UTC

1 point

0 comments1 min readLW link

Alignment faking CTFs: Apply to my MATS stream

joshc4 Apr 2025 16:29 UTC

61 points

0 comments4 min readLW link

LLM AGI will have memory, and memory changes alignment

Seth Herd4 Apr 2025 14:59 UTC

81 points

15 comments9 min readLW link

A Bunch of Matryoshka SAEs

chanind, TomasD and Adrià Garriga-alonso

4 Apr 2025 14:53 UTC

29 points

0 comments8 min readLW link

AI CoT Reasoning Is Often Unfaithful

Zvi4 Apr 2025 14:50 UTC

66 points

4 comments7 min readLW link

(thezvi.wordpress.com)

Meditation and Reduced Sleep Need

niplav4 Apr 2025 14:42 UTC

36 points

8 comments3 min readLW link

Suing OpenAI Won’t Save the Arts

E.G. Blee-Goldman4 Apr 2025 13:42 UTC

2 points

0 comments5 min readLW link

For Policy’s Sake: Why We Must Distinguish AI Safety from AI Security in Regulatory Governance

Katalina Hernandez4 Apr 2025 9:16 UTC

6 points

11 comments6 min readLW link

Explaining the Joke: Pausing is The Way

WillPetillo4 Apr 2025 9:04 UTC

25 points

2 comments10 min readLW link

ACX/EA Hyderabad Meetup

vmehra and Aditya S

4 Apr 2025 8:12 UTC

3 points

2 comments1 min readLW link

Tools for decision-support, deliberation, sense-making, reasoning

David James4 Apr 2025 2:27 UTC

3 points

0 comments1 min readLW link

Cheesecake Frosting

jefftk4 Apr 2025 2:10 UTC

10 points

9 comments1 min readLW link

(www.jefftk.com)

Changing my mind about Christiano’s malign prior argument

Cole Wyeth4 Apr 2025 0:54 UTC

37 points

34 comments7 min readLW link

POTUS Predictions Tournament

ChristianWilliams3 Apr 2025 22:48 UTC

15 points

0 comments1 min readLW link

(www.metaculus.com)

“Long” timelines to advanced AI have gotten crazy short

Matrice Jacobine3 Apr 2025 22:46 UTC

21 points

0 comments1 min readLW link

(helentoner.substack.com)