OpenAI Alums, Nobel Laureates Urge Regulators to Save Company’s Nonprofit Structure

garrison · 23 Apr 2025 23:01 UTC
66 points
0 comments · 8 min read · LW link
(garrisonlovely.substack.com)

What AI safety plans are there?

MichaelDickens · 23 Apr 2025 22:58 UTC
16 points
3 comments · 1 min read · LW link

o3 Is a Lying Liar

Zvi · 23 Apr 2025 20:00 UTC
84 points
26 comments · 9 min read · LW link
(thezvi.wordpress.com)

Putting up Bumpers

Sam Bowman · 23 Apr 2025 16:05 UTC
54 points
14 comments · 2 min read · LW link

The AI Belief-Consistency Letter

Knight Lee · 23 Apr 2025 12:01 UTC
−6 points
15 comments · 4 min read · LW link

Jaan Tallinn’s 2024 Philanthropy Overview

jaan · 23 Apr 2025 11:06 UTC
227 points
8 comments · 1 min read · LW link
(jaan.info)

[Question] Are we “being poisoned”?

Tigerlily · 23 Apr 2025 5:11 UTC
16 points
2 comments · 2 min read · LW link

To Understand History, Keep Former Population Distributions In Mind

Arjun Panickssery · 23 Apr 2025 4:51 UTC
240 points
13 comments · 2 min read · LW link
(arjunpanickssery.substack.com)

Fish and Faces

Eggs · 23 Apr 2025 3:35 UTC
8 points
6 comments · 2 min read · LW link

Is alignment reducible to becoming more coherent?

Cole Wyeth · 22 Apr 2025 23:47 UTC
19 points
0 comments · 3 min read · LW link

The EU Is Asking for Feedback on Frontier AI Regulation (Open to Global Experts)—This Post Breaks Down What’s at Stake for AI Safety

Katalina Hernandez · 22 Apr 2025 20:39 UTC
62 points
13 comments · 9 min read · LW link

Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games

22 Apr 2025 19:25 UTC
24 points
3 comments · 5 min read · LW link

Alignment from equivariance II—language equivariance as a way of figuring out what an AI “means”

hamishtodd1 · 22 Apr 2025 19:04 UTC
5 points
0 comments · 3 min read · LW link

There is no Red Line

Tachikoma · 22 Apr 2025 18:28 UTC
−13 points
1 comment · 3 min read · LW link

Manifund 2025 Regrants

Austin Chen · 22 Apr 2025 17:36 UTC
21 points
0 comments · 5 min read · LW link
(manifund.substack.com)

AISN #52: An Expert Virology Benchmark

22 Apr 2025 17:08 UTC
6 points
0 comments · 4 min read · LW link
(newsletter.safe.ai)

Intuition in AI

Priyanka Bharadwaj · 22 Apr 2025 15:15 UTC
−1 points
2 comments · 2 min read · LW link

Problems with Bayesianism: A Socratic Dialogue

B Jacobs · 22 Apr 2025 14:09 UTC
3 points
1 comment · 14 min read · LW link
(bobjacobs.substack.com)

Societal and technological progress as sewing an ever-growing, ever-changing, patchy, and polychrome quilt

22 Apr 2025 13:21 UTC
47 points
24 comments · 25 min read · LW link

You Better Mechanize

Zvi · 22 Apr 2025 13:10 UTC
76 points
6 comments · 20 min read · LW link
(thezvi.wordpress.com)

Experimental testing: can I treat myself as a random sample?

avturchin · 22 Apr 2025 12:34 UTC
9 points
41 comments · 4 min read · LW link

Family-line selection optimizer

lemonhope · 22 Apr 2025 7:16 UTC
2 points
0 comments · 1 min read · LW link

Accountability Sinks

Martin Sustrik · 22 Apr 2025 5:00 UTC
440 points
57 comments · 15 min read · LW link
(250bpm.substack.com)

Most AI value will come from broad automation, not from R&D

Matthew Barnett · 22 Apr 2025 3:22 UTC
10 points
6 comments · 2 min read · LW link
(epoch.ai)

Estimat (8 Latent Values)

P. João · 22 Apr 2025 2:42 UTC
4 points
0 comments · 3 min read · LW link

A Letter to His Highness Louis XV, the King of France

testingthewaters · 22 Apr 2025 0:51 UTC
2 points
0 comments · 1 min read · LW link
(aclevername.substack.com)

10 Principles for Real Alignment

Adriaan · 21 Apr 2025 22:18 UTC
−7 points
0 comments · 7 min read · LW link

AE Studio is hiring!

Trent Hodgeson · 21 Apr 2025 20:35 UTC
20 points
2 comments · 2 min read · LW link

$500 Bounty Problem: Are (Approximately) Deterministic Natural Latents All You Need?

21 Apr 2025 20:19 UTC
92 points
24 comments · 3 min read · LW link

More Than Just A, T, C, and G: Screening for Hidden Dangers in DNA Sequences

sgd · 21 Apr 2025 20:12 UTC
1 point
0 comments · 11 min read · LW link

The US Executive vs Supreme Court Deportations Clash

NunoSempere · 21 Apr 2025 19:56 UTC
44 points
12 comments · 7 min read · LW link
(blog.sentinel-team.org)

Podcast on “AI tools for existential security” — transcript

21 Apr 2025 19:26 UTC
11 points
0 comments · 43 min read · LW link
(pnc.st)

Implications for the likelihood of human extinction from the recent discovery of possible microbial life

Mvolz · 21 Apr 2025 19:15 UTC
1 point
2 comments · 1 min read · LW link

Key event tracker for AI2027

MarkelKori · 21 Apr 2025 19:02 UTC
1 point
0 comments · 1 min read · LW link

Load Bearing Magic

winstonBosan · 21 Apr 2025 18:53 UTC
8 points
2 comments · 3 min read · LW link

The Uses of Complacency

sarahconstantin · 21 Apr 2025 18:50 UTC
88 points
5 comments · 8 min read · LW link
(sarahconstantin.substack.com)

Feature-Based Analysis of Safety-Relevant Multi-Agent Behavior

21 Apr 2025 18:12 UTC
10 points
0 comments · 5 min read · LW link

Crime and Punishment #1

Zvi · 21 Apr 2025 15:30 UTC
39 points
10 comments · 39 min read · LW link
(thezvi.wordpress.com)

Improving CNNs with Klein Networks: A Topological Approach to AI

Gunnar Carlsson · 21 Apr 2025 15:21 UTC
18 points
4 comments · 5 min read · LW link

Eulogy to the Obits

21 Apr 2025 14:10 UTC
5 points
1 comment · 10 min read · LW link

Research Notes: Running Claude 3.7, Gemini 2.5 Pro, and o3 on Pokémon Red

Julian Bradshaw · 21 Apr 2025 3:52 UTC
123 points
20 comments · 14 min read · LW link

Not All Beliefs Are Created Equal: Diagnosing Toxic Ideologies

Big_friendly_kiwi · 21 Apr 2025 3:18 UTC
23 points
7 comments · 9 min read · LW link

AI 2027 is a Bet Against Amdahl’s Law

snewman · 21 Apr 2025 3:09 UTC
126 points
56 comments · 9 min read · LW link

Severance and the Ethics of the Conscious Agents

Crissman · 21 Apr 2025 2:21 UTC
4 points
0 comments · 1 min read · LW link

March-April 2025 Progress in Guaran­teed Safe AI

Quinn · 20 Apr 2025 19:00 UTC
6 points
0 comments · 4 min read · LW link
(gsai.substack.com)

How to end credentialism

Yair Halberstadt · 20 Apr 2025 18:50 UTC
13 points
15 comments · 8 min read · LW link

Spending on Ourselves

jefftk · 20 Apr 2025 18:40 UTC
23 points
0 comments · 3 min read · LW link
(www.jefftk.com)

Interesting ACX 2024 Book Review Entries

jenn · 20 Apr 2025 18:10 UTC
24 points
1 comment · 4 min read · LW link

[Question] To what ethics is an AGI actually safely alignable?

StanislavKrym · 20 Apr 2025 17:09 UTC
1 point
6 comments · 4 min read · LW link

Evaluating Oversight Robustness with Incentivized Reward Hacking

20 Apr 2025 16:53 UTC
7 points
2 comments · 15 min read · LW link