All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar AprMayJun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 567 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Nonprofit to retain control of OpenAI

Archimedes5 May 2025 23:41 UTC

37 points

1 comment1 min readLW link

(openai.com)

Unexpected Conscious Entities

Gunnar_Zarncke5 May 2025 22:14 UTC

34 points

7 comments6 min readLW link

The First Law of Conscious Agency: Linguistic Relativity and the Birth of “I”

Dima (lain)5 May 2025 21:20 UTC

−17 points

4 comments2 min readLW link

Newton’s second law explained: it works in many universes

Tahp5 May 2025 19:47 UTC

19 points

10 comments15 min readLW link

(quark.rodeo)

Replicator->Vehicle Alignment and Human->AI Alignment

derelict54325 May 2025 19:23 UTC

0 points

3 comments4 min readLW link

The Sweet Lesson: AI Safety Should Scale With Compute

Jesse Hoogland5 May 2025 19:03 UTC

98 points

3 comments3 min readLW link

[Question] Blue light, ‘Adrenal ASMR’: strange experiences I can’t find any literature about

vernichtung5 May 2025 18:58 UTC

17 points

6 comments1 min readLW link

Tsinghua paper: Does RL Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Thomas Kwa5 May 2025 18:56 UTC

70 points

22 comments2 min readLW link

(arxiv.org)

Intro & Proposal for AGI Model

PickleBrine5 May 2025 18:56 UTC

0 points

0 comments3 min readLW link

AI Superorganisms: An Alternative Pathway to Artificial Superintelligence

Aaron Vanzyl5 May 2025 18:55 UTC

4 points

5 comments15 min readLW link

Karlsruhe ACX: The colours of her coat

wilm5 May 2025 18:35 UTC

2 points

0 comments1 min readLW link

The Metaculus Cup Series Is Live, $5,000 Prize Pool

ChristianWilliams5 May 2025 17:14 UTC

4 points

0 comments2 min readLW link

(www.metaculus.com)

Community Feedback Request: AI Safety Intro for General Public

Algon and Vishakha

5 May 2025 16:38 UTC

6 points

5 comments3 min readLW link

GPT-4o Sycophancy Post Mortem

Zvi5 May 2025 16:00 UTC

55 points

1 comment16 min readLW link

(thezvi.wordpress.com)

Legal Supervision of Frontier AI Labs is the answer.

Gauraventh5 May 2025 13:36 UTC

14 points

2 comments3 min readLW link

(robertandgaurav.substack.com)

The crucible — how I think about the situation with AI

owencb5 May 2025 13:18 UTC

25 points

1 comment8 min readLW link

(strangecities.substack.com)

Lightning Talks: Thought, Trick, Curiosity

marta_k5 May 2025 11:49 UTC

2 points

2 comments1 min readLW link

Proposal: Liquid Prediction Markets for AI Forecasting

Jesse Richardson5 May 2025 5:13 UTC

23 points

2 comments3 min readLW link

Why “Solving Alignment” Is Likely a Category Mistake

Nate Sharpe5 May 2025 4:26 UTC

22 points

3 comments3 min readLW link

AI, Animals, & Digital Minds 2025: apply to speak by Wednesday!

Alistair Stewart5 May 2025 0:56 UTC

4 points

0 comments1 min readLW link

AI, Animals, & Digital Minds 2025

Alistair Stewart5 May 2025 0:51 UTC

2 points

0 comments1 min readLW link

Notes on the Long Tasks METR paper, from a HCAST task contributor

abstractapplic4 May 2025 23:17 UTC

115 points

8 comments2 min readLW link

Why I am not a successionist

Nina Panickssery4 May 2025 19:08 UTC

68 points

54 comments2 min readLW link

(ninapanickssery.substack.com)

Overview: AI Safety Outreach Grassroots Orgs

Severin T. Seehrich and Benjamin Schmidt

4 May 2025 17:39 UTC

55 points

8 comments2 min readLW link

The Power Users We Forgot: Why AI Needs Them Now More Than Ever

Anthony Fox4 May 2025 17:23 UTC

1 point

6 comments3 min readLW link

Fake AI lawsuits to drive links

Yair Halberstadt4 May 2025 16:53 UTC

22 points

0 comments1 min readLW link

(www.rationalistjudaism.com)

Scott Aaronson at UT Austin on May 17 | Computational Complexity & Philosophy

ekkolápto4 May 2025 16:42 UTC

1 point

0 comments1 min readLW link

Interpretability Will Not Reliably Find Deceptive AI

Neel Nanda4 May 2025 16:32 UTC

344 points

69 comments7 min readLW link

80 concepts on my new version of AI: DecisionBots

Wes R4 May 2025 14:04 UTC

0 points

2 comments15 min readLW link

Where have all the tokens gone?

braces4 May 2025 13:52 UTC

15 points

7 comments6 min readLW link

The Ukraine War and the Kill Market

Martin Sustrik4 May 2025 7:50 UTC

98 points

14 comments5 min readLW link

(250bpm.substack.com)

PSA: Before May 21 is a good time to sign up for cryonics

AlexMennen4 May 2025 4:10 UTC

54 points

0 comments1 min readLW link

GTFO of the Social Internet Before you Can’t: The Miro & Yindi Story

keltan4 May 2025 1:08 UTC

44 points

15 comments11 min readLW link

“Superhuman” Isn’t Well Specified

JustisMills3 May 2025 23:42 UTC

34 points

9 comments3 min readLW link

(justismills.substack.com)

Navigating burnout

gw3 May 2025 22:07 UTC

78 points

2 comments9 min readLW link

(www.georgeyw.com)

What is your favorite podcast?

ChristianKl3 May 2025 21:25 UTC

31 points

9 comments1 min readLW link

[Question] Does translating a post with an LLM affect its rating?

ReverendBayes3 May 2025 14:45 UTC

9 points

9 comments2 min readLW link

SimpleStories: A Better Synthetic Dataset and Tiny Models for Interpretability

Lennart Finke3 May 2025 14:04 UTC

16 points

0 comments1 min readLW link

What’s up with AI’s vision

Joachim Bartosik3 May 2025 13:23 UTC

12 points

19 comments1 min readLW link

Sparsity is the enemy of feature extraction (ft. absorption)

7vik, chanind and Adrià Garriga-alonso

3 May 2025 10:13 UTC

32 points

0 comments6 min readLW link

Exploring out-of-context reasoning (OOCR) fine-tuning in LLMs to increase test-phase awareness

Sanyu Rajakumar3 May 2025 3:33 UTC

8 points

0 comments6 min readLW link

Updates from Comments on “AI 2027 is a Bet Against Amdahl’s Law”

snewman2 May 2025 23:52 UTC

43 points

2 comments13 min readLW link

Attend SPAR’s virtual demo day! (career fair + talks)

agucova2 May 2025 23:45 UTC

9 points

0 comments2 min readLW link

(demoday.sparai.org)

Why does METR score o3 as effective for such a long time duration despite overall poor scores?

Cole Wyeth2 May 2025 22:58 UTC

19 points

3 comments1 min readLW link

Short story: Who is nancygonzalez8451097

Anders Lindström2 May 2025 21:01 UTC

13 points

2 comments5 min readLW link

Interim Research Report: Mechanisms of Awareness

Josh Engels, Neel Nanda and Senthooran Rajamanoharan

2 May 2025 20:29 UTC

43 points

6 comments8 min readLW link

Agents, Tools, and Simulators

WillPetillo, Sean Herrington, Adebayo Mubarak, Can Narin and Spencer Ames

2 May 2025 20:19 UTC

16 points

5 comments10 min readLW link

Obstacles in ARC’s agenda: Low Probability Estimation

David Matolcsi2 May 2025 19:38 UTC

44 points

0 comments6 min readLW link

What’s going on with AI progress and trends? (As of 5/2025)

ryan_greenblatt2 May 2025 19:00 UTC

77 points

8 comments8 min readLW link

When AI Optimizes for the Wrong Thing

Anthony Fox2 May 2025 18:00 UTC

5 points

0 comments1 min readLW link