6 May 2025 23:05 UTC

74 points

19 comments3 min readLW link

Loss Curves

programjames6 May 2025 22:22 UTC

16 points

3 comments4 min readLW link

(github.com)

Negative Results on Group SAEs

Josh Engels6 May 2025 21:49 UTC

78 points

3 comments8 min readLW link

ACX Atlanta May 2025 Meetup

Steve French6 May 2025 21:00 UTC

2 points

0 comments1 min readLW link

[Question] What kind of policy by an AGI would make people happy?

StanislavKrym6 May 2025 18:05 UTC

1 point

2 comments1 min readLW link

AI Safety at the Frontier: Paper Highlights, April ’25

gasteigerjo6 May 2025 14:22 UTC

4 points

0 comments7 min readLW link

(aisafetyfrontier.substack.com)

Zuckerberg’s Dystopian AI Vision

Zvi6 May 2025 13:50 UTC

62 points

7 comments11 min readLW link

(thezvi.wordpress.com)

Will protein design tools solve the snake antivenom shortage?

Abhishaike Mahajan6 May 2025 13:11 UTC

31 points

0 comments17 min readLW link

(www.owlposting.com)

Utah Court Case Over State Law Regarding “Personhood” for Nonhuman Intelligences

Stephen Martin6 May 2025 12:54 UTC

10 points

3 comments2 min readLW link

Global Risks Weekly Roundup #18/2025: US tariff shortages, military policing, Gaza famine.

NunoSempere6 May 2025 10:39 UTC

31 points

2 comments3 min readLW link

(blog.sentinel-team.org)

OpenAI’s Jig May Be Up

Vale6 May 2025 8:51 UTC

3 points

2 comments3 min readLW link

My Reasons for Using Anki

Parker Conley6 May 2025 7:01 UTC

10 points

1 comment3 min readLW link

(parconley.com)

It’s ‘Well, actually...’ all the way down

benwr6 May 2025 5:44 UTC

40 points

34 comments1 min readLW link

(www.benwr.net)

Five Hinge‑Questions That Decide Whether AGI Is Five Years Away or Twenty

charlieoneill6 May 2025 2:48 UTC

130 points

17 comments5 min readLW link

Nonprofit to retain control of OpenAI

Archimedes5 May 2025 23:41 UTC

37 points

1 comment1 min readLW link

(openai.com)

Unexpected Conscious Entities

Gunnar_Zarncke5 May 2025 22:14 UTC

34 points

7 comments6 min readLW link

The First Law of Conscious Agency: Linguistic Relativity and the Birth of “I”

Dima (lain)5 May 2025 21:20 UTC

−17 points

4 comments2 min readLW link

Newton’s second law explained: it works in many universes

Tahp5 May 2025 19:47 UTC

19 points

10 comments15 min readLW link

(quark.rodeo)

Replicator->Vehicle Alignment and Human->AI Alignment

derelict54325 May 2025 19:23 UTC

0 points

3 comments4 min readLW link

The Sweet Lesson: AI Safety Should Scale With Compute

Jesse Hoogland5 May 2025 19:03 UTC

98 points

3 comments3 min readLW link

[Question] Blue light, ‘Adrenal ASMR’: strange experiences I can’t find any literature about

vernichtung5 May 2025 18:58 UTC

17 points

6 comments1 min readLW link

Tsinghua paper: Does RL Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Thomas Kwa5 May 2025 18:56 UTC

70 points

22 comments2 min readLW link

(arxiv.org)

Intro & Proposal for AGI Model

PickleBrine5 May 2025 18:56 UTC

0 points

0 comments3 min readLW link

AI Superorganisms: An Alternative Pathway to Artificial Superintelligence

Aaron Vanzyl5 May 2025 18:55 UTC

4 points

5 comments15 min readLW link

Karlsruhe ACX: The colours of her coat

wilm5 May 2025 18:35 UTC

2 points

0 comments1 min readLW link

The Metaculus Cup Series Is Live, $5,000 Prize Pool

ChristianWilliams5 May 2025 17:14 UTC

4 points

0 comments2 min readLW link

(www.metaculus.com)

Community Feedback Request: AI Safety Intro for General Public

Algon and Vishakha

5 May 2025 16:38 UTC

6 points

5 comments3 min readLW link

GPT-4o Sycophancy Post Mortem

Zvi5 May 2025 16:00 UTC

55 points

1 comment16 min readLW link

(thezvi.wordpress.com)

Legal Supervision of Frontier AI Labs is the answer.

Gauraventh5 May 2025 13:36 UTC

14 points

2 comments3 min readLW link

(robertandgaurav.substack.com)

The crucible — how I think about the situation with AI

owencb5 May 2025 13:18 UTC

25 points

1 comment8 min readLW link

(strangecities.substack.com)

Lightning Talks: Thought, Trick, Curiosity

marta_k5 May 2025 11:49 UTC

2 points

2 comments1 min readLW link

Proposal: Liquid Prediction Markets for AI Forecasting

Jesse Richardson5 May 2025 5:13 UTC

23 points

2 comments3 min readLW link

Why “Solving Alignment” Is Likely a Category Mistake

Nate Sharpe5 May 2025 4:26 UTC

22 points

3 comments3 min readLW link

AI, Animals, & Digital Minds 2025: apply to speak by Wednesday!

Alistair Stewart5 May 2025 0:56 UTC

4 points

0 comments1 min readLW link

AI, Animals, & Digital Minds 2025

Alistair Stewart5 May 2025 0:51 UTC

2 points

0 comments1 min readLW link

Notes on the Long Tasks METR paper, from a HCAST task contributor

abstractapplic4 May 2025 23:17 UTC

115 points

8 comments2 min readLW link

Why I am not a successionist

Nina Panickssery4 May 2025 19:08 UTC

68 points

54 comments2 min readLW link

(ninapanickssery.substack.com)

Overview: AI Safety Outreach Grassroots Orgs

Severin T. Seehrich and Benjamin Schmidt

4 May 2025 17:39 UTC

55 points

8 comments2 min readLW link

The Power Users We Forgot: Why AI Needs Them Now More Than Ever

Anthony Fox4 May 2025 17:23 UTC

1 point

6 comments3 min readLW link

Fake AI lawsuits to drive links

Yair Halberstadt4 May 2025 16:53 UTC

22 points

0 comments1 min readLW link

(www.rationalistjudaism.com)

Scott Aaronson at UT Austin on May 17 | Computational Complexity & Philosophy

ekkolápto4 May 2025 16:42 UTC

1 point

0 comments1 min readLW link

Interpretability Will Not Reliably Find Deceptive AI

Neel Nanda4 May 2025 16:32 UTC

341 points

69 comments7 min readLW link

80 concepts on my new version of AI: DecisionBots

Wes R4 May 2025 14:04 UTC

0 points

2 comments15 min readLW link

Where have all the tokens gone?

braces4 May 2025 13:52 UTC

15 points

7 comments6 min readLW link

The Ukraine War and the Kill Market

Martin Sustrik4 May 2025 7:50 UTC

98 points

14 comments5 min readLW link

(250bpm.substack.com)

PSA: Before May 21 is a good time to sign up for cryonics

AlexMennen4 May 2025 4:10 UTC

54 points

0 comments1 min readLW link

GTFO of the Social Internet Before you Can’t: The Miro & Yindi Story

keltan4 May 2025 1:08 UTC

43 points

15 comments11 min readLW link

“Superhuman” Isn’t Well Specified

JustisMills3 May 2025 23:42 UTC

34 points

9 comments3 min readLW link

(justismills.substack.com)

Navigating burnout

gw3 May 2025 22:07 UTC

78 points

2 comments9 min readLW link

(www.georgeyw.com)

What is your favorite podcast?

ChristianKl3 May 2025 21:25 UTC

31 points

9 comments1 min readLW link