$500 + $500 Bounty Prob­lem: Does An (Ap­prox­i­mately) Deter­minis­tic Max­i­mal Re­dund Always Ex­ist?

6 May 2025 23:05 UTC
73 points
16 comments3 min readLW link

Loss Curves

James Camacho6 May 2025 22:22 UTC
16 points
3 comments4 min readLW link
(github.com)

Nega­tive Re­sults on Group SAEs

Josh Engels6 May 2025 21:49 UTC
70 points
3 comments8 min readLW link

ACX At­lanta May 2025 Meetup

Steve French6 May 2025 21:00 UTC
2 points
0 comments1 min readLW link

[Question] What kind of policy by an AGI would make peo­ple happy?

StanislavKrym6 May 2025 18:05 UTC
1 point
2 comments1 min readLW link

AI Safety at the Fron­tier: Paper High­lights, April ’25

gasteigerjo6 May 2025 14:22 UTC
4 points
0 comments7 min readLW link
(aisafetyfrontier.substack.com)

Zucker­berg’s Dystopian AI Vision

Zvi6 May 2025 13:50 UTC
62 points
7 comments11 min readLW link
(thezvi.wordpress.com)

Will pro­tein de­sign tools solve the snake an­tivenom short­age?

Abhishaike Mahajan6 May 2025 13:11 UTC
31 points
0 comments17 min readLW link
(www.owlposting.com)

Utah Court Case Over State Law Re­gard­ing “Per­son­hood” for Non­hu­man Intelligences

Stephen Martin6 May 2025 12:54 UTC
10 points
3 comments2 min readLW link

Global Risks Weekly Roundup #18/​2025: US tar­iff short­ages, mil­i­tary polic­ing, Gaza famine.

NunoSempere6 May 2025 10:39 UTC
31 points
2 comments3 min readLW link
(blog.sentinel-team.org)

OpenAI’s Jig May Be Up

Vale6 May 2025 8:51 UTC
3 points
2 comments3 min readLW link

My Rea­sons for Us­ing Anki

Parker Conley6 May 2025 7:01 UTC
10 points
1 comment3 min readLW link
(parconley.com)

It’s ‘Well, ac­tu­ally...’ all the way down

benwr6 May 2025 5:44 UTC
40 points
34 comments1 min readLW link
(www.benwr.net)

Five Hinge‑Ques­tions That De­cide Whether AGI Is Five Years Away or Twenty

charlieoneill6 May 2025 2:48 UTC
126 points
17 comments5 min readLW link

Non­profit to re­tain con­trol of OpenAI

Archimedes5 May 2025 23:41 UTC
37 points
1 comment1 min readLW link
(openai.com)

Un­ex­pected Con­scious Entities

Gunnar_Zarncke5 May 2025 22:14 UTC
34 points
7 comments6 min readLW link

The First Law of Con­scious Agency: Lin­guis­tic Rel­a­tivity and the Birth of “I”

Dima (lain)5 May 2025 21:20 UTC
−17 points
4 comments2 min readLW link

New­ton’s sec­ond law ex­plained: it works in many universes

Tahp5 May 2025 19:47 UTC
19 points
10 comments15 min readLW link
(quark.rodeo)

Repli­ca­tor->Ve­hi­cle Align­ment and Hu­man->AI Alignment

derelict54325 May 2025 19:23 UTC
0 points
3 comments4 min readLW link

The Sweet Les­son: AI Safety Should Scale With Compute

Jesse Hoogland5 May 2025 19:03 UTC
96 points
3 comments3 min readLW link

[Question] Blue light, ‘Adrenal ASMR’: strange ex­pe­riences I can’t find any liter­a­ture about

vernichtung5 May 2025 18:58 UTC
16 points
6 comments1 min readLW link

Ts­inghua pa­per: Does RL Really In­cen­tivize Rea­son­ing Ca­pac­ity in LLMs Beyond the Base Model?

Thomas Kwa5 May 2025 18:56 UTC
69 points
21 comments2 min readLW link
(arxiv.org)

In­tro & Pro­posal for AGI Model

PickleBrine5 May 2025 18:56 UTC
0 points
0 comments3 min readLW link

AI Su­per­or­ganisms: An Alter­na­tive Path­way to Ar­tifi­cial Superintelligence

Aaron Vanzyl5 May 2025 18:55 UTC
4 points
5 comments15 min readLW link

Kar­ls­ruhe ACX: The colours of her coat

wilm5 May 2025 18:35 UTC
2 points
0 comments1 min readLW link

The Me­tac­u­lus Cup Series Is Live, $5,000 Prize Pool

ChristianWilliams5 May 2025 17:14 UTC
4 points
0 comments2 min readLW link
(www.metaculus.com)

Com­mu­nity Feed­back Re­quest: AI Safety In­tro for Gen­eral Public

5 May 2025 16:38 UTC
6 points
5 comments3 min readLW link

GPT-4o Sy­co­phancy Post Mortem

Zvi5 May 2025 16:00 UTC
55 points
1 comment16 min readLW link
(thezvi.wordpress.com)

Le­gal Su­per­vi­sion of Fron­tier AI Labs is the an­swer.

Gauraventh5 May 2025 13:36 UTC
14 points
2 comments3 min readLW link
(robertandgaurav.substack.com)

The cru­cible — how I think about the situ­a­tion with AI

owencb5 May 2025 13:18 UTC
25 points
1 comment8 min readLW link
(strangecities.substack.com)

Light­ning Talks: Thought, Trick, Curiosity

marta_k5 May 2025 11:49 UTC
2 points
2 comments1 min readLW link

Pro­posal: Liquid Pre­dic­tion Mar­kets for AI Forecasting

Jesse Richardson5 May 2025 5:13 UTC
23 points
2 comments3 min readLW link

Why “Solv­ing Align­ment” Is Likely a Cat­e­gory Mistake

Nate Sharpe5 May 2025 4:26 UTC
21 points
3 comments3 min readLW link

AI, An­i­mals, & Digi­tal Minds 2025: ap­ply to speak by Wed­nes­day!

Alistair Stewart5 May 2025 0:56 UTC
4 points
0 comments1 min readLW link

AI, An­i­mals, & Digi­tal Minds 2025

Alistair Stewart5 May 2025 0:51 UTC
2 points
0 comments1 min readLW link

Notes on the Long Tasks METR pa­per, from a HCAST task contributor

abstractapplic4 May 2025 23:17 UTC
111 points
7 comments2 min readLW link

Why I am not a successionist

Nina Panickssery4 May 2025 19:08 UTC
68 points
54 comments2 min readLW link
(ninapanickssery.substack.com)

Overview: AI Safety Outreach Grass­roots Orgs

4 May 2025 17:39 UTC
47 points
8 comments2 min readLW link

The Power Users We For­got: Why AI Needs Them Now More Than Ever

Anthony Fox4 May 2025 17:23 UTC
1 point
6 comments3 min readLW link

Fake AI law­suits to drive links

Yair Halberstadt4 May 2025 16:53 UTC
22 points
0 comments1 min readLW link
(www.rationalistjudaism.com)

Scott Aaron­son at UT Austin on May 17 | Com­pu­ta­tional Com­plex­ity & Philosophy

ekkolápto4 May 2025 16:42 UTC
1 point
0 comments1 min readLW link

In­ter­pretabil­ity Will Not Reli­ably Find De­cep­tive AI

Neel Nanda4 May 2025 16:32 UTC
329 points
68 comments7 min readLW link

80 con­cepts on my new ver­sion of AI: DecisionBots

Wes R4 May 2025 14:04 UTC
0 points
2 comments15 min readLW link

Where have all the to­kens gone?

braces4 May 2025 13:52 UTC
13 points
7 comments6 min readLW link

The Ukraine War and the Kill Market

Martin Sustrik4 May 2025 7:50 UTC
98 points
14 comments5 min readLW link
(250bpm.substack.com)

PSA: Be­fore May 21 is a good time to sign up for cryonics

AlexMennen4 May 2025 4:10 UTC
53 points
0 comments1 min readLW link

GTFO of the So­cial In­ter­net Be­fore you Can’t: The Miro & Yindi Story

keltan4 May 2025 1:08 UTC
36 points
14 comments11 min readLW link

“Su­per­hu­man” Isn’t Well Specified

JustisMills3 May 2025 23:42 UTC
34 points
9 comments3 min readLW link
(justismills.substack.com)

Nav­i­gat­ing burnout

gw3 May 2025 22:07 UTC
75 points
2 comments9 min readLW link
(www.georgeyw.com)

What is your fa­vorite pod­cast?

ChristianKl3 May 2025 21:25 UTC
32 points
9 comments1 min readLW link