Non­profit to re­tain con­trol of OpenAI

ArchimedesMay 5, 2025, 11:41 PM
37 points
1 comment1 min readLW link
(openai.com)

Un­ex­pected Con­scious Entities

Gunnar_ZarnckeMay 5, 2025, 10:14 PM
34 points
6 comments6 min readLW link

The First Law of Con­scious Agency: Lin­guis­tic Rel­a­tivity and the Birth of “I”

Dima (lain)May 5, 2025, 9:20 PM
−17 points
4 comments2 min readLW link

New­ton’s sec­ond law ex­plained: it works in many universes

TahpMay 5, 2025, 7:47 PM
19 points
10 comments15 min readLW link
(quark.rodeo)

Repli­ca­tor->Ve­hi­cle Align­ment and Hu­man->AI Alignment

derelict5432May 5, 2025, 7:23 PM
0 points
3 comments4 min readLW link

The Sweet Les­son: AI Safety Should Scale With Compute

Jesse HooglandMay 5, 2025, 7:03 PM
95 points
3 comments3 min readLW link

[Question] Blue light, ‘Adrenal ASMR’: strange ex­pe­riences I can’t find any liter­a­ture about

vernichtungMay 5, 2025, 6:58 PM
16 points
6 comments1 min readLW link

Ts­inghua pa­per: Does RL Really In­cen­tivize Rea­son­ing Ca­pac­ity in LLMs Beyond the Base Model?

Thomas KwaMay 5, 2025, 6:56 PM
68 points
21 comments2 min readLW link
(arxiv.org)

In­tro & Pro­posal for AGI Model

PickleBrineMay 5, 2025, 6:56 PM
0 points
0 comments3 min readLW link

AI Su­per­or­ganisms: An Alter­na­tive Path­way to Ar­tifi­cial Superintelligence

Aaron VanzylMay 5, 2025, 6:55 PM
4 points
5 comments15 min readLW link

Kar­ls­ruhe ACX: The colours of her coat

wilmMay 5, 2025, 6:35 PM
2 points
0 comments1 min readLW link

The Me­tac­u­lus Cup Series Is Live, $5,000 Prize Pool

ChristianWilliamsMay 5, 2025, 5:14 PM
4 points
0 commentsLW link
(www.metaculus.com)

Com­mu­nity Feed­back Re­quest: AI Safety In­tro for Gen­eral Public

May 5, 2025, 4:38 PM
6 points
5 comments3 min readLW link

GPT-4o Sy­co­phancy Post Mortem

ZviMay 5, 2025, 4:00 PM
55 points
1 comment16 min readLW link
(thezvi.wordpress.com)

Le­gal Su­per­vi­sion of Fron­tier AI Labs is the an­swer.

GauraventhMay 5, 2025, 1:36 PM
14 points
2 comments3 min readLW link
(robertandgaurav.substack.com)

The cru­cible — how I think about the situ­a­tion with AI

owencbMay 5, 2025, 1:18 PM
25 points
1 comment8 min readLW link
(strangecities.substack.com)

Light­ning Talks: Thought, Trick, Curiosity

marta_kMay 5, 2025, 11:49 AM
1 point
0 comments1 min readLW link

Are stan­dard­ized tests effec­tive?

HrussMay 5, 2025, 10:02 AM
1 point
1 comment1 min readLW link

Pro­posal: Liquid Pre­dic­tion Mar­kets for AI Forecasting

Jesse RichardsonMay 5, 2025, 5:13 AM
23 points
2 comments3 min readLW link

Why “Solv­ing Align­ment” Is Likely a Cat­e­gory Mistake

Nate SharpeMay 5, 2025, 4:26 AM
22 points
3 comments3 min readLW link

AI, An­i­mals, & Digi­tal Minds 2025: ap­ply to speak by Wed­nes­day!

Alistair StewartMay 5, 2025, 12:56 AM
4 points
0 comments1 min readLW link

AI, An­i­mals, & Digi­tal Minds 2025

Alistair StewartMay 5, 2025, 12:51 AM
2 points
0 comments1 min readLW link

Notes on the Long Tasks METR pa­per, from a HCAST task contributor

abstractapplicMay 4, 2025, 11:17 PM
108 points
7 comments2 min readLW link

Why I am not a successionist

Nina PanicksseryMay 4, 2025, 7:08 PM
62 points
52 comments2 min readLW link
(ninapanickssery.substack.com)

Overview: AI Safety Outreach Grass­roots Orgs

May 4, 2025, 5:39 PM
46 points
8 comments2 min readLW link

The Power Users We For­got: Why AI Needs Them Now More Than Ever

Anthony FoxMay 4, 2025, 5:23 PM
1 point
6 comments3 min readLW link

Fake AI law­suits to drive links

Yair HalberstadtMay 4, 2025, 4:53 PM
22 points
0 comments1 min readLW link
(www.rationalistjudaism.com)

Scott Aaron­son at UT Austin on May 17 | Com­pu­ta­tional Com­plex­ity & Philosophy

ekkoláptoMay 4, 2025, 4:42 PM
1 point
0 comments1 min readLW link

In­ter­pretabil­ity Will Not Reli­ably Find De­cep­tive AI

Neel NandaMay 4, 2025, 4:32 PM
316 points
66 comments7 min readLW link

80 con­cepts on my new ver­sion of AI: DecisionBots

Wes RMay 4, 2025, 2:04 PM
0 points
2 comments15 min readLW link

Where have all the to­kens gone?

bracesMay 4, 2025, 1:52 PM
13 points
7 comments6 min readLW link

The Ukraine War and the Kill Market

Martin SustrikMay 4, 2025, 7:50 AM
98 points
13 comments5 min readLW link
(250bpm.substack.com)

PSA: Be­fore May 21 is a good time to sign up for cryonics

AlexMennenMay 4, 2025, 4:10 AM
53 points
0 comments1 min readLW link

GTFO of the So­cial In­ter­net Be­fore you Can’t: The Miro & Yindi Story

keltanMay 4, 2025, 1:08 AM
30 points
12 comments10 min readLW link

“Su­per­hu­man” Isn’t Well Specified

JustisMillsMay 3, 2025, 11:42 PM
32 points
9 comments3 min readLW link
(justismills.substack.com)

Nav­i­gat­ing burnout

gwMay 3, 2025, 10:07 PM
73 points
1 comment9 min readLW link
(www.georgeyw.com)

What is your fa­vorite pod­cast?

ChristianKlMay 3, 2025, 9:25 PM
32 points
9 comments1 min readLW link

[Question] Does trans­lat­ing a post with an LLM af­fect its rat­ing?

ReverendBayesMay 3, 2025, 2:45 PM
9 points
9 comments2 min readLW link

Sim­pleS­to­ries: A Bet­ter Syn­thetic Dataset and Tiny Models for Interpretability

Lennart FinkeMay 3, 2025, 2:04 PM
13 points
0 comments1 min readLW link

What’s up with AI’s vision

Joachim BartosikMay 3, 2025, 1:23 PM
12 points
19 comments1 min readLW link

Spar­sity is the en­emy of fea­ture ex­trac­tion (ft. ab­sorp­tion)

May 3, 2025, 10:13 AM
31 points
0 comments6 min readLW link

Ex­plor­ing out-of-con­text rea­son­ing (OOCR) fine-tun­ing in LLMs to in­crease test-phase awareness

Sanyu RajakumarMay 3, 2025, 3:33 AM
8 points
0 comments6 min readLW link

Pri­son Jour­nal: Build­ing Bet­ter Think­ing Skills—Altru­is­tic Per­son Saved > 100 Go­rillas saved

P. JoãoMay 3, 2025, 1:34 AM
−30 points
2 comments1 min readLW link

Up­dates from Com­ments on “AI 2027 is a Bet Against Am­dahl’s Law”

snewmanMay 2, 2025, 11:52 PM
40 points
2 comments13 min readLW link

At­tend SPAR’s vir­tual demo day! (ca­reer fair + talks)

agucovaMay 2, 2025, 11:45 PM
9 points
0 commentsLW link
(demoday.sparai.org)

Why does METR score o3 as effec­tive for such a long time du­ra­tion de­spite over­all poor scores?

Cole WyethMay 2, 2025, 10:58 PM
19 points
3 comments1 min readLW link

Short story: Who is nan­cy­gon­za­lez8451097

Anders LindströmMay 2, 2025, 9:01 PM
13 points
2 comments5 min readLW link

In­terim Re­search Re­port: Mechanisms of Awareness

May 2, 2025, 8:29 PM
43 points
6 comments8 min readLW link

Agents, Tools, and Simulators

May 2, 2025, 8:19 PM
12 points
2 comments10 min readLW link

Ob­sta­cles in ARC’s agenda: Low Prob­a­bil­ity Estimation

David MatolcsiMay 2, 2025, 7:38 PM
43 points
0 comments6 min readLW link