OpenAI Alums, No­bel Lau­re­ates Urge Reg­u­la­tors to Save Com­pany’s Non­profit Structure

garrisonApr 23, 2025, 11:01 PM
66 points
0 comments8 min readLW link
(garrisonlovely.substack.com)

What AI safety plans are there?

MichaelDickensApr 23, 2025, 10:58 PM
16 points
3 comments1 min readLW link

o3 Is a Ly­ing Liar

ZviApr 23, 2025, 8:00 PM
84 points
26 comments9 min readLW link
(thezvi.wordpress.com)

Put­ting up Bumpers

Sam BowmanApr 23, 2025, 4:05 PM
52 points
14 comments2 min readLW link

The AI Belief-Con­sis­tency Letter

Knight LeeApr 23, 2025, 12:01 PM
−6 points
15 comments4 min readLW link

Jaan Tal­linn’s 2024 Philan­thropy Overview

jaanApr 23, 2025, 11:06 AM
223 points
8 comments1 min readLW link
(jaan.info)

[Question] Are we “be­ing poi­soned”?

TigerlilyApr 23, 2025, 5:11 AM
16 points
2 comments2 min readLW link

To Un­der­stand His­tory, Keep Former Pop­u­la­tion Distri­bu­tions In Mind

Arjun PanicksseryApr 23, 2025, 4:51 AM
240 points
13 comments2 min readLW link
(arjunpanickssery.substack.com)

Fish and Faces

EggsApr 23, 2025, 3:35 AM
8 points
6 comments2 min readLW link

Is al­ign­ment re­ducible to be­com­ing more co­her­ent?

Cole WyethApr 22, 2025, 11:47 PM
19 points
0 comments3 min readLW link

The EU Is Ask­ing for Feed­back on Fron­tier AI Reg­u­la­tion (Open to Global Ex­perts)—This Post Breaks Down What’s at Stake for AI Safety

Katalina HernandezApr 22, 2025, 8:39 PM
60 points
13 comments9 min readLW link

Cor­rupted by Rea­son­ing: Rea­son­ing Lan­guage Models Be­come Free-Riders in Public Goods Games

Apr 22, 2025, 7:25 PM
24 points
3 comments5 min readLW link

Align­ment from equiv­ar­i­ance II—lan­guage equiv­ar­i­ance as a way of figur­ing out what an AI “means”

hamishtodd1Apr 22, 2025, 7:04 PM
5 points
0 comments3 min readLW link

There is no Red Line

TachikomaApr 22, 2025, 6:28 PM
−13 points
1 comment3 min readLW link

Man­i­fund 2025 Regrants

Austin ChenApr 22, 2025, 5:36 PM
21 points
0 comments5 min readLW link
(manifund.substack.com)

AISN#52: An Ex­pert Virol­ogy Benchmark

Apr 22, 2025, 5:08 PM
6 points
0 comments4 min readLW link
(newsletter.safe.ai)

In­tu­ition in AI

Priyanka BharadwajApr 22, 2025, 3:15 PM
−1 points
2 comments2 min readLW link

Prob­lems with Bayesi­anism: A So­cratic Dialogue

B JacobsApr 22, 2025, 2:09 PM
3 points
1 comment14 min readLW link
(bobjacobs.substack.com)

So­cietal and tech­nolog­i­cal progress as sewing an ever-grow­ing, ever-chang­ing, patchy, and poly­chrome quilt

Apr 22, 2025, 1:21 PM
47 points
24 comments25 min readLW link

You Bet­ter Mechanize

ZviApr 22, 2025, 1:10 PM
74 points
6 comments20 min readLW link
(thezvi.wordpress.com)

Ex­per­i­men­tal test­ing: can I treat my­self as a ran­dom sam­ple?

avturchinApr 22, 2025, 12:34 PM
9 points
41 comments4 min readLW link

Fam­ily-line se­lec­tion optimizer

lemonhopeApr 22, 2025, 7:16 AM
2 points
0 comments1 min readLW link

Ac­countabil­ity Sinks

Martin SustrikApr 22, 2025, 5:00 AM
423 points
57 comments15 min readLW link
(250bpm.substack.com)

Most AI value will come from broad au­toma­tion, not from R&D

Matthew BarnettApr 22, 2025, 3:22 AM
10 points
6 comments2 min readLW link
(epoch.ai)

Es­ti­mat (8 Iden­tities)

P. JoãoApr 22, 2025, 2:42 AM
4 points
0 comments3 min readLW link

A Let­ter to His High­ness Louis XV, the King of France

testingthewatersApr 22, 2025, 12:51 AM
2 points
0 comments1 min readLW link
(aclevername.substack.com)

10 Prin­ci­ples for Real Align­ment

AdriaanApr 21, 2025, 10:18 PM
−7 points
0 comments7 min readLW link

AE Stu­dio is hiring!

Trent HodgesonApr 21, 2025, 8:35 PM
20 points
2 comments2 min readLW link

$500 Bounty Prob­lem: Are (Ap­prox­i­mately) Deter­minis­tic Nat­u­ral La­tents All You Need?

Apr 21, 2025, 8:19 PM
92 points
24 comments3 min readLW link

More Than Just A, T, C, and G: Screen­ing for Hid­den Dangers in DNA Sequences

sgdApr 21, 2025, 8:12 PM
1 point
0 comments11 min readLW link

The US Ex­ec­u­tive vs Supreme Court De­por­ta­tions Clash

NunoSempereApr 21, 2025, 7:56 PM
44 points
12 comments7 min readLW link
(blog.sentinel-team.org)

Pod­cast on “AI tools for ex­is­ten­tial se­cu­rity” — transcript

Apr 21, 2025, 7:26 PM
11 points
0 comments43 min readLW link
(pnc.st)

Im­pli­ca­tions for the like­li­hood of hu­man ex­tinc­tion from the re­cent dis­cov­ery of pos­si­ble micro­bial life

MvolzApr 21, 2025, 7:15 PM
1 point
2 comments1 min readLW link

Key event tracker for AI2027

MarkelKoriApr 21, 2025, 7:02 PM
1 point
0 comments1 min readLW link

Load Bear­ing Magic

winstonBosanApr 21, 2025, 6:53 PM
8 points
2 comments3 min readLW link

The Uses of Complacency

sarahconstantinApr 21, 2025, 6:50 PM
88 points
5 comments8 min readLW link
(sarahconstantin.substack.com)

Fea­ture-Based Anal­y­sis of Safety-Rele­vant Multi-Agent Behavior

Apr 21, 2025, 6:12 PM
9 points
0 comments5 min readLW link

Crime and Pu­n­ish­ment #1

ZviApr 21, 2025, 3:30 PM
39 points
10 comments39 min readLW link
(thezvi.wordpress.com)

Im­prov­ing CNNs with Klein Net­works: A Topolog­i­cal Ap­proach to AI

Gunnar CarlssonApr 21, 2025, 3:21 PM
18 points
4 comments5 min readLW link

Eu­logy to the Obits

Apr 21, 2025, 2:10 PM
5 points
1 comment10 min readLW link

Re­search Notes: Run­ning Claude 3.7, Gem­ini 2.5 Pro, and o3 on Poké­mon Red

Julian BradshawApr 21, 2025, 3:52 AM
123 points
20 comments14 min readLW link

Not All Beliefs Are Created Equal: Di­ag­nos­ing Toxic Ideologies

Big_friendly_kiwiApr 21, 2025, 3:18 AM
23 points
7 comments9 min readLW link

AI 2027 is a Bet Against Am­dahl’s Law

snewmanApr 21, 2025, 3:09 AM
126 points
56 comments9 min readLW link

Sev­er­ance and the Ethics of the Con­scious Agents

CrissmanApr 21, 2025, 2:21 AM
4 points
0 comments1 min readLW link

March-April 2025 Progress in Guaran­teed Safe AI

QuinnApr 20, 2025, 7:00 PM
6 points
0 comments4 min readLW link
(gsai.substack.com)

How to end credentialism

Yair HalberstadtApr 20, 2025, 6:50 PM
13 points
15 comments8 min readLW link

Spend­ing on Ourselves

jefftkApr 20, 2025, 6:40 PM
23 points
0 comments3 min readLW link
(www.jefftk.com)

In­ter­est­ing ACX 2024 Book Re­view Entries

jennApr 20, 2025, 6:10 PM
24 points
1 comment4 min readLW link

[Question] To what ethics is an AGI ac­tu­ally safely al­ignable?

StanislavKrymApr 20, 2025, 5:09 PM
1 point
6 comments4 min readLW link

Eval­u­at­ing Over­sight Ro­bust­ness with In­cen­tivized Re­ward Hacking

Apr 20, 2025, 4:53 PM
7 points
2 comments15 min readLW link