AI Poli­tics: Po­lariza­tion and Chaos

American Psychohistory31 Mar 2025 23:46 UTC
4 points
0 comments4 min readLW link

Call for Col­lab­o­ra­tion: Renor­mal­iza­tion for AI safety

Lauren Greenspan31 Mar 2025 21:01 UTC
35 points
0 comments4 min readLW link

A re­sponse to OpenAI’s “How we think about safety and al­ign­ment”

Harlan31 Mar 2025 20:58 UTC
11 points
0 comments6 min readLW link
(intelligence.org)

Op­por­tu­nity Space: Renor­mal­iza­tion for AI Safety

Lauren Greenspan31 Mar 2025 20:55 UTC
22 points
0 comments6 min readLW link

Renor­mal­iza­tion Roadmap

Lauren Greenspan31 Mar 2025 20:34 UTC
64 points
7 comments18 min readLW link

AISN #50: AI Ac­tion Plan Responses

31 Mar 2025 20:13 UTC
6 points
0 comments6 min readLW link
(newsletter.safe.ai)

deleted

funnyfranco31 Mar 2025 19:26 UTC
0 points
31 comments1 min readLW link

Fundrais­ing for Mox: cowork­ing & events in SF

Austin Chen31 Mar 2025 18:25 UTC
27 points
0 comments6 min readLW link
(manifund.org)

OpenAI #12: Bat­tle of the Board Redux

Zvi31 Mar 2025 15:50 UTC
142 points
1 comment9 min readLW link
(thezvi.wordpress.com)

Rou­tine Novelty

BazingaBoy31 Mar 2025 15:47 UTC
1 point
0 comments1 min readLW link

Why does Claude Speak Byzan­tine Mu­sic No­ta­tion?

Lennart Finke31 Mar 2025 15:13 UTC
18 points
2 comments3 min readLW link

When the Wannabe Rambo Co­me­dian Cried

P. João31 Mar 2025 14:47 UTC
36 points
0 comments3 min readLW link

A Frac­tion of Global Mar­ket Cap­i­tal­iza­tion as the Best Currency

Greenless Mirror31 Mar 2025 13:30 UTC
1 point
25 comments7 min readLW link

The Apoca­lypse is Near. Can Hu­man­ity Coex­ist with Ar­tifi­cial Su­per­in­tel­li­gence?

Jakub Growiec31 Mar 2025 13:17 UTC
4 points
0 comments11 min readLW link

Sam Alt­man’s sister claims Sam sex­u­ally abused her—Part 11: List of An­nie’s on­line ac­counts, References

pythagoras501531 Mar 2025 12:26 UTC
3 points
1 comment105 min readLW link

Sam Alt­man’s sister claims Sam sex­u­ally abused her—Part 10: re­sponses from Sam and his fam­ily mem­bers; my perspective

pythagoras501531 Mar 2025 12:26 UTC
1 point
1 comment25 min readLW link

Sam Alt­man’s sister claims Sam sex­u­ally abused her—Part 9: liter­a­ture on child sex­ual abuse and trauma

pythagoras501531 Mar 2025 12:25 UTC
4 points
0 comments141 min readLW link

Sam Alt­man’s sister claims Sam sex­u­ally abused her—Part 6: Timeline, continued

pythagoras501531 Mar 2025 12:25 UTC
3 points
0 comments88 min readLW link

Sam Alt­man’s sister claims Sam sex­u­ally abused her—Part 3: Timeline, continued

pythagoras501531 Mar 2025 12:24 UTC
3 points
0 comments81 min readLW link

Sam Alt­man’s sister claims Sam sex­u­ally abused her—Part 2: An­nie’s law­suit; the re­sponse from Sam, his broth­ers, and his mother; Timeline

pythagoras501531 Mar 2025 12:24 UTC
4 points
0 comments65 min readLW link

Story Feed­back Re­quest: The Policy—Emer­gent Align­ment, Re­cur­sive Cog­ni­tion, and AGI Trajectories

queelius31 Mar 2025 11:08 UTC
10 points
2 comments48 min readLW link

On Re­cent Re­sults in LLM La­tent Reasoning

Rauno Arike31 Mar 2025 11:06 UTC
36 points
6 comments13 min readLW link

OpenAI lost $5 billion in 2024 (and its losses are in­creas­ing)

Remmelt31 Mar 2025 4:17 UTC
29 points
15 comments12 min readLW link
(www.wheresyoured.at)

The Leapfrog­ging Ter­minus and the Fuzzy Cut

Jim Pivarski31 Mar 2025 4:08 UTC
22 points
6 comments13 min readLW link

CoreWeave Is A Time Bomb

Remmelt31 Mar 2025 3:52 UTC
5 points
0 comments2 min readLW link
(www.wheresyoured.at)

Down­stream ap­pli­ca­tions as val­i­da­tion of in­ter­pretabil­ity progress

Sam Marks31 Mar 2025 1:35 UTC
112 points
3 comments7 min readLW link

Effi­ciency as a 2-place word

Adam Zerner31 Mar 2025 1:17 UTC
13 points
3 comments6 min readLW link

Some Mee­tups I Ran (Q1 2025)

jenn31 Mar 2025 1:12 UTC
30 points
2 comments8 min readLW link

Ap­par­ent In­tro­spec­tion in Claude: A Case Study in Pro­jected Mind

robert_saltzman31 Mar 2025 0:51 UTC
5 points
0 comments1 min readLW link

Align­ment first, in­tel­li­gence later

Chris Lakin30 Mar 2025 22:26 UTC
18 points
5 comments1 min readLW link
(chrislakin.blog)

[Question] Why do many peo­ple who care about AI Safety not clearly en­dorse PauseAI?

humnrdble30 Mar 2025 18:06 UTC
45 points
42 comments2 min readLW link

Bonn ACX Meetup Spring 2025

Fernand030 Mar 2025 15:12 UTC
2 points
1 comment1 min readLW link

What does al­ign­ing AI to an ide­ol­ogy mean for true al­ign­ment?

StanislavKrym30 Mar 2025 15:12 UTC
1 point
0 comments8 min readLW link

How to en­joy fail at­tempts with­out self-de­cep­tion (tech­nique)

YanLyutnev30 Mar 2025 13:49 UTC
9 points
0 comments9 min readLW link

Me­mory Per­sis­tence within Con­ver­sa­tion Threads with Mul­ti­modal LLMS

sjay830 Mar 2025 7:16 UTC
4 points
0 comments1 min readLW link

How I talk to those above me

Maxwell Peterson30 Mar 2025 6:54 UTC
104 points
16 comments8 min readLW link

Climb­ing the Hill of Experiments

nomagicpill29 Mar 2025 20:37 UTC
4 points
0 comments6 min readLW link
(nomagicpill.github.io)

[Question] Does the AI con­trol agenda broadly rely on no FOOM be­ing pos­si­ble?

Noosphere8929 Mar 2025 19:38 UTC
22 points
3 comments1 min readLW link

Ex­er­cis­ing Rationality

Eggs29 Mar 2025 19:08 UTC
4 points
0 comments4 min readLW link

AI Needs Us? In­for­ma­tion The­ory and Hu­mans as data

tomdekan29 Mar 2025 15:51 UTC
0 points
6 comments4 min readLW link

Auto Shut­down Script

jefftk29 Mar 2025 13:10 UTC
16 points
5 comments1 min readLW link
(www.jefftk.com)

Pro­posal for a Post-La­bor So­cietal Struc­ture to Miti­gate ASI Risks: The ‘Game Cul­ture Civ­i­liza­tion’ (GCC) Model

Beyond Singularity29 Mar 2025 11:31 UTC
3 points
0 comments4 min readLW link

Tor­ment­ing Gem­ini 2.5 with the [[[]]][][[]] Puzzle

Czynski29 Mar 2025 2:51 UTC
48 points
37 comments3 min readLW link

Sin­gu­lar­ity Sur­vival Guide: A Bayesian Guide for Nav­i­gat­ing the Pre-Sin­gu­lar­ity Period

mbrooks28 Mar 2025 23:21 UTC
6 points
4 comments2 min readLW link

Soft­max, Em­mett Shear’s new AI startup fo­cused on “Or­ganic Align­ment”

Chris Lakin28 Mar 2025 21:23 UTC
61 points
2 comments1 min readLW link
(www.corememory.com)

The Pando Prob­lem: Re­think­ing AI Individuality

Jan_Kulveit28 Mar 2025 21:03 UTC
133 points
14 comments13 min readLW link

Selec­tion Pres­sures on LM Personas

Raymond Douglas28 Mar 2025 20:33 UTC
40 points
0 comments3 min readLW link

AXRP Epi­sode 40 - Ja­son Gross on Com­pact Proofs and Interpretability

DanielFilan28 Mar 2025 18:40 UTC
26 points
0 comments89 min readLW link

[Question] Share AI Safety Ideas: Both Crazy and Not. №2

ank28 Mar 2025 17:22 UTC
2 points
10 comments1 min readLW link

AI x Bio Workshop

Allison Duettmann28 Mar 2025 17:21 UTC
16 points
0 comments1 min readLW link