Po­si­tional ker­nels of at­ten­tion heads

Alex GibsonMar 10, 2025, 11:17 PM
9 points
0 comments12 min readLW link

Progress links and short notes, 2025-03-10

jasoncrawfordMar 10, 2025, 8:27 PM
8 points
0 comments4 min readLW link
(newsletter.rootsofprogress.org)

The Manus Mar­ket­ing Madness

ZviMar 10, 2025, 8:10 PM
54 points
0 comments24 min readLW link
(thezvi.wordpress.com)

You can just play

aswath krishnanMar 10, 2025, 8:00 PM
−5 points
0 comments2 min readLW link

How to Use Prompt Eng­ineer­ing to Rewire Your Brain

aswath krishnanMar 10, 2025, 8:00 PM
1 point
0 comments5 min readLW link
(www.aswathkrishnan.com)

When In­de­pen­dent Op­ti­miza­tion Is Worse Than Randomness

Chaotic rationalistMar 10, 2025, 7:46 PM
−4 points
0 comments2 min readLW link

Stress ex­ists only where the Mind makes it

NoahhMar 10, 2025, 7:44 PM
5 points
2 comments4 min readLW link

Coun­ter­ar­gu­ment to Godel’s Mo­dal On­tolog­i­cal Argument

WynnMar 10, 2025, 7:38 PM
−1 points
0 comments4 min readLW link

[Question] How much do fron­tier LLMs code and browse while in train­ing?

Joe RogeroMar 10, 2025, 7:34 PM
7 points
0 comments1 min readLW link

Ob­ser­va­tions on self-su­per­vised Learn­ing for vision

Dinkar JuyalMar 10, 2025, 7:31 PM
3 points
0 comments5 min readLW link

In­tro­duc­ing 11 New AI Safety Or­ga­ni­za­tions—Cat­alyze’s Win­ter 24/​25 Lon­don In­cu­ba­tion Pro­gram Cohort

Alexandra BosMar 10, 2025, 7:26 PM
70 points
0 commentsLW link

The Jack­pot Jinx (or why “Su­per­in­tel­li­gence Strat­egy” is wrong)

E.G. Blee-GoldmanMar 10, 2025, 7:18 PM
13 points
0 comments5 min readLW link

Effec­tive AI Outreach | A Data Driven Approach

NoahCWilsonMar 10, 2025, 7:18 PM
1 point
0 comments15 min readLW link

Emer­gent AI So­ciety. Tasks, Scarcity, Talks

Andrey SeryakovMar 10, 2025, 7:18 PM
1 point
0 comments5 min readLW link

Sen­tinel min­utes #10/​2025: Trump tar­iffs, US/​China ten­sions, Claude code re­ward hack­ing.

NunoSempereMar 10, 2025, 7:00 PM
25 points
0 comments10 min readLW link
(blog.sentinel-team.org)

Have you ac­tu­ally tried rais­ing the birth rate?

Yair HalberstadtMar 10, 2025, 6:06 PM
6 points
5 comments1 min readLW link

Split Per­son­al­ity Train­ing: Re­veal­ing La­tent Knowl­edge Through Per­son­al­ity-Shift Tokens

Florian_DietzMar 10, 2025, 4:07 PM
37 points
4 comments9 min readLW link

We Have No Plan for Prevent­ing Loss of Con­trol in Open Models

Andrew DicksonMar 10, 2025, 3:35 PM
45 points
11 comments22 min readLW link

Lock-In Threat Models

alamertonMar 10, 2025, 10:22 AM
5 points
0 comments8 min readLW link

Book Re­view: Affec­tive Neuroscience

sarahconstantinMar 10, 2025, 6:50 AM
62 points
8 comments13 min readLW link
(sarahconstantin.substack.com)

The chess­board world

phdeadMar 10, 2025, 1:26 AM
5 points
0 comments8 min readLW link

[Question] when will LLMs be­come hu­man-level blog­gers?

nostalgebraistMar 9, 2025, 9:10 PM
124 points
34 comments6 min readLW link

Every­thing I Know About Se­man­tics I Learned From Mu­sic Notation

J BostockMar 9, 2025, 6:09 PM
34 points
2 comments10 min readLW link

Phoenix Rising

MetacelsusMar 9, 2025, 11:53 AM
66 points
7 comments5 min readLW link
(denovo.substack.com)

How well can Claude write cod­ing ques­tions?

bodryMar 9, 2025, 5:29 AM
3 points
1 comment12 min readLW link

A model of the fi­nal phase: the cur­rent fron­tier AIs as de facto CEOs of their own com­pa­nies

Mitchell_PorterMar 8, 2025, 10:15 PM
23 points
2 comments1 min readLW link

Harry Pot­ter and the Meth­ods of Ra­tion­al­ity 10 Year An­niver­sary Party!

Robert CousineauMar 8, 2025, 9:29 PM
6 points
0 comments1 min readLW link

A case for peer-re­viewed con­spir­acy theories

Sam GMar 8, 2025, 8:41 PM
13 points
2 comments4 min readLW link

The ma­chine has no mouth and it must scream

zefMar 8, 2025, 4:40 PM
77 points
1 comment7 min readLW link
(zephyyr.substack.com)

How Do We Fix the Ed­u­ca­tion Cri­sis?

James CamachoMar 8, 2025, 2:59 AM
12 points
4 comments8 min readLW link

GPT-4.5 Can Play Los­ing Chess

GoteNoSenteMar 8, 2025, 12:58 AM
9 points
0 comments1 min readLW link
(chatgpt.com)

[Question] are “al­most-p-zom­bies” pos­si­ble?

KvmanThinkingMar 7, 2025, 10:58 PM
4 points
3 comments1 min readLW link

Suffi­ciently De­cen­tral­ized In­tel­li­gence is Indis­t­in­guish­able from Synchronicity

SahilMar 7, 2025, 9:50 PM
27 points
0 comments19 min readLW link

Am­plify­ing the Com­pu­ta­tional No-Coin­ci­dence Conjecture

glauberdebonaMar 7, 2025, 9:29 PM
8 points
6 comments7 min readLW link

[ages 16-21] Ap­ply to PAIR & ESPR, Sum­mer AI & Ra­tion­al­ity Programs

Anna GajdovaMar 7, 2025, 7:49 PM
4 points
0 comments1 min readLW link

What if con­scious­ness emerges from a pre­dic­tive loop?

JohnMarkNormanMar 7, 2025, 7:46 PM
2 points
0 comments1 min readLW link

Fore­cast­ing newslet­ter #3/​2025: Long march through the institutions

NunoSempereMar 7, 2025, 6:17 PM
8 points
0 comments1 min readLW link
(forecasting.substack.com)

Child­hood and Ed­u­ca­tion #9: School is Hell

ZviMar 7, 2025, 12:40 PM
52 points
36 comments37 min readLW link
(thezvi.wordpress.com)

The In­san­ity De­tec­tor and Writing

Johannes C. MayerMar 7, 2025, 11:19 AM
20 points
3 comments1 min readLW link

So how well is Claude play­ing Poké­mon?

Julian BradshawMar 7, 2025, 5:54 AM
171 points
74 comments5 min readLW link

Of Lov­ing Grace

Charlie SandersMar 7, 2025, 4:48 AM
−3 points
0 comments3 min readLW link
(www.dailymicrofiction.com)

In-Con­text Schem­ing: A Run is Worth a Thou­sand Words

noise-fieldMar 7, 2025, 2:47 AM
10 points
0 comments1 min readLW link
(github.com)

AI for Mu­sic, A Tool for Ma­nipu­la­tion or Ex­pres­sion?

Sunny Huiseon LeeMar 7, 2025, 2:47 AM
1 point
0 comments1 min readLW link

Are re­cent LLMs bet­ter at rea­son­ing or bet­ter at mem­o­riz­ing?

Mar 7, 2025, 2:44 AM
11 points
0 comments4 min readLW link

The Dead Planet Theory

arealsocietyMar 7, 2025, 2:43 AM
17 points
0 comments1 min readLW link
(open.substack.com)

The end of state

Peter lawless Mar 7, 2025, 12:17 AM
−21 points
1 comment1 min readLW link

How Can Aver­age Peo­ple Con­tribute to AI Safety?

Stephen McAleeseMar 6, 2025, 10:50 PM
16 points
4 comments8 min readLW link

An­thropic’s Recom­men­da­tions to OSTP for the U.S. AI Ac­tion Plan

UnofficialLinkpostBotMar 6, 2025, 10:38 PM
11 points
2 comments2 min readLW link
(www.anthropic.com)

Lots of brief thoughts on Soft­ware Engineering

Yair HalberstadtMar 6, 2025, 7:50 PM
47 points
17 comments10 min readLW link

What the Head­lines Miss About the Lat­est De­ci­sion in the Musk vs. OpenAI Lawsuit

garrisonMar 6, 2025, 7:49 PM
98 points
0 commentsLW link
(garrisonlovely.substack.com)