MRI tracers

bhauth7 Jun 2025 23:03 UTC
28 points
2 comments2 min readLW link
(www.bhauth.com)

Se­cond or­der taste

Adam Zerner7 Jun 2025 20:26 UTC
8 points
3 comments4 min readLW link

Di­men­sion­al­iz­ing Fore­cast Value

Jordan Rubin7 Jun 2025 18:45 UTC
5 points
0 comments6 min readLW link

On work­ing 80%

adrische7 Jun 2025 17:58 UTC
87 points
7 comments3 min readLW link
(github.com)

Meta Align­ment: Com­mu­ni­ca­tion Guide

Bridgett Kay7 Jun 2025 16:09 UTC
13 points
0 comments5 min readLW link
(dxmrevealed.wordpress.com)

Ex­plor­ing vo­cab­u­lary al­ign­ment of neu­rons in Llama-3.2-1B

Sergii7 Jun 2025 11:20 UTC
4 points
0 comments3 min readLW link
(grgv.xyz)

Sum­mer ACX Meetup in Bordeaux

vi21maobk9vp7 Jun 2025 11:08 UTC
5 points
0 comments1 min readLW link

Vuln­er­a­bil­ity in Trusted Mon­i­tor­ing and Mitigations

7 Jun 2025 7:16 UTC
17 points
1 comment7 min readLW link

Not max­i­miz­ing your own hap­piness is a fallacy

fasf7 Jun 2025 6:16 UTC
−39 points
7 comments1 min readLW link

Agents, Si­mu­la­tors and Interpretability

7 Jun 2025 6:06 UTC
12 points
0 comments5 min readLW link

Solo Park Play at Three

jefftk7 Jun 2025 3:00 UTC
45 points
2 comments1 min readLW link
(www.jefftk.com)

The Roots of Progress wants your sto­ries about the AI frontier

jasoncrawford6 Jun 2025 22:52 UTC
11 points
0 comments5 min readLW link
(newsletter.rootsofprogress.org)

Un­su­per­vised Ac­ti­va­tion Steer­ing: Find a steer­ing vec­tor that best rep­re­sents any set of text data

Danielle Ensign6 Jun 2025 22:37 UTC
3 points
2 comments1 min readLW link

The Mir­ror Trap

Cameron Berg6 Jun 2025 22:30 UTC
94 points
13 comments4 min readLW link

AXRP Epi­sode 42 - Owain Evans on LLM Psychology

DanielFilan6 Jun 2025 20:20 UTC
13 points
0 comments66 min readLW link

Ap­ply now to Hu­man-Aligned AI Sum­mer School 2025

6 Jun 2025 19:31 UTC
28 points
1 comment2 min readLW link
(humanaligned.ai)

The Com­mon Pile and Comma-v0.1

Trevor Hill-Hand6 Jun 2025 19:20 UTC
3 points
0 comments1 min readLW link

Max­i­mal Cu­ri­ousity is Not Useful

Max Niederman6 Jun 2025 19:08 UTC
11 points
0 comments2 min readLW link

Mak­ing deals with AIs: A tour­na­ment ex­per­i­ment with a bounty

6 Jun 2025 18:51 UTC
22 points
0 comments8 min readLW link

[Question] Does any­one have a good sys­tem for pri­ori­tis­ing pub­lish­ing drafts?

William Howard6 Jun 2025 16:58 UTC
6 points
1 comment1 min readLW link

Deep­Seek-r1-0528 Did Not Have a Moment

Zvi6 Jun 2025 15:40 UTC
30 points
2 comments15 min readLW link
(thezvi.wordpress.com)

Les­sons from a year of uni­ver­sity AI safety field building

6 Jun 2025 14:35 UTC
33 points
3 comments7 min readLW link

The De­mon of Interrelation

Jack6 Jun 2025 8:19 UTC
−2 points
0 comments8 min readLW link

Real-time voice translation

samuelshadrach6 Jun 2025 7:40 UTC
2 points
0 comments1 min readLW link

Li­a­bil­ity for Mi­suse of Models—Dean Ball’s Proposal

Stephen Martin6 Jun 2025 5:34 UTC
2 points
0 comments9 min readLW link

How do AI agents work to­gether when they can’t trust each other?

James Sullivan6 Jun 2025 3:10 UTC
16 points
0 comments8 min readLW link
(jamessullivan092.substack.com)

Large Lan­guage Models suffer from An­tero­grade Amnesia

Annapurna6 Jun 2025 1:30 UTC
7 points
0 comments3 min readLW link
(jorgevelez.substack.com)

Dis­con­tin­u­ous Lin­ear Func­tions?!

Zack_M_Davis6 Jun 2025 0:29 UTC
45 points
10 comments2 min readLW link
(zackmdavis.net)

Avoid­ing AI De­cep­tion: Lie De­tec­tors can ei­ther In­duce Hon­esty or Evasion

5 Jun 2025 23:07 UTC
22 points
2 comments5 min readLW link
(far.ai)

In­tro­duc­ing: Meri­dian Cam­bridge’s new on­line lec­ture se­ries cov­er­ing fron­tier AI and AI safety

Meridian Cambridge5 Jun 2025 21:55 UTC
1 point
0 comments1 min readLW link

cheaper sodium electrolysis

bhauth5 Jun 2025 21:49 UTC
23 points
3 comments4 min readLW link
(www.bhauth.com)

His­tograms are to CDFs as cal­ibra­tion plots are to...

Optimization Process5 Jun 2025 20:20 UTC
35 points
9 comments1 min readLW link
(optimizationprocess.com)

In­te­gra­tion Band­width: The Mechanism Be­hind In­tel­li­gence and Puberty

Dortex5 Jun 2025 19:37 UTC
−1 points
4 comments1 min readLW link
(osf.io)

Levels of Doom: Eu­topia, Disem­pow­er­ment, Extinction

Vladimir_Nesov5 Jun 2025 19:08 UTC
34 points
1 comment2 min readLW link

LLM in-con­text learn­ing as (ap­prox­i­mat­ing) Solomonoff induction

Cole Wyeth5 Jun 2025 17:45 UTC
31 points
3 comments4 min readLW link

Fun­da­men­tal Uncer­tainty: Chap­ter 2 - How do words get their mean­ing?

Gordon Seidoh Worley5 Jun 2025 16:32 UTC
11 points
2 comments11 min readLW link

AI Might Kill Everyone

Bentham's Bulldog5 Jun 2025 15:37 UTC
6 points
0 comments4 min readLW link

AI #119: Good­bye AISI?

Zvi5 Jun 2025 14:00 UTC
42 points
8 comments60 min readLW link
(thezvi.wordpress.com)

Pow­er­ful Predictions

Alvin Ånestrand5 Jun 2025 10:44 UTC
2 points
0 comments6 min readLW link
(forecastingaifutures.substack.com)

Po­ten­tially Use­ful Pro­jects in Wise AI

Chris_Leong5 Jun 2025 8:13 UTC
12 points
0 comments5 min readLW link

Build­ing as gardening

Itay Dreyfus5 Jun 2025 6:41 UTC
3 points
1 comment4 min readLW link
(productidentity.co)

Semi­con­duc­tor Fabs I: The Equipment

nomagicpill4 Jun 2025 22:09 UTC
19 points
0 comments19 min readLW link
(nomagicpill.github.io)

The Stereo­type of the Stereotype

Ike4 Jun 2025 21:06 UTC
58 points
17 comments9 min readLW link

2. Why in­tu­itive com­par­i­sons of large-scale im­pact are unjustified

Anthony DiGiovanni4 Jun 2025 20:30 UTC
25 points
0 comments16 min readLW link

Dat­ing Roundup #6

Zvi4 Jun 2025 20:00 UTC
36 points
2 comments55 min readLW link
(thezvi.wordpress.com)

Ra­tional Prime Calendar

RickHull4 Jun 2025 19:30 UTC
−1 points
0 comments3 min readLW link

A Tech­nique of Pure Reason

Adam Newgas4 Jun 2025 19:07 UTC
11 points
3 comments2 min readLW link

“Flaky break­throughs” per­vade in­ner work — but al­most no one tracks them

Chris Lakin4 Jun 2025 19:02 UTC
207 points
44 comments2 min readLW link
(chrislakin.blog)

[Question] LessOn­line saved my life. Now how do I let go of this house?

RedMan4 Jun 2025 18:47 UTC
22 points
7 comments1 min readLW link

Linkpost: Pre­dict­ing Em­piri­cal AI Re­search Out­comes with Lan­guage Models

quetzal_rainbow4 Jun 2025 18:14 UTC
10 points
1 comment1 min readLW link
(arxiv.org)