English writes num­bers backwards

TurnTrout25 Jul 2025 23:00 UTC
8 points
23 comments12 min readLW link
(turntrout.com)

a 9-week trip on retatrutide

AnnaJo25 Jul 2025 21:41 UTC
40 points
4 comments10 min readLW link

How I Spent 2024 Liv­ing Like the World Was Go­ing to End

fernando yt25 Jul 2025 19:29 UTC
8 points
0 comments2 min readLW link
(fernandoyt.substack.com)

A Bond­ing Plat­form for Ra­tional Thinkers – Call for Sugges­tions and Collaboration

Martin Braquet25 Jul 2025 19:23 UTC
4 points
4 comments22 min readLW link
(martinbraquet.com)

[Question] What are the two con­tra­dic­tory the­o­ries of how to eval­u­ate coun­ter­fac­tu­als?

Said Achmiz25 Jul 2025 18:43 UTC
29 points
16 comments1 min readLW link

HPMOR: The (Prob­a­bly) Un­told Lore

25 Jul 2025 18:39 UTC
421 points
156 comments38 min readLW link

An­thropic Faces Po­ten­tially “Busi­ness-End­ing” Copy­right Lawsuit

garrison25 Jul 2025 17:01 UTC
57 points
15 comments9 min readLW link
(www.obsolete.pub)

ChatGPT Agent: evals and safeguards

Zach Stein-Perlman25 Jul 2025 16:30 UTC
15 points
0 comments3 min readLW link

Why I Just Took The Giv­ing What We Can Pledge

Bentham's Bulldog25 Jul 2025 16:24 UTC
−28 points
18 comments3 min readLW link

Ac­cess to agent CoT makes mon­i­tors vuln­er­a­ble to persuasion

25 Jul 2025 16:09 UTC
18 points
0 comments4 min readLW link

Au­tomat­ing AI Safety: What we can do today

25 Jul 2025 14:49 UTC
36 points
0 comments8 min readLW link

In­tro­duc­ing SB53.info

MKodama25 Jul 2025 14:48 UTC
9 points
2 comments7 min readLW link

Amer­ica’s AI Ac­tion Plan Is Pretty Good

Zvi25 Jul 2025 12:10 UTC
21 points
13 comments27 min readLW link
(thezvi.wordpress.com)

A web­site to cre­ate bets with strangers

bice25 Jul 2025 11:06 UTC
7 points
1 comment1 min readLW link

PTF 102: Con­di­tion­al­iza­tion and Events

Ape in the coat25 Jul 2025 6:07 UTC
8 points
0 comments8 min readLW link

We Built a Tool to Pro­tect Your Dataset From Sim­ple Scrapers

25 Jul 2025 5:44 UTC
60 points
9 comments3 min readLW link

The Lev­er­age Cycle

Annapurna24 Jul 2025 21:02 UTC
17 points
0 comments3 min readLW link
(jorgevelez.substack.com)

Recom­men­da­tions for fu­ture AI growth: from ex­po­nen­tial to lin­ear, with eco­nomic anchors

Zabor24 Jul 2025 20:11 UTC
7 points
0 comments2 min readLW link

Build­ing and eval­u­at­ing al­ign­ment au­dit­ing agents

24 Jul 2025 19:22 UTC
47 points
1 comment5 min readLW link

Ful­lrank: Bayesian Noisy Sorting

Max Niederman24 Jul 2025 19:03 UTC
20 points
2 comments3 min readLW link
(maxniederman.com)

SenseMak­ing Sum­mer School 2025, Septem­ber 17-24th

Finn Clancy24 Jul 2025 18:00 UTC
1 point
0 comments1 min readLW link

The Ide­olog­i­cal Spiral

24 Jul 2025 13:00 UTC
11 points
1 comment10 min readLW link
(cognition.cafe)

AI #126: Go Fund Yourself

Zvi24 Jul 2025 13:00 UTC
34 points
3 comments46 min readLW link
(thezvi.wordpress.com)

Su­per­in­tel­li­gence isn’t Ap­prox­i­mated by a Ra­tional Agent

Nicolas Villarreal24 Jul 2025 11:41 UTC
13 points
11 comments12 min readLW link

Tak­ing Abun­dance Seriously

eeeee24 Jul 2025 9:36 UTC
43 points
18 comments12 min readLW link

Cur­sory Anal­y­sis of LLMs in the US Gov (July 2025)

Gatlen Culp24 Jul 2025 8:52 UTC
8 points
0 comments10 min readLW link

Reflec­tions from Ooty re­treat 2.0

24 Jul 2025 6:48 UTC
15 points
2 comments14 min readLW link

So Shrieked ZAR

AdamLacerdo23 Jul 2025 23:25 UTC
10 points
2 comments8 min readLW link

AI Safety x Physics Grand Challenge

23 Jul 2025 21:41 UTC
37 points
0 comments8 min readLW link

Dear Su­per­in­tel­li­gence, please check these con­sid­er­a­tions of your un­prece­dented Importance

chaosmage23 Jul 2025 20:49 UTC
17 points
0 comments3 min readLW link

The Whole Check

JustisMills23 Jul 2025 19:20 UTC
50 points
13 comments4 min readLW link
(justismills.substack.com)

Women Want Safety, Men Want Respect

Gordon Seidoh Worley23 Jul 2025 19:10 UTC
18 points
31 comments4 min readLW link
(uncertainupdates.substack.com)

Dark Lord’s An­swer: Re­view and Eco­nomics Excerpts

Towards_Keeperhood23 Jul 2025 17:45 UTC
16 points
6 comments17 min readLW link

“Be­hav­iorist” RL re­ward func­tions lead to scheming

Steven Byrnes23 Jul 2025 16:55 UTC
56 points
5 comments12 min readLW link

Rea­son­ing-Fine­tun­ing Repur­poses La­tent Rep­re­sen­ta­tions in Base Models

23 Jul 2025 16:18 UTC
35 points
1 comment2 min readLW link
(arxiv.org)

Healthy AI re­la­tion­ships as a microcosm

Raymond Douglas23 Jul 2025 15:59 UTC
13 points
0 comments2 min readLW link

In­vol­un­tary One Box­ers—Why Dis­po­si­tion Doesn’t (Always) Matter

Nickolas Cavagnaro23 Jul 2025 15:45 UTC
4 points
3 comments4 min readLW link

Ten AI safety pro­jects I’d like peo­ple to work on

Julian Hazell23 Jul 2025 15:28 UTC
5 points
2 comments10 min readLW link
(thirdthing.ai)

Anti-Su­per­per­sua­sion Interventions

23 Jul 2025 15:18 UTC
21 points
1 comment5 min readLW link

Steer­ing Out-of-Distri­bu­tion Gen­er­al­iza­tion with Con­cept Abla­tion Fine-Tuning

23 Jul 2025 14:57 UTC
78 points
3 comments5 min readLW link

Trans­form­ers Don’t Need Lay­erNorm at In­fer­ence Time: Im­pli­ca­tions for Interpretability

23 Jul 2025 14:55 UTC
31 points
0 comments7 min readLW link

GPT Agent Is Stand­ing By

Zvi23 Jul 2025 14:20 UTC
25 points
1 comment12 min readLW link
(thezvi.wordpress.com)

Agent 002: A story about how ar­tifi­cial in­tel­li­gence might soon de­stroy humanity

Jakub Growiec23 Jul 2025 13:56 UTC
5 points
0 comments26 min readLW link

Beyond in­tel­li­gence: why wis­dom mat­ters in AI systems

Chris Cooper23 Jul 2025 11:57 UTC
6 points
0 comments7 min readLW link

A brief per­spec­tive from an IMO coordinator

DirectedEvolution23 Jul 2025 7:19 UTC
36 points
7 comments1 min readLW link
(www.reddit.com)

Trusted mon­i­tor­ing, but with de­cep­tion probes.

23 Jul 2025 5:26 UTC
31 points
0 comments4 min readLW link
(arxiv.org)

TT Self Study Jour­nal # 3

TristanTrim23 Jul 2025 3:46 UTC
6 points
0 comments6 min readLW link

I tried re­pro­duc­ing that Lancet study about USAID cuts so you don’t have to

rba23 Jul 2025 3:05 UTC
8 points
2 comments11 min readLW link

On “ChatGPT Psy­chosis” and LLM Sycophancy

jdp23 Jul 2025 1:11 UTC
142 points
28 comments18 min readLW link
(minihf.com)

Ex­plain­ing your life with self-re­flec­tive AIXI (an in­ter­lude)

Cole Wyeth23 Jul 2025 0:57 UTC
16 points
0 comments5 min readLW link