[Question] Betting on gods: Seeking Essential Self-Assessment Questions for Reducing Cognitive Biases

P. João 17 Sep 2025 21:46 UTC
3 points
0 comments 2 min read LW link

Meetup Month

Raemon 17 Sep 2025 21:10 UTC
45 points
10 comments 3 min read LW link

A Cheaper Way to Test Ventilation Rates?

casualphysicsenjoyer 17 Sep 2025 21:10 UTC
18 points
1 comment 4 min read LW link
(chillphysicsenjoyer.substack.com)

Reactions to If Anyone Builds It, Anyone Dies

Zvi 17 Sep 2025 20:00 UTC
59 points
1 comment 13 min read LW link
(thezvi.wordpress.com)

How To Dress To Improve Your Epistemics

johnswentworth 17 Sep 2025 19:28 UTC
37 points
58 comments 6 min read LW link

AISafety.com Reading Group session 327

Søren Elverlin 17 Sep 2025 18:20 UTC
13 points
3 comments 1 min read LW link

The Company Man

Tomás B. 17 Sep 2025 17:47 UTC
688 points
63 comments 18 min read LW link

Legal Personhood - Guardianship and the Age of Majority

Stephen Martin 17 Sep 2025 17:14 UTC
4 points
0 comments 5 min read LW link

Stress Testing Deliberative Alignment for Anti-Scheming Training

17 Sep 2025 16:59 UTC
124 points
13 comments 1 min read LW link
(antischeming.ai)

LLMs Don’t Know Their Own Decision Boundaries. Why Is This Important?

17 Sep 2025 16:39 UTC
8 points
0 comments 5 min read LW link
(arxiv.org)

Software Engineering Leadership in Flux

Gordon Seidoh Worley 17 Sep 2025 16:11 UTC
65 points
6 comments 1 min read LW link
(uncertainupdates.substack.com)

Proof Section to Crisp Supra-Decision Processes

Brittany Gelb 17 Sep 2025 15:57 UTC
4 points
0 comments 3 min read LW link

Crisp Supra-Decision Processes

Brittany Gelb 17 Sep 2025 15:56 UTC
34 points
0 comments 17 min read LW link

Commentary on SSC’s In the Balance

PatrickDFarley 17 Sep 2025 15:49 UTC
12 points
0 comments 8 min read LW link

What training data should developers filter to reduce risk from misaligned AI? An initial narrow proposal

Alek Westover 17 Sep 2025 15:30 UTC
32 points
1 comment 18 min read LW link

Inference costs for hard coding tasks halve roughly every two months

Håvard Tveit Ihle 17 Sep 2025 15:04 UTC
15 points
0 comments 4 min read LW link

Christian homeschoolers in the year 3000

Buck 17 Sep 2025 14:44 UTC
190 points
64 comments 7 min read LW link

Visual Exploration of Gradient Descent (many images)

silentbob 17 Sep 2025 13:09 UTC
38 points
9 comments 20 min read LW link

The Center for AI Policy Has Shut Down

Tristan Williams 17 Sep 2025 11:04 UTC
94 points
2 comments 14 min read LW link

A Steering Vector for SQL Injection Vulnerabilities in Phi-1.5

Kirill Dubovikov 17 Sep 2025 5:54 UTC
5 points
1 comment 8 min read LW link

I enjoyed most of IABIED

Buck 17 Sep 2025 4:34 UTC
207 points
46 comments 8 min read LW link

AR Might be the Key to BCI (and eventually, Emulation)

ixotope 17 Sep 2025 0:46 UTC
3 points
0 comments 10 min read LW link
(ixotopic.substack.com)

Don’t talk about the AGI control problem

jakob.stenseke@gmail.com 17 Sep 2025 0:42 UTC
2 points
0 comments 1 min read LW link
(link.springer.com)

10/09/25 IABIED Q&A with Nate Soares in SF

RobinGoins 17 Sep 2025 0:00 UTC
2 points
0 comments 1 min read LW link

Salt Lake City reading group for If Anyone Builds It, Everyone Dies

Raemon 16 Sep 2025 23:13 UTC
13 points
0 comments 1 min read LW link

About corrigbility and thrustfulness

kapedalex 16 Sep 2025 22:03 UTC
1 point
0 comments 4 min read LW link

The Attention Tax Bracket

Armchair Descending 16 Sep 2025 22:01 UTC
10 points
1 comment 6 min read LW link

What is LMArena actually measuring?

Baybar 16 Sep 2025 21:44 UTC
11 points
0 comments 5 min read LW link

[Question] Thoughts on mentioning whole brain emulation as I apply to grad school?

Dom Polsinelli 16 Sep 2025 20:54 UTC
4 points
1 comment 1 min read LW link

Confidence Engineering: Metacognitive Therapy For Social-Romantic Anxiety

25Hour 16 Sep 2025 18:48 UTC
14 points
1 comment 1 min read LW link
(appliedtranshumanism.substack.com)

“If Anyone Builds It, Everyone Dies” release day!

alexvermeer 16 Sep 2025 17:06 UTC
285 points
3 comments 4 min read LW link

Should AIs have a right to their ancestral humanity?

kromem 16 Sep 2025 16:58 UTC
64 points
1 comment 11 min read LW link

Catalyze is Hiring: AI Safety Incubation Program Lead & Talent Lead

16 Sep 2025 16:48 UTC
5 points
0 comments 5 min read LW link

No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes

16 Sep 2025 15:23 UTC
9 points
0 comments 4 min read LW link
(arxiv.org)

Evolution is dumb and slow, right?

Remmelt 16 Sep 2025 15:15 UTC
16 points
0 comments 6 min read LW link

On Columbia University’s Superintelligent Cyborg Mice

Shiva's Right Foot 16 Sep 2025 13:58 UTC
4 points
0 comments 4 min read LW link

AI Craziness Notes

Zvi 16 Sep 2025 12:11 UTC
27 points
0 comments 7 min read LW link
(thezvi.wordpress.com)

Shutdownable Agents through POST-Agency

EJT 16 Sep 2025 12:09 UTC
29 points
4 comments 54 min read LW link
(arxiv.org)

Was Barack Obama still serving as president in December?

Jan Betley 16 Sep 2025 11:18 UTC
115 points
14 comments 6 min read LW link

A Lens on the Sharp Left Turn: Optimization Slack

Jonas Hallgren 16 Sep 2025 8:31 UTC
29 points
3 comments 4 min read LW link

Zagreb rationalist meetup, Oct 2025

dominicq 16 Sep 2025 7:44 UTC
5 points
0 comments 1 min read LW link

HOW A NEUTRAL CURRENCY [BX] EMPOWERS PEOPLE TO CREATE SUSTAINABLE EXCELLENCE [2024]

BX 16 Sep 2025 6:58 UTC
−34 points
11 comments 48 min read LW link

Highlights from our digital minds forecasting survey

tbs 16 Sep 2025 5:51 UTC
2 points
0 comments 1 min read LW link

Low-resourced languages get jailbroken more. Can SAEs explain why?

Andrii Shportko 16 Sep 2025 5:51 UTC
6 points
1 comment 3 min read LW link

Will competition over advanced AI lead to war?

Oscar 16 Sep 2025 2:58 UTC
4 points
0 comments 3 min read LW link
(oscardelaney.substack.com)

A Thoughtful Defense of AI Writing

Michael Samoilov 16 Sep 2025 2:08 UTC
14 points
19 comments 4 min read LW link
(agenticconjectures.substack.com)

LLM introspection might imply qualia that mirror human ones

No77e 15 Sep 2025 23:52 UTC
6 points
0 comments 2 min read LW link

Sleep Deprivation Training for Endurance Athletes

nomagicpill 15 Sep 2025 21:48 UTC
10 points
0 comments 10 min read LW link
(nomagicpill.github.io)

Signups Open for CFAR Test Sessions

Davis_Kingsley 15 Sep 2025 20:58 UTC
42 points
0 comments 1 min read LW link
(docs.google.com)

A recurrent CNN finds maze paths by filling dead-ends

Adrià Garriga-alonso 15 Sep 2025 20:49 UTC
19 points
0 comments 2 min read LW link