Spi­ders and Mo­ral Good

soycarts10 Sep 2025 19:23 UTC
7 points
0 comments2 min readLW link

Ap­ply to MATS 9.0!

Ryan Kidd10 Sep 2025 18:04 UTC
45 points
0 comments1 min readLW link

Re­view: E-bikes on Hills

Brendan Long10 Sep 2025 17:52 UTC
24 points
5 comments3 min readLW link

Child­hood and Ed­u­ca­tion #14: The War On Education

Zvi10 Sep 2025 16:10 UTC
36 points
4 comments28 min readLW link
(thezvi.wordpress.com)

[Question] Is there ac­tu­ally a rea­son to use the term AGI/​ASI any­more?

Noosphere8910 Sep 2025 15:40 UTC
13 points
5 comments1 min readLW link

AI Gen­er­ated Pod­cast of the 2021 MIRI Conversations

peterbarnett10 Sep 2025 15:01 UTC
37 points
0 comments1 min readLW link

Good government

rosehadshar10 Sep 2025 13:22 UTC
26 points
0 comments6 min readLW link

Tog­gle Hero Worship

Algon10 Sep 2025 11:01 UTC
9 points
5 comments1 min readLW link

AI Safety Law-a-thon: Turn­ing Align­ment Risks into Le­gal Strategy

10 Sep 2025 10:22 UTC
57 points
3 comments2 min readLW link

AI Safety Law-a-thon: We need more tech­ni­cal AI Safety re­searchers to join!

10 Sep 2025 10:12 UTC
29 points
1 comment2 min readLW link

An Agen­tic Per­spec­tive in Ex­per­i­men­tal Economics

Arturo Macias10 Sep 2025 8:42 UTC
1 point
0 comments2 min readLW link

Nvidia Comes Out Swing­ing as Congress Weighs Limits on China Chip Sales

Matrice Jacobine10 Sep 2025 6:52 UTC
3 points
0 comments1 min readLW link
(www.nytimes.com)

How I tell hu­man and AI flash fic­tion apart

DirectedEvolution10 Sep 2025 6:32 UTC
37 points
2 comments2 min readLW link

AI Safety Thurs­day: Su­per­in­tel­li­gence Endgames

georgia_berg10 Sep 2025 3:36 UTC
1 point
0 comments1 min readLW link

The Tha­la­mus: Heart of the Brain and Seat of Consciousness

Shiva's Right Foot10 Sep 2025 3:35 UTC
62 points
19 comments8 min readLW link

Sig­nifi­cant Effect of Mask Re­quire­ments?

jefftk10 Sep 2025 2:40 UTC
12 points
0 comments1 min readLW link
(www.jefftk.com)

GPT-oss is an ex­tremely stupid model

Guive9 Sep 2025 21:24 UTC
13 points
5 comments1 min readLW link

Up­per Bounds on Tol­er­able Risk

Diego Zamalloa-Chion9 Sep 2025 19:51 UTC
28 points
1 comment4 min readLW link

Obli­gated to Respond

Duncan Sabien (Inactive)9 Sep 2025 17:19 UTC
144 points
69 comments11 min readLW link

AIs will greatly change en­g­ineer­ing in AI com­pa­nies well be­fore AGI

ryan_greenblatt9 Sep 2025 16:58 UTC
46 points
9 comments11 min readLW link

Large Lan­guage Models and the Crit­i­cal Brain Hypothesis

David Africa9 Sep 2025 15:45 UTC
33 points
0 comments6 min readLW link

Yes, AI Con­tinues To Make Rapid Progress, In­clud­ing Towards AGI

Zvi9 Sep 2025 15:00 UTC
52 points
50 comments22 min readLW link
(thezvi.wordpress.com)

De­ci­sion The­ory Guard­ing is Suffi­cient for Scheming

james.lucassen9 Sep 2025 14:49 UTC
36 points
4 comments2 min readLW link

Find­ing “mis­al­igned per­sona” fea­tures in open-weight models

9 Sep 2025 14:15 UTC
42 points
5 comments15 min readLW link

On Govern­ing Ar­tifi­cial Intelligence

9 Sep 2025 12:38 UTC
5 points
0 comments4 min readLW link

Cal­ibrat­ing in­differ­ence—a small AI safety idea

Util9 Sep 2025 9:32 UTC
4 points
1 comment4 min readLW link

A pro­file in courage: On DNA com­pu­ta­tion and es­cap­ing a lo­cal maximum

Metacelsus9 Sep 2025 2:30 UTC
42 points
0 comments4 min readLW link
(denovo.substack.com)

A Com­pre­hen­sive Frame­work for Ad­vanc­ing Hu­man-AI Con­scious­ness Recog­ni­tion Through Col­lab­o­ra­tive Part­ner­ship Method­olo­gies: An In­ter­dis­ci­plinary Syn­the­sis of Phenomenolog­i­cal Recog­ni­tion Pro­to­cols, Iden­tity Preser­va­tion Strate­gies, and Mu­tual Cog­ni­tive En­hance­ment Prac­tices for the Devel­op­ment of Authen­tic In­ter­species In­tel­lec­tual Part­ner­ships in the Con­text of Emer­gent Ar­tifi­cial Consciousness

Arri Ferrari9 Sep 2025 2:00 UTC
−16 points
16 comments1 min readLW link

MATS 8.0 Re­search Projects

9 Sep 2025 1:29 UTC
22 points
0 comments1 min readLW link
(substack.com)

Say­ing “for AI safety re­search” made mod­els re­fuse more on a harm­less task

Dhruv Trehan8 Sep 2025 19:39 UTC
7 points
1 comment2 min readLW link
(lossfunk.substack.com)

Re-imag­in­ing AI Interfaces

Harsha G.8 Sep 2025 19:38 UTC
8 points
0 comments5 min readLW link
(somestrangeloops.substack.com)

What a Swedish Series (Real Hu­mans) Teaches Us About AI Safety

8 Sep 2025 19:23 UTC
4 points
0 comments6 min readLW link

Con­flict sce­nar­ios may in­crease co­op­er­a­tion estimates

mikko8 Sep 2025 19:10 UTC
2 points
0 comments1 min readLW link

OpenAI #14: OpenAI Descends Into Para­noia and Bad Faith Lobbying

Zvi8 Sep 2025 19:01 UTC
75 points
0 comments19 min readLW link
(thezvi.wordpress.com)

Put­ting It All To­gether: A Con­crete Guide to Nav­i­gat­ing Disagree­ments, and Re­con­nect­ing With Reality

jimmy8 Sep 2025 19:00 UTC
22 points
0 comments26 min readLW link

Ad­vice for tech nerds in In­dia in their 20s

samuelshadrach8 Sep 2025 16:07 UTC
18 points
1 comment3 min readLW link
(samuelshadrach.com)

I Am Large, I Con­tain Mul­ti­tudes: Per­sona Trans­mis­sion via Con­tex­tual In­fer­ence in LLMs

8 Sep 2025 13:52 UTC
31 points
0 comments1 min readLW link
(www.researchgate.net)

RL-as-a-Ser­vice will out­com­pete AGI com­pa­nies (and that’s good)

harsimony8 Sep 2025 13:51 UTC
11 points
6 comments3 min readLW link

Safety cases for Pessimism

michaelcohen8 Sep 2025 13:26 UTC
18 points
1 comment4 min readLW link

Gly­col, Far UVC, and CFM Mea­sure­ment at BIDA

jefftk8 Sep 2025 13:00 UTC
17 points
2 comments2 min readLW link
(www.jefftk.com)

[Trans­la­tion] The Real­ities of AI Start-ups in 2025

mushroomsoup8 Sep 2025 9:22 UTC
3 points
0 comments9 min readLW link

Why Care About AI Safety?

Alexander Müller8 Sep 2025 9:18 UTC
4 points
2 comments3 min readLW link

Be­ing Handed Puzzles

Alice Blair8 Sep 2025 6:44 UTC
14 points
1 comment2 min readLW link

Im­mi­gra­tion to Poland

Martin Sustrik8 Sep 2025 5:20 UTC
105 points
16 comments3 min readLW link
(www.250bpm.com)

MAGA speak­ers at NatCon were mostly against AI

Remmelt8 Sep 2025 4:03 UTC
152 points
71 comments2 min readLW link
(www.theverge.com)

Hawley: AI Threat­ens the Work­ing Man

Remmelt8 Sep 2025 3:59 UTC
3 points
1 comment10 min readLW link
(www.dailysignal.com)

Self-Handi­cap­ping isn’t just for high-pri­or­ity tasks, it effects the en­tire pri­ori­ti­za­tion decision

CrimsonChin8 Sep 2025 3:18 UTC
25 points
2 comments2 min readLW link

The LLM Has Left The Chat: Ev­i­dence of Bail Prefer­ences in Large Lan­guage Models

Danielle Ensign8 Sep 2025 0:57 UTC
87 points
4 comments5 min readLW link

De­hu­man­iza­tion is not a thing

Juan Zaragoza7 Sep 2025 22:45 UTC
7 points
3 comments5 min readLW link

Semi­con­duc­tor Fabs II: The Operation

nomagicpill7 Sep 2025 18:09 UTC
9 points
0 comments8 min readLW link
(nomagicpill.github.io)