RSS

Have we already lost? Part 2: Rea­sons for Doom

LawrenceC10 Apr 2026 6:56 UTC
23 points
0 comments3 min readLW link

An AI al­ign­ment re­search agenda based on asym­met­ric de­bate and mon­i­tor­ing.

emanuelr10 Apr 2026 6:23 UTC
2 points
0 comments17 min readLW link

Inkhaven menu, part 2

David Scott Krueger (formerly: capybaralet)10 Apr 2026 5:20 UTC
8 points
0 comments3 min readLW link
(therealartificialintelligence.substack.com)

Lin­ear vs Non-lin­ear Probes for Interpretability

NickyP10 Apr 2026 4:44 UTC
12 points
0 comments4 min readLW link
(blog.sus.cat)

AI iden­tity is not tied to its model

Sean Herrington10 Apr 2026 4:07 UTC
14 points
1 comment2 min readLW link

An­thropic did not pub­lish a “risk dis­cus­sion” of Mythos when re­quired by their RSP

RobertM10 Apr 2026 3:52 UTC
52 points
3 comments5 min readLW link

Some takes on UV & cancer

Steven Byrnes10 Apr 2026 0:31 UTC
41 points
2 comments9 min readLW link

My Spe­cific Sin­gu­lar­ity Timeline to Utopia

Michael Soareverix10 Apr 2026 0:11 UTC
5 points
0 comments6 min readLW link

Model or­ganisms re­searchers should check whether high LRs defeat their model organisms

10 Apr 2026 0:07 UTC
31 points
0 comments5 min readLW link

Climb­ing Moun­tains We Can­not Name

Tharin9 Apr 2026 22:28 UTC
3 points
0 comments4 min readLW link
(www.echoesandchimes.com)

Help me launch Ob­so­lete: a book aimed at build­ing a new move­ment for AI reform

garrison9 Apr 2026 19:17 UTC
81 points
3 comments7 min readLW link

Aliens from our own So­lar System

RomanS9 Apr 2026 18:50 UTC
4 points
5 comments4 min readLW link

How Un­mon­i­tored Ex­ter­nal Agents can Sab­o­tage AI labs

9 Apr 2026 18:07 UTC
18 points
0 comments9 min readLW link

Video and tran­script of talk on writ­ing AI constitutions

Joe Carlsmith9 Apr 2026 17:14 UTC
14 points
0 comments47 min readLW link

Writ­ing With Robots

Against Moloch9 Apr 2026 16:01 UTC
22 points
2 comments14 min readLW link

Slightly-Su­per Per­sua­sion Will Do

Tomás B.9 Apr 2026 16:00 UTC
54 points
7 comments4 min readLW link

Outro­spec­tion: Don’t Be A Rock

J Bostock9 Apr 2026 15:45 UTC
24 points
0 comments2 min readLW link

Have we already lost? Part 1: The Plan in 2024

LawrenceC9 Apr 2026 6:47 UTC
56 points
3 comments3 min readLW link

Stock­fish is not a chess su­per­in­tel­li­gence (and it doesn’t need to be)

Sean Herrington9 Apr 2026 4:29 UTC
24 points
6 comments2 min readLW link

Do not be sur­prised if LessWrong gets hacked

RobertM9 Apr 2026 3:42 UTC
160 points
30 comments4 min readLW link