Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
New
Hot
Active
Old
Page
1
Have we already lost? Part 2: Reasons for Doom
LawrenceC
10 Apr 2026 6:56 UTC
23
points
0
comments
3
min read
LW
link
An AI alignment research agenda based on asymmetric debate and monitoring.
emanuelr
10 Apr 2026 6:23 UTC
2
points
0
comments
17
min read
LW
link
Inkhaven menu, part 2
David Scott Krueger (formerly: capybaralet)
10 Apr 2026 5:20 UTC
8
points
0
comments
3
min read
LW
link
(therealartificialintelligence.substack.com)
Linear vs Non-linear Probes for Interpretability
NickyP
10 Apr 2026 4:44 UTC
12
points
0
comments
4
min read
LW
link
(blog.sus.cat)
AI identity is not tied to its model
Sean Herrington
10 Apr 2026 4:07 UTC
14
points
1
comment
2
min read
LW
link
Anthropic did not publish a “risk discussion” of Mythos when required by their RSP
RobertM
10 Apr 2026 3:52 UTC
52
points
3
comments
5
min read
LW
link
Some takes on UV & cancer
Steven Byrnes
10 Apr 2026 0:31 UTC
41
points
2
comments
9
min read
LW
link
My Specific Singularity Timeline to Utopia
Michael Soareverix
10 Apr 2026 0:11 UTC
5
points
0
comments
6
min read
LW
link
Model organisms researchers should check whether high LRs defeat their model organisms
dx26
,
Sebastian Prasanna
,
Alek Westover
,
Vivek Hebbar
and
Julian Stastny
10 Apr 2026 0:07 UTC
31
points
0
comments
5
min read
LW
link
Climbing Mountains We Cannot Name
Tharin
9 Apr 2026 22:28 UTC
3
points
0
comments
4
min read
LW
link
(www.echoesandchimes.com)
Help me launch Obsolete: a book aimed at building a new movement for AI reform
garrison
9 Apr 2026 19:17 UTC
81
points
3
comments
7
min read
LW
link
Aliens from our own Solar System
RomanS
9 Apr 2026 18:50 UTC
4
points
5
comments
4
min read
LW
link
How Unmonitored External Agents can Sabotage AI labs
Elle Najt
and
Fabien Roger
9 Apr 2026 18:07 UTC
18
points
0
comments
9
min read
LW
link
Video and transcript of talk on writing AI constitutions
Joe Carlsmith
9 Apr 2026 17:14 UTC
14
points
0
comments
47
min read
LW
link
Writing With Robots
Against Moloch
9 Apr 2026 16:01 UTC
22
points
2
comments
14
min read
LW
link
Slightly-Super Persuasion Will Do
Tomás B.
9 Apr 2026 16:00 UTC
54
points
7
comments
4
min read
LW
link
Outrospection: Don’t Be A Rock
J Bostock
9 Apr 2026 15:45 UTC
24
points
0
comments
2
min read
LW
link
Have we already lost? Part 1: The Plan in 2024
LawrenceC
9 Apr 2026 6:47 UTC
56
points
3
comments
3
min read
LW
link
Stockfish is not a chess superintelligence (and it doesn’t need to be)
Sean Herrington
9 Apr 2026 4:29 UTC
24
points
6
comments
2
min read
LW
link
Do not be surprised if LessWrong gets hacked
RobertM
9 Apr 2026 3:42 UTC
160
points
30
comments
4
min read
LW
link
Back to top
Next