Biological Computing Underhang
Elliot Callender
10 Apr 2026 20:49 UTC
3
points
0
comments
2
min read
LW
link
The Unintelligibility is Ours: Notes on Chain-of-Thought
1a3orn
10 Apr 2026 18:33 UTC
26
points
3
comments
9
min read
LW
link
Anthropic is Really Pushing the Frontier, What Should We Think?
Corm
10 Apr 2026 18:25 UTC
12
points
0
comments
11
min read
LW
link
In Defense of Debate
Elliot Temple
10 Apr 2026 17:50 UTC
1
point
0
comments
7
min read
LW
link
“Close Enough” as a Primitive in Intelligent Systems
J Bostock
10 Apr 2026 16:04 UTC
10
points
0
comments
2
min read
LW
link
Foundational Beliefs
Against Moloch
10 Apr 2026 15:27 UTC
9
points
0
comments
4
min read
LW
link
(againstmoloch.com)
Why Control Creates Conflict, and When to Open Instead
plex
10 Apr 2026 14:56 UTC
47
points
2
comments
3
min read
LW
link
On creating ‘new knobs of control’ in biology
Abhishaike Mahajan
10 Apr 2026 13:23 UTC
15
points
0
comments
16
min read
LW
link
Chocolate Sloths, Tinder, and Moral Backstops
J Bostock
10 Apr 2026 12:18 UTC
25
points
0
comments
4
min read
LW
link
Reproducing steering against evaluation awareness in a large open-weight model
Thomas Read, Bronson Schoen and Joseph Bloom
10 Apr 2026 10:45 UTC
64
points
9
comments
15
min read
LW
link
Have we already lost? Part 2: Reasons for Doom
LawrenceC
10 Apr 2026 6:56 UTC
43
points
3
comments
3
min read
LW
link
An AI alignment research agenda based on asymmetric debate and monitoring.
emanuelr
10 Apr 2026 6:23 UTC
3
points
0
comments
17
min read
LW
link
Inkhaven menu, part 2
David Scott Krueger (formerly: capybaralet)
10 Apr 2026 5:20 UTC
8
points
0
comments
3
min read
LW
link
(therealartificialintelligence.substack.com)
Linear vs Non-linear Probes for Interpretability
NickyP
10 Apr 2026 4:44 UTC
13
points
0
comments
4
min read
LW
link
(blog.sus.cat)
AI identity is not tied to its model
Sean Herrington
10 Apr 2026 4:07 UTC
18
points
10
comments
2
min read
LW
link
Anthropic did not publish a “risk discussion” of Mythos when required by their RSP
RobertM
10 Apr 2026 3:52 UTC
78
points
4
comments
5
min read
LW
link
Some takes on UV & cancer
Steven Byrnes
10 Apr 2026 0:31 UTC
44
points
6
comments
9
min read
LW
link
My Specific Singularity Timeline to Utopia
Michael Soareverix
10 Apr 2026 0:11 UTC
5
points
0
comments
6
min read
LW
link
Model organisms researchers should check whether high LRs defeat their model organisms
dx26, Sebastian Prasanna, Alek Westover, Vivek Hebbar and Julian Stastny
10 Apr 2026 0:07 UTC
36
points
0
comments
5
min read
LW
link
Climbing Mountains We Cannot Name
Tharin
9 Apr 2026 22:28 UTC
3
points
0
comments
4
min read
LW
link
(www.echoesandchimes.com)