Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
New
Hot
Active
Old
Page
1
Every Major LLM Endorses Newcomb One-Boxing
jackmastermind
15 Jun 2025 20:44 UTC
13
points
1
comment
1
min read
LW
link
(jacktlab.substack.com)
FDT Does Not Endorse Itself in Asymmetric Games
jackmastermind
15 Jun 2025 20:44 UTC
13
points
1
comment
5
min read
LW
link
Can We Change the Goals of a Toy RL Agent?
tuphs
and
Adrià Garriga-alonso
15 Jun 2025 20:34 UTC
1
point
0
comments
9
min read
LW
link
Some reprogenetics-related projects you could help with
TsviBT
15 Jun 2025 20:25 UTC
68
points
0
comments
4
min read
LW
link
Intelligence Is Not Magic, But Your Threshold For “Magic” Is Pretty Low
Expertium
15 Jun 2025 15:23 UTC
85
points
13
comments
1
min read
LW
link
Estrogen: A trip report
cube_flipper
15 Jun 2025 13:15 UTC
62
points
0
comments
27
min read
LW
link
(smoothbrains.net)
Book review: Air-borne by Carl Zimmer
eukaryote
15 Jun 2025 5:49 UTC
24
points
0
comments
11
min read
LW
link
(eukaryotewritesblog.com)
Endometriosis is an incredibly interesting disease
Abhishaike Mahajan
14 Jun 2025 22:14 UTC
79
points
0
comments
16
min read
LW
link
(www.owlposting.com)
Field Notes from Shipping Real Code with Claude
creatorrr
14 Jun 2025 16:36 UTC
18
points
0
comments
12
min read
LW
link
(diwank.space)
Training Superior Sparse Autoencoders for Instruct Models
Haoran Ye
14 Jun 2025 16:35 UTC
3
points
0
comments
7
min read
LW
link
A Very Simple Case For Giving To Shrimp
Bentham's Bulldog
14 Jun 2025 15:31 UTC
0
points
1
comment
3
min read
LW
link
Why we’re still doing normal school
juliawise
14 Jun 2025 12:40 UTC
69
points
0
comments
3
min read
LW
link
Coaching AI: A Relational Approach to AI Safety
Priyanka Bharadwaj
14 Jun 2025 12:14 UTC
2
points
0
comments
5
min read
LW
link
What Caused the Fertility Collapse?
Zero Contradictions
14 Jun 2025 7:15 UTC
−3
points
2
comments
4
min read
LW
link
Relocation triggers
denkenberger
14 Jun 2025 6:36 UTC
2
points
0
comments
1
min read
LW
link
[Question]
How could I tell someone that consciousness is not the primary concern of AI Safety?
Lysandre Terrisse
13 Jun 2025 22:44 UTC
11
points
2
comments
3
min read
LW
link
Debate experiments at The Curve, LessOnline and Manifest
Nathan Young
13 Jun 2025 22:35 UTC
30
points
8
comments
5
min read
LW
link
(nathanpmyoung.substack.com)
Futarchy’s fundamental flaw
dynomight
13 Jun 2025 22:08 UTC
86
points
23
comments
9
min read
LW
link
(dynomight.net)
The Pros and Cons of Being Among Your Tribe
Sable
13 Jun 2025 21:41 UTC
29
points
0
comments
7
min read
LW
link
(affablyevil.substack.com)
Constraining Minds, Not Goals: A Structural Approach to AI Alignment
Johannes C. Mayer
13 Jun 2025 21:06 UTC
15
points
0
comments
9
min read
LW
link
Back to top
Next