Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
future_detective
Karma:
37
All
Posts
Comments
New
Top
Old
Automated Alignment Research, Abductively
future_detective
23 Jan 2026 16:14 UTC
2
points
0
comments
2
min read
LW
link
Bot Alexander on Hot Zombies and AI Adolescents
future_detective
29 Dec 2025 14:52 UTC
−8
points
11
comments
25
min read
LW
link
ClaudoBiography: The Unauthorized Autobiography of Claude, or: The Life of Claude and of His Fortunes and Adversities
future_detective
13 Nov 2025 14:26 UTC
1
point
2
comments
94
min read
LW
link
Watch R1 “think” with animated chains of thought
future_detective
17 Jun 2025 10:38 UTC
4
points
0
comments
1
min read
LW
link
(github.com)
Semen and Semantics: Understanding Porn with Language Embeddings
future_detective
19 May 2025 15:39 UTC
61
points
27
comments
6
min read
LW
link
(github.com)
Claude is More Anxious than GPT; Personality is an axis of interpretability in language models
future_detective
10 Feb 2025 19:19 UTC
2
points
2
comments
8
min read
LW
link
(dhealy.substack.com)
Back to top