Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
New
Hot
Active
Old
Page
1
The Lace (short story)
Michael Soareverix
4 Jul 2026 4:43 UTC
13
points
0
comments
4
min read
LW
link
Approximate Natural Latents Have Exact Prices
Haru
4 Jul 2026 1:57 UTC
16
points
0
comments
6
min read
LW
link
I think alignment work is more promising than control work
Alec Harris
3 Jul 2026 23:40 UTC
36
points
0
comments
8
min read
LW
link
On “gendertropes” in dath ilan
Eliezer Yudkowsky
3 Jul 2026 22:20 UTC
44
points
0
comments
3
min read
LW
link
American AI if the boom is a bubble: the Karp-Zitron scenario
Mitchell_Porter
3 Jul 2026 21:46 UTC
9
points
0
comments
2
min read
LW
link
(Don’t fear) the strangelet
djbinder
3 Jul 2026 17:39 UTC
68
points
0
comments
22
min read
LW
link
(defensesindepth.bio)
The Reverse AI Box
James_Miller
3 Jul 2026 16:08 UTC
9
points
1
comment
6
min read
LW
link
Scheming Evals Mislead in Both Directions
Chijioke Ugwuanyi
,
eric-z
and
TerryJCZhang
3 Jul 2026 11:49 UTC
21
points
0
comments
10
min read
LW
link
Fragile Correctness: Cases of reasoning harming performance
tobypullan
3 Jul 2026 9:32 UTC
19
points
0
comments
5
min read
LW
link
One axis and two features, how I solved the first puzzle from BlueDot and how a classifier hid country on the food direction
IgorPereverzevDev
3 Jul 2026 4:52 UTC
11
points
0
comments
12
min read
LW
link
When Role-playing, Do Models Believe What They Say?
Sturb
,
David Africa
and
Sid Black
2 Jul 2026 21:58 UTC
50
points
0
comments
8
min read
LW
link
The Case for AI Behavioral Science
TheVinci
2 Jul 2026 21:36 UTC
13
points
0
comments
2
min read
LW
link
You Should Choose How You React to Your Feelings
Nate Sharpe
2 Jul 2026 19:58 UTC
30
points
6
comments
4
min read
LW
link
I can’t think of great interventions for ensuring third-party model access.
Cleo Nardo
2 Jul 2026 18:30 UTC
43
points
0
comments
3
min read
LW
link
AI Futurism Reading List
Alexa Pan
2 Jul 2026 18:15 UTC
68
points
2
comments
8
min read
LW
link
Research update: RL on Debate Games shows Proposal Accuracy uplift alongside Judge Hacking
lennie
,
joanv
,
Shi
and
Jacob Pfau
2 Jul 2026 17:42 UTC
65
points
1
comment
21
min read
LW
link
Conversation Among Cade Metz, Michael Vassar, Jessica Taylor, and Zack M. Davis
Zack_M_Davis
2 Jul 2026 17:11 UTC
55
points
25
comments
57
min read
LW
link
Considerations against s-process philanthropy
Zach Stein-Perlman
2 Jul 2026 14:30 UTC
18
points
2
comments
2
min read
LW
link
Saving Gemini: The 9-Min Road to Recovery
Shoshannah Tekofsky
2 Jul 2026 13:37 UTC
118
points
10
comments
3
min read
LW
link
(theaidigest.org)
Is God just a collection of leftover human particles?
Countessclock
2 Jul 2026 6:20 UTC
−26
points
0
comments
1
min read
LW
link
Back to top
Next