Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
New
Hot
Active
Old
Page
1
Intention to Treat
Alicorn
20 Mar 2025 20:01 UTC
68
points
3
comments
2
min read
LW
link
Everything’s An Emergency
omnizoid
20 Mar 2025 17:12 UTC
7
points
0
comments
2
min read
LW
link
Non-Consensual Consent: The Performance of Choice in a Coercive World
Alex_Steiner
20 Mar 2025 17:12 UTC
9
points
0
comments
13
min read
LW
link
[Question]
How far along Metr’s law can AI start automating or helping with alignment research?
Christopher King
20 Mar 2025 15:58 UTC
17
points
13
comments
1
min read
LW
link
What is an alignment tax?
Vishakha
and
Algon
20 Mar 2025 13:06 UTC
2
points
0
comments
1
min read
LW
link
(aisafety.info)
Longtermist Implications of the Existence Neutrality Hypothesis
Maxime Riché
20 Mar 2025 12:20 UTC
2
points
2
comments
21
min read
LW
link
Defense Against The Super-Worms
viemccoy
20 Mar 2025 7:24 UTC
11
points
0
comments
2
min read
LW
link
Socially Graceful Degradation
Screwtape
20 Mar 2025 4:03 UTC
37
points
0
comments
9
min read
LW
link
Daniel Dennett, the Unity of Consciousness, and Animal Minds
stormykat
20 Mar 2025 3:43 UTC
1
point
0
comments
6
min read
LW
link
Improved visualizations of METR Time Horizons paper.
LDJ
19 Mar 2025 23:36 UTC
20
points
4
comments
2
min read
LW
link
The case against “The case against AI alignment”
KvmanThinking
19 Mar 2025 22:40 UTC
2
points
0
comments
1
min read
LW
link
[Question]
Superintelligence Strategy: A Pragmatic Path to… Doom?
Mr Beastly
19 Mar 2025 22:30 UTC
6
points
0
comments
3
min read
LW
link
SHIFT relies on token-level features to de-bias Bias in Bios probes
Tim Hua
19 Mar 2025 21:29 UTC
28
points
2
comments
6
min read
LW
link
Forecasting AI Futures Resource Hub
Alvin Ånestrand
19 Mar 2025 17:26 UTC
2
points
0
comments
2
min read
LW
link
(forecastingaifutures.substack.com)
TBC episode w Dave Kasten from Control AI on AI Policy
Eneasz
19 Mar 2025 17:09 UTC
8
points
0
comments
1
min read
LW
link
(www.thebayesianconspiracy.com)
Prioritizing threats for AI control
ryan_greenblatt
19 Mar 2025 17:09 UTC
47
points
2
comments
10
min read
LW
link
The Illusion of Transparency as a Trust-Building Mechanism
Priyanka Bharadwaj
19 Mar 2025 17:09 UTC
1
point
0
comments
1
min read
LW
link
How Do We Govern AI Well?
kaime
19 Mar 2025 17:08 UTC
2
points
0
comments
25
min read
LW
link
METR: Measuring AI Ability to Complete Long Tasks
Zach Stein-Perlman
19 Mar 2025 16:00 UTC
173
points
42
comments
1
min read
LW
link
(metr.org)
Why I think AI will go poorly for humanity
Alek Westover
19 Mar 2025 15:52 UTC
11
points
0
comments
30
min read
LW
link
Back to top
Next