Yes Requires the Possibility of No
Scott Garrabrant · 17 May 2019 22:39 UTC · 261 points · 55 comments · 2 min read · 2 reviews

The Parable of Predict-O-Matic
abramdemski · 15 Oct 2019 0:49 UTC · 342 points · 41 comments · 14 min read · 2 reviews

Humans Who Are Not Concentrating Are Not General Intelligences
sarahconstantin · 25 Feb 2019 20:40 UTC · 186 points · 35 comments · 6 min read · 1 review · (srconstantin.wordpress.com)

Understanding “Deep Double Descent”
evhub · 6 Dec 2019 0:00 UTC · 149 points · 51 comments · 5 min read · 4 reviews

Book Review: The Secret Of Our Success
Scott Alexander · 5 Jun 2019 6:50 UTC · 158 points · 19 comments · 25 min read · 2 reviews · (slatestarcodex.com)

Moloch Hasn’t Won
Zvi · 28 Dec 2019 16:30 UTC · 179 points · 40 comments · 7 min read · 1 review · (thezvi.wordpress.com)

The Power to Demolish Bad Arguments
Liron · 2 Sep 2019 12:57 UTC · 97 points · 83 comments · 11 min read · 6 reviews

Being the (Pareto) Best in the World
johnswentworth · 24 Jun 2019 18:36 UTC · 405 points · 57 comments · 3 min read · 3 reviews

Asymmetric Justice
Zvi · 25 Apr 2019 16:00 UTC · 230 points · 101 comments · 5 min read · 2 reviews · (thezvi.wordpress.com)

Noticing Frame Differences
Raemon · 30 Sep 2019 1:24 UTC · 210 points · 39 comments · 9 min read · 2 reviews

Excerpts from a larger discussion about simulacra
Benquo · 10 Apr 2019 21:27 UTC · 53 points · 40 comments · 6 min read · 5 reviews · (benjaminrosshoffman.com)

Coherent decisions imply consistent utilities
Eliezer Yudkowsky · 12 May 2019 21:33 UTC · 148 points · 81 comments · 26 min read · 3 reviews

Risks from Learned Optimization: Introduction
evhub, Chris van Merwijk, Vlad Mikulik, Joar Skalse and Scott Garrabrant · 31 May 2019 23:44 UTC · 184 points · 42 comments · 12 min read · 3 reviews

Rule Thinkers In, Not Out
Scott Alexander · 27 Feb 2019 2:40 UTC · 221 points · 67 comments · 4 min read · 4 reviews · (slatestarcodex.com)

The Curse Of The Counterfactual
pjeby · 1 Nov 2019 18:34 UTC · 124 points · 34 comments · 19 min read · 1 review

From Personal to Prison Gangs: Enforcing Prosocial Behavior
johnswentworth · 24 Jan 2019 18:07 UTC · 146 points · 26 comments · 5 min read · 2 reviews

Integrity and accountability are core parts of rationality
habryka · 15 Jul 2019 20:22 UTC · 153 points · 66 comments · 6 min read · 1 review

Selection vs Control
abramdemski · 2 Jun 2019 7:01 UTC · 168 points · 25 comments · 11 min read · 2 reviews

Alignment Research Field Guide
abramdemski · 8 Mar 2019 19:57 UTC · 264 points · 9 comments · 17 min read · 2 reviews

Tal Yarkoni: No, it’s not The Incentives—it’s you
Zack_M_Davis · 11 Jun 2019 7:09 UTC · 89 points · 119 comments · 1 min read · 4 reviews · (www.talyarkoni.org)