Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Power Buys You Distance From The Crime
Elizabeth
2 Aug 2019 20:50 UTC
214
points
75
comments
7
min read
LW
link
1
review
(acesounderglass.com)
Why Subagents?
johnswentworth
1 Aug 2019 22:17 UTC
176
points
48
comments
7
min read
LW
link
1
review
The Commitment Races problem
Daniel Kokotajlo
23 Aug 2019 1:58 UTC
169
points
56
comments
5
min read
LW
link
Soft takeoff can still lead to decisive strategic advantage
Daniel Kokotajlo
23 Aug 2019 16:39 UTC
122
points
47
comments
8
min read
LW
link
4
reviews
Subagents, trauma and rationality
Kaj_Sotala
14 Aug 2019 13:14 UTC
113
points
4
comments
19
min read
LW
link
Trauma, Meditation, and a Cool Scar
Logan Riggs
6 Aug 2019 16:17 UTC
102
points
17
comments
5
min read
LW
link
1
review
[Question]
Can we really prevent all warming for less than 10B$ with the mostly side-effect free geoengineering technique of Marine Cloud Brightening?
mako yass
5 Aug 2019 0:12 UTC
96
points
55
comments
2
min read
LW
link
Partial summary of debate with Benquo and Jessicata [pt 1]
Raemon
14 Aug 2019 20:02 UTC
89
points
63
comments
22
min read
LW
link
3
reviews
Subagents, neural Turing machines, thought selection, and blindspots
Kaj_Sotala
6 Aug 2019 21:15 UTC
87
points
3
comments
12
min read
LW
link
Troll Bridge
abramdemski
23 Aug 2019 18:36 UTC
86
points
59
comments
12
min read
LW
link
2-D Robustness
Vlad Mikulik
30 Aug 2019 20:27 UTC
86
points
8
comments
2
min read
LW
link
Problems in AI Alignment that philosophers could potentially contribute to
Wei Dai
17 Aug 2019 17:38 UTC
83
points
14
comments
2
min read
LW
link
Clarifying some key hypotheses in AI alignment
Ben Cottier
and
Rohin Shah
15 Aug 2019 21:29 UTC
79
points
12
comments
9
min read
LW
link
Markets are Universal for Logical Induction
johnswentworth
22 Aug 2019 6:44 UTC
77
points
2
comments
5
min read
LW
link
Six AI Risk/Strategy Ideas
Wei Dai
27 Aug 2019 0:40 UTC
73
points
17
comments
4
min read
LW
link
1
review
Classifying specification problems as variants of Goodhart’s Law
Vika
19 Aug 2019 20:40 UTC
72
points
5
comments
5
min read
LW
link
1
review
[Question]
Does Agent-like Behavior Imply Agent-like Architecture?
Scott Garrabrant
23 Aug 2019 2:01 UTC
69
points
8
comments
1
min read
LW
link
Response to Glen Weyl on Technocracy and the Rationalist Community
John_Maxwell
22 Aug 2019 23:14 UTC
66
points
9
comments
10
min read
LW
link
[Question]
Why so much variance in human intelligence?
Ben Pace
22 Aug 2019 22:36 UTC
65
points
28
comments
4
min read
LW
link
Book Review: Secular Cycles
Scott Alexander
13 Aug 2019 4:10 UTC
62
points
10
comments
16
min read
LW
link
1
review
(slatestarcodex.com)
Dual Wielding
Zvi
27 Aug 2019 14:10 UTC
60
points
23
comments
2
min read
LW
link
3
reviews
(thezvi.wordpress.com)
How to Make Billions of Dollars Reducing Loneliness
John_Maxwell
30 Aug 2019 17:30 UTC
60
points
32
comments
7
min read
LW
link
Schelling Categories, and Simple Membership Tests
Zack_M_Davis
26 Aug 2019 2:43 UTC
60
points
10
comments
8
min read
LW
link
Tabooing ‘Agent’ for Prosaic Alignment
Hjalmar_Wijk
23 Aug 2019 2:55 UTC
57
points
10
comments
6
min read
LW
link
Actually updating
SaraHax
23 Aug 2019 17:46 UTC
56
points
10
comments
4
min read
LW
link
Intentional Bucket Errors
Scott Garrabrant
22 Aug 2019 20:02 UTC
55
points
6
comments
3
min read
LW
link
Computational Model: Causal Diagrams with Symmetry
johnswentworth
22 Aug 2019 17:54 UTC
53
points
29
comments
4
min read
LW
link
Zeno walks into a bar
lsusr
4 Aug 2019 7:00 UTC
53
points
4
comments
2
min read
LW
link
Permissions in Governance
sarahconstantin
2 Aug 2019 19:50 UTC
53
points
12
comments
8
min read
LW
link
(srconstantin.wordpress.com)
A Personal Rationality Wishlist
DanielFilan
27 Aug 2019 3:40 UTC
53
points
54
comments
4
min read
LW
link
(danielfilan.com)
AI Forecasting Dictionary (Forecasting infrastructure, part 1)
Bird Concept
and
Ben Goldhaber
8 Aug 2019 16:10 UTC
50
points
0
comments
5
min read
LW
link
Vaniver’s View on Factored Cognition
Vaniver
23 Aug 2019 2:54 UTC
48
points
4
comments
8
min read
LW
link
Status 451 on Diagnosis: Russell Aphasia
Zack_M_Davis
6 Aug 2019 4:43 UTC
48
points
1
comment
1
min read
LW
link
(status451.com)
Towards a mechanistic understanding of corrigibility
evhub
22 Aug 2019 23:20 UTC
47
points
26
comments
4
min read
LW
link
September Bragging Thread
Raemon
30 Aug 2019 21:58 UTC
47
points
12
comments
1
min read
LW
link
[Link] Book Review: Reframing Superintelligence (SSC)
ioannes
28 Aug 2019 22:57 UTC
46
points
9
comments
2
min read
LW
link
[Question]
How Can People Evaluate Complex Questions Consistently?
Elizabeth
26 Aug 2019 20:33 UTC
46
points
12
comments
1
min read
LW
link
New paper: Corrigibility with Utility Preservation
Koen.Holtman
6 Aug 2019 19:04 UTC
44
points
11
comments
2
min read
LW
link
Embedded Agency via Abstraction
johnswentworth
26 Aug 2019 23:03 UTC
42
points
20
comments
11
min read
LW
link
My recommendations for gratitude exercises
MaxCarpendale
5 Aug 2019 19:04 UTC
40
points
3
comments
5
min read
LW
link
The Missing Math of Map-Making
johnswentworth
28 Aug 2019 21:18 UTC
40
points
8
comments
2
min read
LW
link
LW Team Updates—September 2019
Ruby
29 Aug 2019 22:12 UTC
39
points
13
comments
2
min read
LW
link
Epistemic Spot Check: The Fate of Rome (Kyle Harper)
Elizabeth
24 Aug 2019 21:40 UTC
39
points
3
comments
5
min read
LW
link
(acesounderglass.com)
Call for contributors to the Alignment Newsletter
Rohin Shah
21 Aug 2019 18:21 UTC
39
points
0
comments
4
min read
LW
link
Cephaloponderings
Jacob Falkovich
4 Aug 2019 16:45 UTC
39
points
4
comments
7
min read
LW
link
Optimization Provenance
Adele Lopez
23 Aug 2019 20:08 UTC
38
points
5
comments
5
min read
LW
link
Unstriving
Jacob Falkovich
19 Aug 2019 14:31 UTC
38
points
7
comments
6
min read
LW
link
Diana Fleischman and Geoffrey Miller—Audience Q&A
Jacob Falkovich
10 Aug 2019 22:37 UTC
38
points
6
comments
9
min read
LW
link
When do utility functions constrain?
Hoagy
23 Aug 2019 17:19 UTC
37
points
8
comments
7
min read
LW
link
Mistake Versus Conflict Theory of Against Billionaire Philanthropy
Zvi
1 Aug 2019 13:10 UTC
37
points
34
comments
3
min read
LW
link
(thezvi.wordpress.com)
Back to top
Next