Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Benya_Fallenstein
Karma:
270
All
Posts
Comments
New
Top
Old
Page
1
Forum Digest: Corrigibility, utility indifference, & related control ideas
Benya_Fallenstein
24 Mar 2015 17:39 UTC
35
points
5
comments
4
min read
LW
link
Single-bit reflective oracles are enough
Benya_Fallenstein
17 Mar 2015 23:00 UTC
5
points
2
comments
1
min read
LW
link
An implementation of modal UDT
Benya_Fallenstein
11 Feb 2015 6:02 UTC
8
points
0
comments
1
min read
LW
link
Generalizing the Corrigibility paper’s impossibility result?
Benya_Fallenstein
4 Feb 2015 3:16 UTC
4
points
1
comment
4
min read
LW
link
On notation for modal UDT
Benya_Fallenstein
3 Feb 2015 19:26 UTC
2
points
1
comment
4
min read
LW
link
From halting oracles to modal logic
Benya_Fallenstein
3 Feb 2015 19:26 UTC
1
point
4
comments
6
min read
LW
link
Third-person counterfactuals
Benya_Fallenstein
3 Feb 2015 1:13 UTC
4
points
4
comments
6
min read
LW
link
The odd counterfactuals of playing chicken
Benya_Fallenstein
2 Feb 2015 7:15 UTC
6
points
0
comments
8
min read
LW
link
Multibit reflective oracles
Benya_Fallenstein
25 Jan 2015 2:23 UTC
5
points
1
comment
8
min read
LW
link
“Evil” decision problems in provability logic
Benya_Fallenstein
10 Jan 2015 1:04 UTC
6
points
4
comments
5
min read
LW
link
Utility indifference and infinite improbability drives
Benya_Fallenstein
29 Nov 2014 6:26 UTC
2
points
0
comments
7
min read
LW
link
Improving the modal UDT optimality result
Benya_Fallenstein
23 Nov 2014 22:16 UTC
13
points
2
comments
2
min read
LW
link
Trustworthy automated philosophy?
Benya_Fallenstein
21 Nov 2014 2:57 UTC
6
points
3
comments
9
min read
LW
link
Topological truth predicates: Towards a model of perfect Bayesian agents
Benya_Fallenstein
15 Nov 2014 6:39 UTC
14
points
8
comments
9
min read
LW
link
Oracle machines instead of topological truth predicates
Benya_Fallenstein
15 Nov 2014 6:39 UTC
2
points
13
comments
7
min read
LW
link
Simplicity priors with reflective oracles
Benya_Fallenstein
15 Nov 2014 6:39 UTC
1
point
0
comments
6
min read
LW
link
A primer on provability logic
Benya_Fallenstein
15 Nov 2014 6:39 UTC
8
points
3
comments
4
min read
LW
link
Main vs. Discussion
Benya_Fallenstein
15 Nov 2014 6:38 UTC
4
points
0
comments
1
min read
LW
link
An optimality result for modal UDT
Benya_Fallenstein
15 Nov 2014 6:38 UTC
11
points
0
comments
6
min read
LW
link
Main vs. Discussion
Benya_Fallenstein
15 Nov 2014 6:35 UTC
0
points
1
comment
1
min read
LW
link
Back to top
Next