Archive
Sequences
About
Search
Log In
Home
Featured
All
Tags
Recent
Comments
Questions
Events
Shortform
Alignment Forum
AF Comments
RSS
New
Hot
Active
Old
Page
1
Bayesian Utility: Representing Preference by Probability Measures
Vladimir_Nesov
27 Jul 2009 14:28 UTC
48
points
37
comments
2
min read
LW
link
Cryptographic Boxes for Unfriendly AI
paulfchristiano
18 Dec 2010 8:28 UTC
71
points
162
comments
5
min read
LW
link
Anthropic decision theory I: Sleeping beauty and selflessness
Stuart_Armstrong
1 Nov 2011 11:41 UTC
22
points
34
comments
2
min read
LW
link
Harsanyi’s Social Aggregation Theorem and what it means for CEV
AlexMennen
5 Jan 2013 21:38 UTC
37
points
90
comments
4
min read
LW
link
SUDT: A toy decision theory for updateless anthropics
Benya
23 Feb 2014 23:50 UTC
27
points
14
comments
8
min read
LW
link
Single player extensive-form games as a model of UDT
cousin_it
25 Feb 2014 10:43 UTC
26
points
26
comments
2
min read
LW
link
Siren worlds and the perils of over-optimised search
Stuart_Armstrong
7 Apr 2014 11:00 UTC
83
points
418
comments
7
min read
LW
link
Welcome!
Benya_Fallenstein
4 Nov 2014 3:20 UTC
9
points
2
comments
2
min read
LW
link
Exploiting EDT
Benya_Fallenstein
10 Nov 2014 19:59 UTC
25
points
10
comments
2
min read
LW
link
Predictors that don’t try to manipulate you(?)
Benya_Fallenstein
15 Nov 2014 5:53 UTC
3
points
1
comment
7
min read
LW
link
Main vs. Discussion
Benya_Fallenstein
15 Nov 2014 6:35 UTC
0
points
1
comment
1
min read
LW
link
An optimality result for modal UDT
Benya_Fallenstein
15 Nov 2014 6:38 UTC
11
points
0
comments
6
min read
LW
link
Main vs. Discussion
Benya_Fallenstein
15 Nov 2014 6:38 UTC
4
points
0
comments
1
min read
LW
link
A primer on provability logic
Benya_Fallenstein
15 Nov 2014 6:39 UTC
8
points
3
comments
4
min read
LW
link
Simplicity priors with reflective oracles
Benya_Fallenstein
15 Nov 2014 6:39 UTC
1
point
0
comments
6
min read
LW
link
Oracle machines instead of topological truth predicates
Benya_Fallenstein
15 Nov 2014 6:39 UTC
2
points
13
comments
7
min read
LW
link
Topological truth predicates: Towards a model of perfect Bayesian agents
Benya_Fallenstein
15 Nov 2014 6:39 UTC
14
points
8
comments
9
min read
LW
link
Approximability
abramdemski
17 Nov 2014 0:41 UTC
3
points
2
comments
2
min read
LW
link
Stable self-improvement as a research problem
paulfchristiano
17 Nov 2014 17:51 UTC
8
points
7
comments
7
min read
LW
link
Trustworthy automated philosophy?
Benya_Fallenstein
21 Nov 2014 2:57 UTC
6
points
3
comments
9
min read
LW
link
Back to top
Next