Bayesian Utility: Representing Preference by Probability Measures

Vladimir_Nesov · 27 Jul 2009 14:28 UTC
44 points
36 comments · 2 min read · LW link

Cryptographic Boxes for Unfriendly AI

paulfchristiano · 18 Dec 2010 8:28 UTC
48 points
162 comments · 5 min read · LW link

Anthropic decision theory I: Sleeping beauty and selflessness

Stuart_Armstrong · 1 Nov 2011 11:41 UTC
22 points
34 comments · 2 min read · LW link

Harsanyi’s Social Aggregation Theorem and what it means for CEV

AlexMennen · 5 Jan 2013 21:38 UTC
37 points
88 comments · 4 min read · LW link

SUDT: A toy decision theory for updateless anthropics

Benya · 23 Feb 2014 23:50 UTC
27 points
14 comments · 8 min read · LW link

Single player extensive-form games as a model of UDT

cousin_it · 25 Feb 2014 10:43 UTC
25 points
26 comments · 2 min read · LW link

Siren worlds and the perils of over-optimised search

Stuart_Armstrong · 7 Apr 2014 11:00 UTC
73 points
417 comments · 7 min read · LW link

Welcome!

Benya_Fallenstein · 4 Nov 2014 3:20 UTC
9 points
0 comments · 2 min read · LW link

Exploiting EDT

Benya_Fallenstein · 10 Nov 2014 19:59 UTC
20 points
0 comments · 2 min read · LW link

Predictors that don’t try to manipulate you(?)

Benya_Fallenstein · 15 Nov 2014 5:53 UTC
3 points
0 comments · 7 min read · LW link

Main vs. Discussion

Benya_Fallenstein · 15 Nov 2014 6:35 UTC
0 points
0 comments · 1 min read · LW link

An optimality result for modal UDT

Benya_Fallenstein · 15 Nov 2014 6:38 UTC
5 points
0 comments · 6 min read · LW link

Main vs. Discussion

Benya_Fallenstein · 15 Nov 2014 6:38 UTC
0 points
0 comments · 1 min read · LW link

A primer on provability logic

Benya_Fallenstein · 15 Nov 2014 6:39 UTC
3 points
0 comments · 4 min read · LW link

Simplicity priors with reflective oracles

Benya_Fallenstein · 15 Nov 2014 6:39 UTC
1 point
0 comments · 6 min read · LW link

Oracle machines instead of topological truth predicates

Benya_Fallenstein · 15 Nov 2014 6:39 UTC
2 points
0 comments · 7 min read · LW link

Topological truth predicates: Towards a model of perfect Bayesian agents

Benya_Fallenstein · 15 Nov 2014 6:39 UTC
6 points
0 comments · 9 min read · LW link

Approximability

abramdemski · 17 Nov 2014 0:41 UTC
3 points
0 comments · 2 min read · LW link

Stable self-improvement as a research problem

paulfchristiano · 17 Nov 2014 17:51 UTC
7 points
0 comments · 7 min read · LW link

Trustworthy automated philosophy?

Benya_Fallenstein · 21 Nov 2014 2:57 UTC
6 points
0 comments · 9 min read · LW link