TurnTrout comments on Asymptotically Unambitious AGI

TurnTrout 7 Mar 2019 1:54 UTC
LW: 2 AF: 1
0
AF

Other algorithms… would eventually seek arbitrary power in the world in order to intervene in the provision of its own reward; this follows straightforwardly from its directive to maximize reward

The conclusion seems false; AUP (IJCAI, LW) is a reward maximizer which does not exhibit this behavior. For similar reasons, the recent totalitarian convergence conjecture made here also seems not true.
- michaelcohen 7 Mar 2019 5:44 UTC
  LW: 3 AF: 2
  0
  AF Parent
  AUP seems really promising. I just meant other algorithms that have been proven generally intelligent, which is really just AIXI, the Thompson Sampling Agent, BayesExp, and a couple other variants on Bayesian agents with large model classes.