ErickBall comments on Asymptotically Unambitious AGI

ErickBall 6 Mar 2019 14:51 UTC
LW: 3 AF: 1
My concern is that, since CDT is not reflectively stable, a CDT agent may have incentives to create non-CDT agents in order to fulfill its instrumental goals.
- Wei Dai 6 Mar 2019 22:54 UTC
  LW: 3 AF: 2
  If I understand correctly, it’s actually updateless within an episode, and the episode is the only thing it cares about, so I don’t see how it would fail to be reflectively stable. Plus, even if it had an incentive to create a non-CDT agent, it would have to do that by outputting some message to the operator, and the operator wouldn’t be able to create a non-CDT agent without leaving the room, which would end the episode. (I guess it could hack the operator’s mind and create a non-CDT agent within it, but at that point it might as well just make the operator give it max rewards.)
  - michaelcohen 7 Mar 2019 0:09 UTC
    LW: 1 AF: 1
    With the correction that it is updateless and CDT (see here), I agree with the rest of this.