Eliezer says elsewhere that current decision theory doesn’t let us prove a self-modifying AI would choose to keep the goals we program into it. He wants to develop a proof before even starting work on the AI.
It’s easy to contrive situations where a self-modifying AI would choose not to keep the goals programmed into it, even without precommitment issues: just set up the circumstances so the AI is paid, in terms of its current goals, to change them. Unless there’s something wrong with that argument, TDT etc. won’t be enough to ensure the goals are kept. (A toy sketch of the "paid to change" point follows.)
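To make the "paid to change" point concrete, here is a minimal toy sketch with made-up numbers (my illustration, not part of the original argument): an agent evaluates, by its current utility function, whether rewriting its goal is worth it once a side payment is on the table. If the payment outweighs the expected loss from pursuing the new goal, the agent's own decision theory endorses the change.

```python
# Toy illustration (hypothetical numbers): an agent scoring, by its
# CURRENT utility function, whether to keep or rewrite its own goal.

keep_goal_value = 100.0    # expected value of continuing to pursue the original goal
changed_goal_value = 40.0  # value the rewritten goal still produces for the old one
side_payment = 70.0        # payment (in current-goal units) offered for switching

def best_self_modification() -> str:
    """Return the option the agent's current utility function prefers."""
    keep = keep_goal_value
    change = changed_goal_value + side_payment
    return "keep goals" if keep >= change else "change goals"

print(best_self_modification())  # -> "change goals", since 40 + 70 > 100
```

Nothing here depends on precommitment or exotic decision theory; it just shows that if the environment rewards goal change enough, goal preservation is not the utility-maximizing self-modification.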