Even worse, having conservation of expected evidence for every action sequence is not enough to make the AI behave well. Jessica’s example of an AI that (to reuse the “human says” example for the moment) forces the human to answer a question at random satisfies conservation of expected evidence, but not the other properties we want, such as conditional conservation of expected evidence (this is related to the ultra-sophisticated Cake or Death problem).
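To make the failure mode concrete, here is a minimal sketch (with made-up probabilities, not numbers from the original post) of why conservation of expected evidence — the prior equals the expected posterior over possible observations — holds both for an honest, informative answer and, trivially, for a forced random answer that carries no information at all:

```python
from fractions import Fraction

# Conservation of expected evidence:
#   P(H) = sum over answers e of P(e) * P(H | e)
# Toy numbers below are illustrative assumptions only.

prior_H = Fraction(1, 3)  # prior probability that hypothesis H is true

# Case 1: an informative question. The human says "yes" with
# probability 9/10 if H holds, 2/10 otherwise.
p_yes_given_H = Fraction(9, 10)
p_yes_given_notH = Fraction(2, 10)

p_yes = prior_H * p_yes_given_H + (1 - prior_H) * p_yes_given_notH
post_yes = prior_H * p_yes_given_H / p_yes
p_no = 1 - p_yes
post_no = prior_H * (1 - p_yes_given_H) / p_no

expected_posterior = p_yes * post_yes + p_no * post_no
assert expected_posterior == prior_H  # the law holds for honest answers

# Case 2: the AI forces a uniformly random answer, independent of H.
# Each answer is then uninformative, so every posterior equals the
# prior -- the law holds trivially, yet the AI learns nothing and has
# manipulated the human, which is exactly the bad behaviour described.
post_forced = prior_H  # P(H | forced random answer) = P(H)
expected_forced = Fraction(1, 2) * post_forced + Fraction(1, 2) * post_forced
assert expected_forced == prior_H
```

Both cases pass the same check, which is the point: conservation of expected evidence alone cannot distinguish genuinely asking the human from forcing a coin-flip answer out of them.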