tom4everitt comments on CIRL Wireheading

tom4everitt 8 Aug 2017 6:29 UTC
0 points
That is a good question. I don’t think it is essential that the agent can move from $s_{1}$ to $s_{2}$, only that the agent is able to force a stay in $s_{2}$ if it wants to.

The transition from $s_{1}$ to $s_{2}$ could instead happen randomly with some probability.

The important thing is that the human’s action in $s_{1}$ does not reveal any information about $s_{2}$.
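
To make the transition structure concrete, here is a rough sketch of the variant I have in mind. Everything in it is an assumption for illustration: the state labels, the `"stay"` action name, the probability `p`, and the helper `rollout` are all hypothetical. It only models the dynamics (random entry into $s_{2}$, forced stay afterwards), not the human's actions or the reward.

```python
import random

S1, S2 = "s1", "s2"


def step(state, action, p, rng):
    """One transition of a hypothetical two-state environment."""
    if state == S1:
        # The agent's action is ignored in s1: the move to s2 is random,
        # happening with probability p.
        return S2 if rng.random() < p else S1
    # In s2 the agent can force a stay; any other action leads back to s1.
    return S2 if action == "stay" else S1


def rollout(p=0.3, horizon=50, seed=0):
    """Follow an agent that always plays 'stay' and record visited states."""
    rng = random.Random(seed)
    state = S1
    trajectory = [state]
    for _ in range(horizon):
        state = step(state, "stay", p, rng)
        trajectory.append(state)
    return trajectory
```

With an agent that always plays `"stay"`, the trajectory sits in $s_{1}$ until the random transition fires and then remains in $s_{2}$ for the rest of the episode, which is all the argument needs from the dynamics.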