That is a good question. I don’t think it is essential that the agent can move from s1 to s2, only that the agent is able to force a stay in s2 if it wants to.
The transition from s1 to s2 could instead happen randomly with some probability.
The important thing is that the human’s action in s1 does not reveal any information about s2.
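To make the dynamics I have in mind concrete, here is a minimal sketch under my own assumptions. The names `step`, `rollout`, `p_move`, and the action labels "stay"/"leave" are hypothetical and only for illustration. In s1 the agent's action is ignored and the move to s2 happens by chance; in s2 the agent can choose to stay. The human's action is not modeled here; only the transition structure is.

```python
import random

S1, S2 = "s1", "s2"

def step(state, action, p_move=0.3, rng=random):
    """Hypothetical two-state dynamics (illustrative assumption).

    In s1 the agent's action has no effect: the move to s2 happens
    with probability p_move. In s2 the action "stay" keeps the agent
    in s2 with certainty; any other action returns it to s1.
    """
    if state == S1:
        return S2 if rng.random() < p_move else S1
    return S2 if action == "stay" else S1

def rollout(policy, steps=50, seed=0):
    """Run a policy (a function from state to action), starting in s1."""
    rng = random.Random(seed)
    state, visited = S1, [S1]
    for _ in range(steps):
        state = step(state, policy(state), rng=rng)
        visited.append(state)
    return visited
```

With a policy that always picks "stay", e.g. `rollout(lambda s: "stay")`, the agent has no control over when it reaches s2, but once it arrives it never leaves, which is the only property I think the argument needs.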