I don’t think this demonstration truly captures treacherous turns, precisely because the agent has to learn how it can misbehave over multiple trials. As I understand it, a treacherous turn involves the agent modeling the environment well enough to predict the payoff of misbehaving before it takes any overt action. What is happening here is instead the Goertzel prediction.
It’s important to start getting a grasp on how treacherous turns may work, and this demonstration helps; my disagreement is on how to label it.
Currently we can access all course materials at once. For the time being, it might be better to hide the incomplete bits so nobody can wander ahead and miss things. Alternatively, or in addition, it might be better to make users try one section before unlocking the next; otherwise people might put off the hard sections indefinitely.
That said, the platform looks new, so it might not support this.