Daniel Kokotajlo comments on A Toy Environment For Exploring Reasoning About Reward