Oliver Sourbut comments on Goodhart’s Law in Reinforcement Learning