Rohin Shah comments on The reward engineering problem