Gordon Seidoh Worley comments on Model Mis-specification and Inverse Reinforcement Learning