Victor Levoso comments on 200 COP in MI: Interpreting Reinforcement Learning