michaelcohen comments on Reward learning summary