Charlie Steiner comments on Full toy model for preference learning