Rohin Shah comments on Full toy model for preference learning