GenericModel comments on Why AI Evaluation Regimes are bad