scasper comments on Penalize Model Complexity Via Self-Distillation