research_prime_space comments on Penalize Model Complexity Via Self-Distillation