Can comments on Understanding mesa-optimization using toy models