simeon_c comments on Are Mixture-of-Experts Transformers More Interpretable Than Dense Transformers?