1stuserhere comments on Are Mixture-of-Experts Transformers More Interpretable Than Dense Transformers?