Andy Arditi comments on Mechanistically Eliciting Latent Behaviors in Language Models