Aaron Sandoval comments on Monitoring benchmark for AI control