reinthal comments on Monitoring benchmark for AI control