Callum Lawson comments on Monitoring benchmark for AI control