Zach Stein-Perlman comments on AI companies’ eval reports mostly don’t support their claims