David Matolcsi comments on Training a Reward Hacker Despite Perfect Labels