For what it’s worth, I would expect the behavior you described, and suspect a malicious explanation overall.
Anecdotally, Claude lies/cheats an enormous amount (far more than comparable frontier models).
For what it’s worth, I would expect the behavior you described, and suspect a malicious explanation overall.
Anecdotally, Claude lies/cheats an enormous amount (far more than comparable frontier models).