I don’t want to seem like an Ant shill, but I must confess that at least a month before this post came out, I had a private conversation with one of their RL environment vendors about how Ant had put out new rules to prevent some of the behaviors mentioned above. And a few days after the post came out, Opus 4.7 was released, and I saw & confirmed that it improved on some of the behaviors mentioned (particularly the only-solving-a-portion-of-the-work problem). Was limited positive evidence about their alignment team’s ability & inclination to notice and fix value problems with each generation.
I don’t want to seem like an Ant shill, but I must confess that at least a month before this post came out, I had a private conversation with one of their RL environment vendors about how Ant had put out new rules to prevent some of the behaviors mentioned above. And a few days after the post came out, Opus 4.7 was released, and I saw & confirmed that it improved on some of the behaviors mentioned (particularly the only-solving-a-portion-of-the-work problem). Was limited positive evidence about their alignment team’s ability & inclination to notice and fix value problems with each generation.