The AI-box experiment and this result are barely related at all—the only connection between them is that they both involve deception in some manner. “There will exist future AI systems which sometimes behave deceptively” can hardly be considered a meaningful advance prediction.
The AI-box experiment and this result are barely related at all—the only connection between them is that they both involve deception in some manner. “There will exist future AI systems which sometimes behave deceptively” can hardly be considered a meaningful advance prediction.