interstice comments on Scott Aaronson on “Reform AI Alignment”

interstice 23 Nov 2022 4:57 UTC
3 points
5
The AI-box experiment and this result are barely related at all—the only connection between them is that they both involve deception in some manner. “There will exist future AI systems which sometimes behave deceptively” can hardly be considered a meaningful advance prediction.