Yeah, this is the largest failure of the AI Alignment community so far, since we do have a viable boxing method already: Simboxing.
To be fair here, one large portion of the problem is that OpenAI and other AI companies would probably resist this, and they still want to have the most capabilities that they can get, which is anathema to boxing. Still, it’s the largest failure I saw so far.
Yeah, this is the largest failure of the AI Alignment community so far, since we do have a viable boxing method already: Simboxing.
To be fair here, one large portion of the problem is that OpenAI and other AI companies would probably resist this, and they still want to have the most capabilities that they can get, which is anathema to boxing. Still, it’s the largest failure I saw so far.
Link below on Simboxing:
https://www.lesswrong.com/posts/WKGZBCYAbZ6WGsKHc/love-in-a-simbox-is-all-you-need