However, I didn’t assume that the USG stepped in and prevented this scenario. I thought that it’s OpenAI’s Model Spec, Claude’s Constitution or something similar that would prevent an aligned AI grom being used as an absolute shield, since SOTA Specs do arguably require the AI to be objective.
I don’t think it’s obviously appropriate for AI companies to decide exactly what reflection process and exposure to society people who use their AIs need to have. And people might prefer to use AIs that do conform to their beliefs if they are allowed to buy access to them. So I think you might have to ban such AIs in order to prevent people from using them.
However, I didn’t assume that the USG stepped in and prevented this scenario. I thought that it’s OpenAI’s Model Spec, Claude’s Constitution or something similar that would prevent an aligned AI grom being used as an absolute shield, since SOTA Specs do arguably require the AI to be objective.
I don’t think it’s obviously appropriate for AI companies to decide exactly what reflection process and exposure to society people who use their AIs need to have. And people might prefer to use AIs that do conform to their beliefs if they are allowed to buy access to them. So I think you might have to ban such AIs in order to prevent people from using them.