Decision-makers who need inspections to keep them in line are incentivized to subvert those inspections and betray the other party. It seems like what’s actually necessary is people in key positions who are willing to cooperate in the prisoner’s dilemma when they think they are playing against people like them—people who would cooperate even if there were no inspections.
But if there are inspections, then why do we need law-following AI? Why not have the inspections directly check that the AI would not harm the other party (hopefully because it would be helping humans in general)?