Another possible advantage of multipolar AI: if we have multiple AI’s of frontier capability levels, then (ignoring the effects of race dynamics), that increases the probability that one of them is well aligned. Similarly it increases the probability that one is badly aligned. However, there is likely a capability level of ASI that we humans would be unable to control or defeat, but where if we sided with one of two ASIs of that level that were having a conflict, our contribution could still swing the balance and affect which of them won. It also makes protocols like debate more convincing, if the debaters are AIs that are actually of unrelated design: they seem less likely to be colluding to deceive us.
Another possible advantage of multipolar AI: if we have multiple AI’s of frontier capability levels, then (ignoring the effects of race dynamics), that increases the probability that one of them is well aligned. Similarly it increases the probability that one is badly aligned. However, there is likely a capability level of ASI that we humans would be unable to control or defeat, but where if we sided with one of two ASIs of that level that were having a conflict, our contribution could still swing the balance and affect which of them won. It also makes protocols like debate more convincing, if the debaters are AIs that are actually of unrelated design: they seem less likely to be colluding to deceive us.