What’s your sense of how a bungled attempt at Friendliness compares to other things humans might do, in terms of how likely an AI would be to consider it a threat?
Fairly low. But that’s because I don’t think the first AIs are likely to be built by people trying to guarantee Friendliness. If a FriendlyAI proponent tries to rush to get done before another team could finish it could be a much bigger risk.
For my part, if I think about things people might do that might cause a powerful AI to feel threatened and thereby have significantly bad results, FAI theory and implementation not only doesn’t float to the top of the list, it’s hardly even visible in the hypothesis space (unless, as here, I privilege it inordinately by artificially priming it).
What’s your sense of how a bungled attempt at Friendliness compares to other things humans might do, in terms of how likely an AI would be to consider it a threat?
Fairly low. But that’s because I don’t think the first AIs are likely to be built by people trying to guarantee Friendliness. If a FriendlyAI proponent tries to rush to get done before another team could finish it could be a much bigger risk.
OK.
For my part, if I think about things people might do that might cause a powerful AI to feel threatened and thereby have significantly bad results, FAI theory and implementation not only doesn’t float to the top of the list, it’s hardly even visible in the hypothesis space (unless, as here, I privilege it inordinately by artificially priming it).