Well, he estimated the expected effect on risk as insignificant increase of risk. That is to me the strong point; the ‘does not reduce’ is a weak version prone to eliciting Pascal’s wager type response.
I am >.9 confident that donating money to SI doesn’t significantly increase human existential risk.
(Edit: Which, on second read, I guess means I agree with Holden as you summarize him here. At least, the difference between “A doesn’t significantly affect B” and “A insignificantly affects B” seems like a difference I ought not care about.)
I also think Pascal’s Wager type arguments are silly. More precisely, given how unreliable human intuition is when dealing with very low probabilities and when dealing with very large utilities/disutilities, I think lines of reasoning that rely on human intuitions about very large very-low-probability utility shifts are unlikely to be truth-preserving.
On that, I’m pretty sure that the SI would not rush that way. Consider the parable of the dragon. This isn’t the story of someone who’s willing to cut corners, but of someone who accepts that delays for checking, even delays that cause people to die, are necessary.
Plus, if they develop a clear enough architecture, so one can query what the AI is thinking, then one would be able to see potential future failures while still in testing, without having to have those contingencies actually occur. That will be one of the keys, I think. Make the AI’s reasons something that we can follow, even if we couldn’t generate those arguments on a reasonable time-frame.
Well, he estimated the expected effect on risk as insignificant increase of risk. That is to me the strong point; the ‘does not reduce’ is a weak version prone to eliciting Pascal’s wager type response.
I am >.9 confident that donating money to SI doesn’t significantly increase human existential risk.
(Edit: Which, on second read, I guess means I agree with Holden as you summarize him here. At least, the difference between “A doesn’t significantly affect B” and “A insignificantly affects B” seems like a difference I ought not care about.)
I also think Pascal’s Wager type arguments are silly. More precisely, given how unreliable human intuition is when dealing with very low probabilities and when dealing with very large utilities/disutilities, I think lines of reasoning that rely on human intuitions about very large very-low-probability utility shifts are unlikely to be truth-preserving.
Why do you want to know?
On that, I’m pretty sure that the SI would not rush that way. Consider the parable of the dragon. This isn’t the story of someone who’s willing to cut corners, but of someone who accepts that delays for checking, even delays that cause people to die, are necessary.
Plus, if they develop a clear enough architecture, so one can query what the AI is thinking, then one would be able to see potential future failures while still in testing, without having to have those contingencies actually occur. That will be one of the keys, I think. Make the AI’s reasons something that we can follow, even if we couldn’t generate those arguments on a reasonable time-frame.