Torture in 1 world where the Evil AI is released, but removing more than dust specks in the remaining worlds? Just give me the parchment that I can sign with my blood! :D
And by the way, how will we check whether the produced code is really a friendly AI?
We can’t. And even an AI with no terminal values in other branches will still want to control them in order to increase utility in the branch it does through various indirect means, such as conterfactual trade, if that’s cheap, which it will be in any setup a human can think of.
Torture in 1 world where the Evil AI is released, but removing more than dust specks in the remaining worlds? Just give me the parchment that I can sign with my blood! :D
And by the way, how will we check whether the produced code is really a friendly AI?
We can’t. And even an AI with no terminal values in other branches will still want to control them in order to increase utility in the branch it does through various indirect means, such as conterfactual trade, if that’s cheap, which it will be in any setup a human can think of.