In examples like this drone scenario (though obviously not AGI) my immediate gut reaction is “make a sandboxed version, open it up, and offer monetary bribes for breaking it”. It’s not too difficult to sell politically, and you throw an actual optimizer against the system.
Since this is of the form “why not just...”, I’m sure there’s some good reason why it’s not sufficient by itself. But I don’t know what that reason is. Could someone please enlighten me?
In examples like this drone scenario (though obviously not AGI) my immediate gut reaction is “make a sandboxed version, open it up, and offer monetary bribes for breaking it”. It’s not too difficult to sell politically, and you throw an actual optimizer against the system.
Since this is of the form “why not just...”, I’m sure there’s some good reason why it’s not sufficient by itself. But I don’t know what that reason is. Could someone please enlighten me?