🤔 I think a challenge with testing the corrigibility of AI is that currently no AI system is capable of running autonomously. It’s always dependent on humans to decide to host it and query it, so you can always just e.g. pour a bucket of water on the computer running the query script to stop it. Of course force-stopping the AI may be economically unfavorable for the human, but that’s not usually considered the main issue in the context of corrigibility.
I usually find it really hard to imagine how the world will look like once economically autonomous AIs become feasible. If they become feasible, that is—while there are obviously places today where with better AI technology, autonomous AIs would be able to outcompete humans, it’s not obvious to me that autonomous AIs wouldn’t also be outcompeted by centralized human-controlled AIs. (After all, it could plausibly be more efficient to query some neural network running in a server somewhere than to bring the network with you on a computer to wherever the AI is operating, and in this case you could probably economically decouple running the server from running the AI.)
🤔 I think a challenge with testing the corrigibility of AI is that currently no AI system is capable of running autonomously. It’s always dependent on humans to decide to host it and query it, so you can always just e.g. pour a bucket of water on the computer running the query script to stop it. Of course force-stopping the AI may be economically unfavorable for the human, but that’s not usually considered the main issue in the context of corrigibility.
I usually find it really hard to imagine how the world will look like once economically autonomous AIs become feasible. If they become feasible, that is—while there are obviously places today where with better AI technology, autonomous AIs would be able to outcompete humans, it’s not obvious to me that autonomous AIs wouldn’t also be outcompeted by centralized human-controlled AIs. (After all, it could plausibly be more efficient to query some neural network running in a server somewhere than to bring the network with you on a computer to wherever the AI is operating, and in this case you could probably economically decouple running the server from running the AI.)