What if you program the AI so that, once a question is asked, it can no longer take outside information into account when answering, and it assumes its hardware will behave as specified? It won’t build an unfriendly oracle, because it can’t ask it questions. It won’t upgrade its own hardware, because that’s only useful insofar as it makes the hardware act differently from how it’s currently specified to act. It won’t even waste resources guarding against run-time errors, because it’s assuming none will occur. The only way it can waste resources is by taking too long to answer, and you could stop that just by giving it a time limit.
It won’t give a very good answer if you don’t give it enough data, but you could just plug in more data and ask again. Just hope nobody asks it how to find the data needed to answer some question, because its answer will likely amount to “build an unsafe oracle and ask it.”
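A toy sketch in Python of the control pattern being described (not, of course, an actual AI): the oracle copies its knowledge store the moment a question arrives, reasons only over that frozen snapshot, and is cut off by a hard deadline. `FrozenOracle` and `candidate_answers` are hypothetical names standing in for the real reasoning process; “plugging in more data and asking again” is just mutating the store between questions.

```python
import time
from typing import Any, Dict, Iterator


def candidate_answers(question: str, snapshot: Dict[str, Any]) -> Iterator[Any]:
    # Hypothetical stand-in for the actual reasoning process: yields
    # successively refined answers, computed only from the snapshot.
    yield snapshot.get(question, "insufficient data")


class FrozenOracle:
    def __init__(self, knowledge_base: Dict[str, Any]):
        # Mutable store; may be updated with more data between questions.
        self.knowledge_base = knowledge_base

    def ask(self, question: str, time_limit_s: float = 10.0) -> Any:
        # Freeze: copy the knowledge base at ask-time, so nothing that
        # arrives afterwards can influence this answer.
        snapshot = dict(self.knowledge_base)
        deadline = time.monotonic() + time_limit_s

        answer = None
        for better in candidate_answers(question, snapshot):
            if time.monotonic() >= deadline:
                break  # hard time limit bounds resource use
            answer = better
        return answer


oracle = FrozenOracle({"sky color": "blue"})
print(oracle.ask("mass of moon"))  # -> "insufficient data"
oracle.knowledge_base["mass of moon"] = "7.3e22 kg"
print(oracle.ask("mass of moon"))  # -> "7.3e22 kg", after adding data and asking again
```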
This seems to me to be an instance of a more general idea: building an AI that is not a pure consequentialist. It can clearly be done; humans aren’t pure consequentialists. The trick is making it consequentialist enough to be intelligent without making it so consequentialist that it’s dangerous. The margin between those two points could be very wide, but there may be no easy way to find out short of building the AI and seeing whether you crossed it.