I’m mostly just rambling about things that are missing entirely. Basically, I’m referring, respectively, to ‘Don’t explode the gas main to blow the people out of the burning building’, ‘Don’t wirehead’, and ‘How do you utilitarianism?’.
I understand. And if/when we crack those philosophical problems in a sufficiently general way, we will still be left with the technical problem of “how do we represent the relevant parts of reality, and what we want out of it, in a computable form so the AI can find the optimum?”
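To make that concrete with a deliberately broken toy (every name, number, and dynamic below is invented for illustration, not anyone’s actual proposal): once the computable representation is too coarse, the optimizer lands exactly on the gas-main answer from above.

```python
# A deliberately broken toy, not a proposal: all names and numbers here are
# hypothetical. The state only tracks "alive" and "on fire", so the
# optimizer happily picks the gas-main option.
from dataclasses import dataclass


@dataclass(frozen=True)
class WorldState:
    # Hypothetical stand-in for "the relevant parts of reality".
    people_alive: int
    building_on_fire: bool


def utility(state: WorldState) -> float:
    # Hypothetical stand-in for "what we want out of it": the fire
    # penalty outweighs five lives, which is the misspecification.
    return state.people_alive - (10.0 if state.building_on_fire else 0.0)


def predict(state: WorldState, action: str) -> WorldState:
    # Hypothetical world model mapping actions to predicted outcomes.
    if action == "evacuate":
        # People get out, but this representation cannot say "out and
        # safe", so the state still scores badly while the building burns.
        return WorldState(state.people_alive, building_on_fire=True)
    if action == "explode_gas_main":
        # Fire extinguished, everyone dead: scores 0, the best on offer.
        return WorldState(0, building_on_fire=False)
    return state


def choose(state: WorldState, actions: list[str]) -> str:
    # "So the AI can find the optimum": argmax over predicted utilities.
    return max(actions, key=lambda a: utility(predict(state, a)))


print(choose(WorldState(people_alive=5, building_on_fire=True),
             ["wait", "evacuate", "explode_gas_main"]))
# -> explode_gas_main
```

The philosophy tells you that answer is wrong; the remaining technical work is a state and utility representation rich enough that the argmax stops landing on it.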
What is the distinction between these and A.2?
What are those, what is the minimum set of capabilities within that space that are needed for our goals, and why are they needed?
What is it, and why is it needed?
Is there any distinction, for the purposes of writing a world-saving AI?
If there is, it implies that the two will sometimes give conflicting answers. Is that something we would want to happen?