If we are talking about goal definition evaluating AI (and Paul was probably thinking in the context of some sort of indirect normativity), “control” seems like a reasonable fit. The primary philosophical issue for that part of the problem is decision theory.
(I agree that it’s a bad term for referring to FAI itself, if we don’t presuppose a method of solution that is not Friendliness-specific.)
If we are talking about goal definition evaluating AI (and Paul was probably thinking in the context of some sort of indirect normativity), “control” seems like a reasonable fit. The primary philosophical issue for that part of the problem is decision theory.
(I agree that it’s a bad term for referring to FAI itself, if we don’t presuppose a method of solution that is not Friendliness-specific.)