But having an AI that circumvents its own utility function would be evidence of poor utility function design.
By circumvent, do you mean something like “wireheading”, i.e. some specious satisfaction of the utility function that involves behavior that is both unexpected and undesirable, or do you also include modifications to the utility function? The former meaning would make your statement a tautology, and the latter would make it highly non-trivial.
I mean it in the tautological sense. I try to refrain from stating highly non-trivial things without extensive explanations.