I don’t have a very strong view about the distinction between corrigibility to the user and corrigibility to some other definition of value (e.g. a hypothetical version of the user who is more secure).
I don’t understand this statement, in part because I have little idea what “corrigibility to some other definition of value” means, and in part because I don’t know why you bring up this distinction at all, or what a “strong view” here might be about.
I don’t understand this statement, in part because I have little idea what “corrigibility to some other definition of value” means, and in part because I don’t know why you bring up this distinction at all, or what a “strong view” here might be about.