Wei Dai comments on The strategy-stealing assumption

Wei Dai 24 Sep 2019 4:09 UTC
LW: 2 AF: 2

I don’t have a very strong view about the distinction between corrigibility to the user and corrigibility to some other definition of value (e.g. a hypothetical version of the user who is more secure).

I don’t understand this statement, in part because I have little idea what “corrigibility to some other definition of value” means, and in part because I don’t know why you bring up this distinction at all, or what a “strong view” here might be about.