AprilSR comments on An Impossibility Proof Relevant to the Shutdown Problem and Corrigibility

AprilSR 2 May 2023 21:56 UTC
7 points
Even if shutdown in particular isn’t something we want an agent to be indifferent to, I think being able to make an agent indifferent to something is very plausibly useful for designing it to be corrigible?