This is also my intuition. I think we’d need a better conceptual picture of corrigibility to say anything confident about this topic though.
To the extent there is agreement about the merits of developing a better conceptual picture of corrigibility, it seems like we should just work on that rather than trying to reconcile intuitions. If there is disagreement about the importance of improving our picture of corrigibility, that’s more likely to be worth reconciling.
This is also my intuition. I think we’d need a better conceptual picture of corrigibility to say anything confident about this topic though.
To the extent there is agreement about the merits of developing a better conceptual picture of corrigibility, it seems like we should just work on that rather than trying to reconcile intuitions. If there is disagreement about the importance of improving our picture of corrigibility, that’s more likely to be worth reconciling.