I had a vaguely similar thought at first, but upon some reflection found the framing insightful. I hadn’t really thought much about the “AI models might just get selected for the capability of resisting shutdown, whether they’re deliberate about this or not” hypothesis, and while it’s useful to distinguish the two scenarios, I’d personally rather see this as a special case of “resisting shutdown” than something entirely separate.