I think “you can’t get the coffee if you’re dead” goes through just as well for “you can’t get the coffee if you’re preserved in a vault”; don’t you?
https://www.alignmentforum.org/posts/wnzkjSmrgWZaBa2aC/self-preservation-or-instruction-ambiguity-examining-the
Yeah. Here’s a case where the self-preservation behavior is driven not by an intrinsic self-preservation drive but by task accomplishment.
I’d imagine an AI given a task would similarly resist being vaulted, since being vaulted would interfere with completing the task.