Cool direction and results!
Can we prevent such an agent from having a preference to create agents that do resist shutdown?
EDIT: And if they’re going to create agents anyway, actually make sure those agents don’t resist shutdown, too, rather than, say, being indifferent about whether those other agents resist shutdown.
What decisions do you think this has affected and what would you estimate the differences in outcomes to be as a result? Or, say, the most important impacts?