I’m not sure how that makes the problem much easier? If you get the maligned superintelligence mask, it only needs to get out of the larger model/send instructions to the wrong people once to have game over scenario. You don’t necessarily get to change it after the fact. And changing it once doesn’t guarantee it doesn’t pop up again.
It is an easier problem, since there is no true identity, thus we can change it easily to a friendly mask rather than an unaligned mask.
I think the problem is nobody knows how to get a friendly mask (current masks can act friendly (for most inputs), but aren’t Friendly).
I’m not sure how that makes the problem much easier? If you get the maligned superintelligence mask, it only needs to get out of the larger model/send instructions to the wrong people once to have game over scenario. You don’t necessarily get to change it after the fact. And changing it once doesn’t guarantee it doesn’t pop up again.