The claim isn’t that minds are safe and nice by default. It’s that they’re not sociopaths.
I thought one of the tenets of this debate was that there’s no in-between: either safe and nice (aligned) or everybody dies (not aligned). Humans are a good example: most are not pure psychopaths, and yet they do a ton of harm to each other all the time, and have threatened to destroy the species for decades. A set of much more powerful minds with even that level of misalignment would be a disaster, and if they’re slightly worse than humans, so much the worse.