“Random goals” is a crux. Complicated goals that we can’t control well enough to prevent takeover are not necessarily uniformly random goals from whatever space you have in mind.