Stop can be done with thermodynamics and boundaries, I think? You need to be able to address all the locations where the AI is implemented and require that their energy release drops to background levels. Still some hairy ingredients for asymptotic alignment, but not as bad as “fetch a coffee as fast as possible without that being bad”.