I think such a system where an AI in a sandboxed and airgapped environment is tasked with achieving a state where it is shut down at least once, while trying to overcome the guardrails and restrictions might prove quite usefil in finding out the weakspots in our barriers and then improving them.
I think such a system where an AI in a sandboxed and airgapped environment is tasked with achieving a state where it is shut down at least once, while trying to overcome the guardrails and restrictions might prove quite usefil in finding out the weakspots in our barriers and then improving them.