Not my area of expertise, but I’m given to understand that training an AI “in the box” can itself be dangerous. While training GPT-3, you’re running an optimization algorithm that’s trying to minimize a cost function. If that optimizer is powerful enough, it might do things you don’t want (break out of the box) in order to lower the value of the function. Being in an air-gapped or internet-isolated environment might prevent damage, but I don’t think it prevents the optimizer from developing in ways that could be harmful.
Not my area of expertise, but I’m given to understand that training an AI “in the box” can itself be dangerous. While training GPT-3, you’re running an optimization algorithm that’s trying to minimize a cost function. If that optimizer is powerful enough, it might do things you don’t want (break out of the box) in order to lower the value of the function. Being in an air-gapped or internet-isolated environment might prevent damage, but I don’t think it prevents the optimizer from developing in ways that could be harmful.
I think you’re reading way too much into a simple minim-finding script: https://github.com/kingoflolz/mesh-transformer-jax/blob/master/device_train.py