I guess yeah. The more general point is that AIs get good at something when they have a lot of training data for it. Have many texts or pictures from the internet = learn to make more of these. So to get a real world optimizer you “only” need a lot of real world reinforcement learning, which thankfully takes time.
It’s not so rosy though. There could be some shortcut to get lots of training data, like AlphaZero’s self play but for real world optimization. Or a shortcut to extract the real world optimization powers latent in the datasets we already have, like “write a conversation between smart people planning to destroy the world”. Scary.
I guess yeah. The more general point is that AIs get good at something when they have a lot of training data for it. Have many texts or pictures from the internet = learn to make more of these. So to get a real world optimizer you “only” need a lot of real world reinforcement learning, which thankfully takes time.
It’s not so rosy though. There could be some shortcut to get lots of training data, like AlphaZero’s self play but for real world optimization. Or a shortcut to extract the real world optimization powers latent in the datasets we already have, like “write a conversation between smart people planning to destroy the world”. Scary.