cousin_it comments on Want to predict/explain/control the output of GPT-4? Then learn about the world, not about transformers.

cousin_it 16 Mar 2023 19:55 UTC
2 points
0
I guess yeah. The more general point is that AIs get good at something when they have a lot of training data for it. Have many texts or pictures from the internet = learn to make more of these. So to get a real world optimizer you “only” need a lot of real world reinforcement learning, which thankfully takes time.

It’s not so rosy though. There could be some shortcut to get lots of training data, like AlphaZero’s self play but for real world optimization. Or a shortcut to extract the real world optimization powers latent in the datasets we already have, like “write a conversation between smart people planning to destroy the world”. Scary.