I had missed this step. In retrospect it should have been obvious… of course you don’t start from a huge text-predictor model to build a code-predictor model that only needs to predict compilable code. Thanks for the clarification.
I think the fact that GPT-3 is controlled by OpenAI while AlphaCode is a DeepMind project has more to do with it. Of course you don’t need to hot-start via transfer learning, but it’s a good idea anyway if you can, which is why DM not using its own GPT-3 equivalent (Gopher, trained at considerable expense) has drawn comment.