gwern comments on Extrapolating GPT-N performance