sairjy comments on GPT-4

sairjy 18 Mar 2023 20:47 UTC
4 points
Yeah, agreed. I think it would make sense that it's trained on 10x-20x the number of tokens GPT-3 saw, so around 3-5T tokens (2x-3x Chinchilla), and that would give around 200-300B parameters given those scaling laws.
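
For anyone who wants to check the arithmetic, here is a rough back-of-the-envelope sketch. It assumes GPT-3 saw about 300B training tokens and uses the common "about 20 tokens per parameter" reading of the Chinchilla result (70B parameters on 1.4T tokens). The function name and the constant ratio are illustrative assumptions, not anything stated in the comment, and the real scaling-law fit is not a fixed ratio.

```python
# Back-of-the-envelope check of the token/parameter estimate.
# Assumptions (not from the comment itself):
#   - GPT-3 was trained on roughly 300B tokens
#   - Chinchilla-style rule of thumb: ~20 training tokens per parameter
GPT3_TOKENS = 300e9
CHINCHILLA_TOKENS = 1.4e12
TOKENS_PER_PARAM = 20.0


def compute_optimal_params(tokens, tokens_per_param=TOKENS_PER_PARAM):
    """Parameter count implied by a fixed tokens-per-parameter ratio."""
    return tokens / tokens_per_param


# 10x-20x GPT-3's token count, expressed in trillions of tokens.
for multiple in (10, 20):
    tokens = multiple * GPT3_TOKENS
    print(f"{multiple}x GPT-3 tokens = {tokens / 1e12:.1f}T tokens")

# The 3-5T token range from the comment, converted to parameters.
for tokens in (3e12, 5e12):
    params = compute_optimal_params(tokens)
    ratio_to_chinchilla = tokens / CHINCHILLA_TOKENS
    print(
        f"{tokens / 1e12:.0f}T tokens = {ratio_to_chinchilla:.1f}x Chinchilla, "
        f"about {params / 1e9:.0f}B parameters at 20 tokens/param"
    )
```

With a strict 20:1 ratio the 3-5T token range comes out nearer 150-250B parameters, so the 200-300B figure corresponds to a somewhat lower tokens-per-parameter ratio (roughly 15-17). Either way it lands in the same ballpark.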