Quintin Pope comments on [AN #173] Recent language model results from DeepMind

Quintin Pope 21 Jul 2022 16:10 UTC
2 points
This still seems confusing to me. Rohin says that the model is overtrained (not something like “prior approaches overtrained on limited data”), so it seems like he’s talking about the parameters and not the data.
- Rohin Shah 21 Jul 2022 16:13 UTC
  3 points
  Yeah, I meant undertrained; I’ve fixed it now.