This still seems confusing to me. Rohin says that the model is overtrained (not something like “prior approaches overtrained on limited data”), so it seems like he’s talking about the parameters and not the data.
Yeah, I meant undertrained. I’ve fixed it now.