Reason #79 why language models will be hard to train: one of the webpages in your dataset is just a couple of forum comments and then 60000 repetitions of “lesswrong cult.”
Reason #79 why language models will be hard to train: one of the webpages in your dataset is just a couple of forum comments and then 60000 repetitions of “lesswrong cult.”