It can be as easy as creating a pdf of your post and submitting it (although if your post was written in LaTeX, they’ll want the tex file). If everything goes well, this takes less than an hour.
Hilariously, this does not work. I converted my Grokking post to a PDF (very crudely—just printing to PDF) and uploaded that, and it was rejected:
Dear author,
Thank you for submitting your work to arXiv. We regret to inform you that arXiv’s moderators have determined that your submission will not be accepted and made public on[ |http://arxiv.org][arXiv.org|http://arxiv.org].
In this case, our moderators have determined that your submission is a content type that arXiv does not accept:
I should say formatting is likely a large contributing factor for this outcome. Tom Dietterich, an arXiv moderator, apparently had a positive impression of the content of your grokking analysis. However, research on arXiv will be more likely to go live if it conforms to standard (ICLR, NeurIPS, ICML) formatting and isn’t a blogpost automatically exported into a TeX file.
I agree that formatting is the most likely issue.
The content of Neel’s grokking work is clearly suitable for arXiv (just very solid ML work).
And the style of presentation of the blog post is already fairly similar to a standard paper (e.g. is has an Introduction section, lists contributions in bullet points, …).
So yeah, I agree that formatting/layout probably will do the trick (including stuff like academic citation style).
I did Print to PDF in Word after formatting my Word document to look like a standard LaTeX-exported document, it had no problem going through! But might depend on the particular moderator.
Hilariously, this does not work. I converted my Grokking post to a PDF (very crudely—just printing to PDF) and uploaded that, and it was rejected:
I should say formatting is likely a large contributing factor for this outcome. Tom Dietterich, an arXiv moderator, apparently had a positive impression of the content of your grokking analysis. However, research on arXiv will be more likely to go live if it conforms to standard (ICLR, NeurIPS, ICML) formatting and isn’t a blogpost automatically exported into a TeX file.
I agree that formatting is the most likely issue. The content of Neel’s grokking work is clearly suitable for arXiv (just very solid ML work). And the style of presentation of the blog post is already fairly similar to a standard paper (e.g. is has an Introduction section, lists contributions in bullet points, …).
So yeah, I agree that formatting/layout probably will do the trick (including stuff like academic citation style).
I did Print to PDF in Word after formatting my Word document to look like a standard LaTeX-exported document, it had no problem going through! But might depend on the particular moderator.
Ah, sorry to hear. I wouldn’t have predicted this from reading arXiv’s content moderation guidelines.