gwern comments on Distinguishing test from training