Walter Laurito comments on Try training token-level probes