When the phenomenon was first noticed (in GPT-2, IIRC), the leading hypothesis was disjoint training sets for the tokenizer and the language model: the anomalous tokens either don't appear in the LM's training corpus at all or are poorly represented there. It would be strange if this were still the case.
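A toy sketch of what that hypothesis predicts: scan the tokenizer's vocabulary for entries that never occur in the LM's training text. The vocab and corpus below are made up for illustration (the real check would run over the actual tokenizer vocab and a sample of the training data); the function name is mine, not from any library.

```python
# Toy illustration of the "disjoint training sets" hypothesis:
# find vocabulary entries that never occur in the LM's training text.
# The vocab and corpus here are invented for demonstration purposes.

def undertrained_tokens(vocab, corpus_text):
    """Return vocab entries that never appear as substrings of corpus_text."""
    return sorted(t for t in vocab if t not in corpus_text)

vocab = {"the", "cat", "sat", "SolidGoldMagikarp", "petertodd"}
corpus = "the cat sat on the mat"

print(undertrained_tokens(vocab, corpus))
# prints ['SolidGoldMagikarp', 'petertodd']
```

Tokens flagged this way would get essentially no gradient signal during LM training, which is one story for why their embeddings (and the model's behavior on them) end up anomalous.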