It appears that Qwen 2.5's tokenizer forces single-digit tokenization of numbers (there are only 2 multi-digit integer tokens in the entire vocabulary, and both are two-digit integers: [77150, '10'] and [80091, '20']). I assume that GPT-4.1 nano uses the o200k tokenizer, which includes all integers up to 999 as single tokens. Is it possible that this played a major role in the lack of information transfer between different models? Have you tried using models with equivalent integer tokenizations?
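For anyone who wants to verify this, here is a minimal sketch comparing how the two tokenizers split multi-digit numbers. The `o200k_base` encoding name and the `Qwen/Qwen2.5-7B` checkpoint id are my assumptions about what the respective models actually use:

```python
# Sketch: compare number tokenization between o200k (assumed for GPT-4.1 nano)
# and Qwen 2.5 (assumed checkpoint: Qwen/Qwen2.5-7B).
import tiktoken
from transformers import AutoTokenizer

o200k = tiktoken.get_encoding("o200k_base")          # assumed GPT-4.1-family encoding
qwen = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-7B")  # assumed Qwen 2.5 checkpoint

for n in ["7", "42", "473", "999"]:
    gpt_ids = o200k.encode(n)
    qwen_ids = qwen.encode(n, add_special_tokens=False)
    # Expectation: o200k yields 1 token for numbers up to 999; Qwen yields 1 token per digit.
    print(f"{n}: o200k -> {len(gpt_ids)} token(s), Qwen 2.5 -> {len(qwen_ids)} token(s)")
```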
We observe a lack of transfer between GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, which use the same tokenizer. The other authors may have takes on the specific question you raise, but it's generally possible to distill skills from one model to another with a different tokenizer.