Neel Nanda comments on Fabien’s Shortform

Neel Nanda 23 Mar 2025 21:06 UTC
LW: 3 AF: 2
0
AF
Are the joint names separated by spaces if not, the tokenization is going to be totally broken more generally I would be interested to see this Tried with a code that EG maps familiar tokens to obscure ones or something like mapping token with id k to id maximum minus K. Tokens feel like the natural way in llm would represent its processing and thus encoded processing. Doing things in individual letters is kind of hard
- Fabien Roger 24 Mar 2025 12:23 UTC
  LW: 2 AF: 2
  0
  AF Parent
  They were separated by spaces. (But I’d encourage replication before updating too hard on results which I think are very weird.)