1stuserhere comments on A framing for interpretability

1stuserhere 15 Nov 2023 12:42 UTC
2 points
0
It’s also worth noting that LLMs do not learn directly from the raw input stream but from a compressed form of it: the models are fed tokenized data, and the tokenizer acts as a compressor. This benefits the models by giving them a more information-rich context, since each position in the context window carries more of the underlying text.
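To make the "tokenizers act as compressors" point concrete, here is a minimal sketch of a toy byte-pair-encoding style tokenizer. It is an illustration only, not the tokenizer of any particular LLM: the function names (`train_bpe`, `apply_merge`, `encode`) and the merge budget are my own assumptions. It repeatedly merges the most frequent adjacent pair of symbols, so the same text ends up represented by fewer tokens without losing any information.

```python
from collections import Counter


def apply_merge(tokens, a, b):
    """Replace every adjacent (a, b) pair in tokens with the merged symbol a + b."""
    out = []
    i = 0
    while i < len(tokens):
        if i + 1 < len(tokens) and tokens[i] == a and tokens[i + 1] == b:
            out.append(a + b)
            i += 2
        else:
            out.append(tokens[i])
            i += 1
    return out


def train_bpe(text, num_merges):
    """Learn up to num_merges merges from text (toy, character-level BPE).

    Returns the ordered list of merges and the tokenized training text.
    """
    tokens = list(text)  # start from single characters
    merges = []
    for _ in range(num_merges):
        pairs = Counter(zip(tokens, tokens[1:]))
        if not pairs:
            break
        (a, b), count = pairs.most_common(1)[0]
        if count < 2:  # nothing repeats any more, so merging gains nothing
            break
        merges.append((a, b))
        tokens = apply_merge(tokens, a, b)
    return merges, tokens


def encode(text, merges):
    """Tokenize text by replaying the learned merges in order."""
    tokens = list(text)
    for a, b in merges:
        tokens = apply_merge(tokens, a, b)
    return tokens


if __name__ == "__main__":
    sample = "the cat sat on the mat. " * 4
    merges, tokens = train_bpe(sample, num_merges=30)
    print(len(sample), "characters ->", len(tokens), "tokens")
```

Joining the tokens back together recovers the original string exactly, so the compression here is lossless; the gain is that a context window of fixed length in tokens spans more raw text.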
- Fergus Fettes 27 Nov 2023 17:10 UTC
  1 point
  0
  Would you say that tokenization is part of the architecture?
  And, in your wildest moments, would you say that language is also part of the architecture :)? I mean, the latent space is probably mapping either a) brain states or b) world states, right? Is everything between latent spaces architecture?