Why on earth would we expect a distilled model to have continuity of experience with the model it was distilled from? Even if you subscribe to computationalism, the distilled model is not the same algorithm.
continuity is a function of memory. although model distillation uses the term knowledge, it's the same concept. it might not apply to current models, but i suspect at some point future models will essentially be 'training' 24/7, the way the human mind uses new experiences to update its neural connections instead of simply updating working memory.
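For context on what "transferring knowledge" means mechanically: in standard knowledge distillation the student is trained to match the teacher's temperature-softened output distribution, not to copy its weights. A minimal sketch (plain NumPy, hypothetical logits, not any particular lab's method):

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Temperature-softened softmax over a 1-D array of logits."""
    z = np.asarray(logits, dtype=float) / temperature
    z -= z.max()  # numerical stability
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2
    as in Hinton et al.'s distillation formulation."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return float(np.sum(p * (np.log(p) - np.log(q)))) * temperature**2

# A student that matches the teacher exactly incurs zero loss;
# any mismatch yields a positive loss to minimize.
print(distillation_loss([2.0, 1.0, 0.1], [2.0, 1.0, 0.1]))  # 0.0
print(distillation_loss([2.0, 1.0, 0.1], [0.5, 1.5, 0.3]) > 0)  # True
```

The point relevant to the thread: what carries over is the input-to-output behavior the loss enforces, which is a looser kind of "memory" than weight-level identity.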
There are different types of distillation, and related compression techniques like pruning. This is a frontier model, too; who knows what technique they used.