ProgramCrafter comments on SAE on activation differences

ProgramCrafter 1 Jul 2025 19:00 UTC
1 point
0
Have you tried to compare the base model at different checkpoints (or even between versions) yet?
- Santiago Aranguri 8 Jul 2025 6:12 UTC
  1 point
  0
  Parent
  This is definitely a promising next direction. One lesson from working on the diff between chat and base is that the difference is not ‘localized’ enough: chat and base have too many differences. Taking checkpoints that are closer together can improve on this.