I think a working continual learning implementation would interfere with convergence-based results, which assume a relatively fixed training data distribution and vary only the starting seeds. This is mostly because a continual learning system is constantly drifting “off the base distribution” as it incorporates new data. In other words, the car model has seen data from places and distributions the attacker’s base model never will.
This seems right to me. But maybe the drift in distribution mainly affects certain parameters, while the divergence tokens affect a separate set of parameters (in early layers), such that the downstream effect still persists even once the model has drifted out of distribution.
From my understanding of this paper, lottery tickets are invariant to optimiser, data type, and other model properties (in this experimental setting), suggesting that lottery tickets encode some basic properties of the task.
It seems unlikely that lottery tickets grounded in fundamental task properties would change under continual learning without other problems (e.g. catastrophic forgetting) emerging.
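For readers less familiar with the lottery ticket setup the argument relies on: a “ticket” is just a sparse mask over the original initialisation, usually found by training to convergence and keeping the largest-magnitude weights. The sketch below is a minimal, simplified illustration (one prune round rather than iterative magnitude pruning); the function name, the `train_fn` hook, and the hyperparameters are placeholders, not code from the paper under discussion.

```python
# Minimal sketch of lottery-ticket extraction via a single round of
# magnitude pruning. Assumes a PyTorch model; `train_fn` (supplied by the
# caller) trains the model to convergence. Hypothetical helper, not the
# paper's actual code.
import copy
import torch
import torch.nn as nn

def find_winning_ticket(model: nn.Module, train_fn, prune_fraction: float = 0.8):
    """Return (masks, init_state): sparse per-layer masks from magnitude
    pruning plus the original initialisation, which together define the
    'ticket' that gets retrained from scratch."""
    init_state = copy.deepcopy(model.state_dict())  # save starting weights

    train_fn(model)  # train to convergence (caller supplies the loop)

    masks = {}
    for name, param in model.named_parameters():
        if param.dim() < 2:  # prune only weight matrices, skip biases/norms
            continue
        flat = param.detach().abs().flatten()
        k = int(prune_fraction * flat.numel())
        # threshold below which weights are pruned; keep the top (1 - p) by magnitude
        threshold = flat.kthvalue(k).values if k > 0 else flat.min() - 1
        masks[name] = (param.detach().abs() > threshold).float()

    return masks, init_state
```

The point relevant to the thread: if the mask (and hence the ticket) is largely determined by the task rather than by optimiser choice or data type, then a continual-learning update that substantially reshaped those masks would plausibly also be disrupting the task-level structure they encode, which is where the catastrophic-forgetting worry comes from.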