This seems right to me. But maybe the drift in distribution mainly affects one set of parameters, while the divergence tokens affect a separate set (in early layers), such that the downstream effect persists even once the model is out of distribution.
From my understanding of this paper, lottery tickets are invariant to the optimiser, data type, and other model properties (in this experimental setting), suggesting lottery tickets encode some basic properties of the task.
It seems unlikely that lottery tickets grounded in fundamental task properties would change under continual learning without other problems (e.g. catastrophic forgetting) emerging.
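For readers unfamiliar with the procedure, here is a minimal sketch of how a lottery ticket mask can be extracted via magnitude pruning with weight rewinding, in the spirit of Frankle & Carbin (2019). The toy model, pruning fraction, and helper names (`magnitude_mask`, `apply_mask`) are my own illustrative assumptions, not the setup of the paper under discussion.

```python
# Sketch: one-shot magnitude pruning + rewind-to-init, the basic lottery
# ticket recipe. Model/prune_frac are illustrative, not from the paper.
import torch
import torch.nn as nn

def magnitude_mask(model: nn.Module, prune_frac: float = 0.8) -> dict:
    """Return {param_name: 0/1 mask} keeping the largest-magnitude weights."""
    masks = {}
    for name, p in model.named_parameters():
        if p.dim() < 2:  # skip biases/norms, as is conventional
            continue
        k = max(1, int(p.numel() * prune_frac))
        # k-th smallest magnitude serves as the pruning threshold
        threshold = p.detach().abs().flatten().kthvalue(k).values
        masks[name] = (p.detach().abs() > threshold).float()
    return masks

def apply_mask(model: nn.Module, masks: dict) -> None:
    """Zero out pruned weights in place; reapply after each optimiser step."""
    with torch.no_grad():
        for name, p in model.named_parameters():
            if name in masks:
                p.mul_(masks[name])

# Usage: train, prune, rewind to the original init, then retrain the
# sparse subnetwork ("winning ticket").
model = nn.Sequential(nn.Linear(784, 300), nn.ReLU(), nn.Linear(300, 10))
init_state = {k: v.clone() for k, v in model.state_dict().items()}
# ... train `model` on the task here ...
masks = magnitude_mask(model, prune_frac=0.8)
model.load_state_dict(init_state)  # rewind weights to initialisation
apply_mask(model, masks)           # the winning-ticket subnetwork
```

The point of the rewind step is that the mask alone (which weights survive), paired with the original initialisation, is what the "ticket" consists of; if that mask really encodes task structure, it is plausible it would stay stable across optimisers and data types as the comment above suggests.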