Is this the same thing as catastrophic forgetting?

From page 6 of the paper:

Ungrokking can be seen as a special case of catastrophic forgetting (McCloskey and Cohen, 1989; Ratcliff, 1990), where we can make much more precise predictions. First, since ungrokking should only be expected once D′ < Dcrit, if we vary D′ we predict that there will be a sharp transition from very strong to near-random test accuracy (around Dcrit). Second, we predict that ungrokking would arise even if we only remove examples from the training dataset, whereas catastrophic forgetting typically involves training on new examples as well. Third, since Dcrit does not depend on weight decay, we predict the amount of “forgetting” (i.e. the test accuracy at convergence) also does not depend on weight decay.

(All of these predictions are then confirmed in the experimental section.)
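To make the first prediction concrete, here is a minimal toy sketch (my illustration, not code from the paper): it encodes the claimed step-like relationship between the reduced training-set size D′ and test accuracy at convergence. The function name and the placeholder accuracy levels `chance` and `strong` are my own assumptions; the paper's actual experiments train a network, whereas this only expresses the predicted shape of the curve.

```python
def predicted_test_accuracy(d_prime: int, d_crit: int,
                            chance: float = 0.01, strong: float = 0.99) -> float:
    """Predicted test accuracy after continued training on a dataset of
    size d_prime: near-perfect above D_crit, near-random (ungrokking)
    below it. A sharp transition, not a gradual decline."""
    return strong if d_prime >= d_crit else chance

# Sweeping D' across a hypothetical D_crit = 500 shows the predicted
# sharp transition; note weight decay does not appear as a parameter,
# matching the third prediction.
curve = [predicted_test_accuracy(d, d_crit=500) for d in range(0, 1001, 100)]
```

The point of the sketch is what is *absent*: no new training examples (only removal) and no weight-decay dependence, which is what distinguishes these predictions from generic catastrophic forgetting.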