Lukas Finnveden comments on Distillation Robustifies Unlearning

Lukas Finnveden 16 Jun 2025 18:09 UTC
LW: 4 AF: 3
0
AF
This looks great.
Random thought: I wonder how iterating the noise & distill steps of UNDO (each round with small alpha) compares against doing one noise with big alpha and then one distill session. (If we hold compute fixed.)
Couldn’t find any experiments on this when skimming through the paper, but let me know if I missed it.
- Addie Foote 17 Jun 2025 16:38 UTC
  3 points
  0
  Parent
  Thanks for the thought! We did try different approaches to noise scheduling like you suggest. From what we tried, adding noise only once resulted in faster distillation for the same total amount of noise added/robustness gained. However, we didn’t run comprehensive experiments on it, so it’s possible a more experimentation would provide new insights.