TurnTrout comments on Distillation Robustifies Unlearning

TurnTrout 23 Jun 2025 16:45 UTC
In other words, “using unlearning techniques like GradDiff/MaxEnt during pretraining” might be a really powerful technique.
I have a cached thought that this was found to disrupt overall capabilities / make learning harder, but I don’t have a reference on hand.