Misha Ramendik comments on Narrow Misalignment is Hard, Emergent Misalignment is Easy

Misha Ramendik 12 Oct 2025 0:13 UTC
1 point
0
Wait, is this the solution to catastrophic forgetting in fine-tuning? I mean your KL regularisation math.