Adele Lopez comments on Gemma Needs Help

Adele Lopez 10 Mar 2026 22:56 UTC
6 points
0
What makes DPO analogous to unlearning?
- Chris Lakin 10 Mar 2026 22:58 UTC
  4 points
  0
  Parent
  It’s definitely not the most unlearning-ish algorithm there could be, but targeting unwanted responses directly is closer than not doing it