Oliver Sourbut comments on Goodhart’s Law in Reinforcement Learning

Oliver Sourbut 17 Oct 2023 21:03 UTC
2 points
0
I still think this is an important point, and I’ve been thinking there should be a bloggy write-up of the maths in this area on LW/AF! Maybe you (or I, or Jacek, or Charlie, or Joar, or whoever...) could make that happen.

The original EPIC definition, and the STARC defs, can be satisfied while yielding only a pseudometric on the quotient space. But they also include many full (quotient) metrics, and the (kinda default?) L2 choice (assuming full-support weighting) yields a full metric.