Charlie Steiner comments on Thinking about maximization and corrigibility

Charlie Steiner 23 Apr 2023 12:48 UTC
LW: 11 AF: 6
You might be interested in Reducing Goodhart. I’m a fan of “detecting and avoiding internal Goodhart,” and I claim that’s a reflective version of the value learning problem.