Maybe we can define wireheading as a subset of Goodharting, along the lines of what you're proposing.
However, we need the extra assumption that setting the reward to its maximal value is not what we actually desire; the reward function is part of the world, just as the AI is.
Yes, that is what I meant.