> Seeking pleasure as a goal is goodharting.
I think this frame is subtly off. Something more like: using what feels good as evidence about what sort of thing you are.
Or, dis-identifying with a part of yourself often hides a tacit, stronger identification with a different part of yourself, and is thus part of your ongoing war against yourself. This process is unavoidable; most of us have to pass through building a spiritual identity on the way towards a more widespread dis-identification.
Yeah, that’s part of it, plus having some model uncertainty about thinking of major parts of your system as a reinforcement learner.