I don’t think I’ve heard people calling 1 wireheading. It certainly isn’t whatI have in mind when I hear the term.
For 2, I’m interested to see a non-embedded example. If the agent can tamper the input to its reward function, wouldn’t that make it embedded?
I don’t think I’ve heard people calling 1 wireheading. It certainly isn’t whatI have in mind when I hear the term.
For 2, I’m interested to see a non-embedded example. If the agent can tamper the input to its reward function, wouldn’t that make it embedded?