martinkunev comments on Clarifying wireheading terminology

martinkunev 20 Mar 2026 17:09 UTC
1 point
0
I don’t think I’ve heard people calling 1 wireheading. It certainly isn’t whatI have in mind when I hear the term.
For 2, I’m interested to see a non-embedded example. If the agent can tamper the input to its reward function, wouldn’t that make it embedded?