This comment feels like it’s confusing strategies with goals? That is, I wouldn’t normally think of “exploration” as something that an agent had as a goal but as a strategy it uses to achieve its goals. And “let’s try out a different utility function for a bit” is unlikely to be a direction that a stable agent tries exploring in.
This comment feels like it’s confusing strategies with goals? That is, I wouldn’t normally think of “exploration” as something that an agent had as a goal but as a strategy it uses to achieve its goals. And “let’s try out a different utility function for a bit” is unlikely to be a direction that a stable agent tries exploring in.