In principle, this agent could create a less time-bounded subagent, right? It’s not clear where the incentive to do so would come from, but it doesn’t appear to be disincentivised in the same way as remaining on. (More generally, it seems like it can exploit channels for future influence, of which subagents are a special case.)
Maybe what I’m trying to say is that it looks like a lot of the magic has to be in the shutdown function (this might be a problem for other agent shutdown proposals too).
…actually, maybe not. If the reward is only for satisfying the needs of another agent with a worse ability to see the future, then maybe the time-bounded agent’s future influence is limited by the dumber agent’s ability to see the future.