Donald Hobson comments on Idealized Agents Are Approximate Causal Mirrors (+ Radical Optimism on Agent Foundations)

Donald Hobson 31 Dec 2023 11:19 UTC
LW: 2 AF: 1
0
AF
You are making the structure of time into a fundamental part of your agent design, not a contingency of physics.
Let an aput be an input or an output. Let an policy be a subset of possible aputs. Some policies are physically valid.
Ie a policy must have the property that, for each input, there is a single output. If the computer is reversible, the policy must be a bijection from inputs to outputs. If the computer can create a contradiction internally, stopping the timeline, then a policy must be a map from inputs to at most one output.
If the agent is actually split into several pieces with lightspeed and bandwidth limits, then the policy mustn’t use info it can’t have.
But these physical details don’t matter.
The agent has some set of physically valid policies, and it must pick one.