When choosing between policies that have the same actions, I prefer the policies that are simpler.
Could you elaborate on this? I feel like there’s a tension between “which policy is computationally simpler for me to execute in the moment?” and “which policy is more easily predicted by the agents around me?”, and it’s not obvious which one you should be optimizing for. [Like, predictions about other diachronic people seem more durable / easier to make, and so are easier to calculate and plan around.] Or maybe the ‘simple’ approaches for one metric are generally simple on the other metric.
My feeling is that I don’t have a strong difference between them. In general simpler policies are both easier to execute in the moment and also easier for others to simulate.
The clearest version of this is to, when faced with a decision, decide on an existing principle to apply before acting, or else define a new principle and act on this.
Principles are examples of short policies, which are largely path-independent, which are non-narrative, which are easy to execute, and are straightforward to communicate and be simulated by others.
Could you elaborate on this? I feel like there’s a tension between “which policy is computationally simpler for me to execute in the moment?” and “which policy is more easily predicted by the agents around me?”, and it’s not obvious which one you should be optimizing for. [Like, predictions about other diachronic people seem more durable / easier to make, and so are easier to calculate and plan around.] Or maybe the ‘simple’ approaches for one metric are generally simple on the other metric.
My feeling is that I don’t have a strong difference between them. In general simpler policies are both easier to execute in the moment and also easier for others to simulate.
The clearest version of this is to, when faced with a decision, decide on an existing principle to apply before acting, or else define a new principle and act on this.
Principles are examples of short policies, which are largely path-independent, which are non-narrative, which are easy to execute, and are straightforward to communicate and be simulated by others.