With your forbearance, I’ll set up the problem in the notation of my write-up of UTD1.
There is only one world-program P in this problem. The world-program runs the UTD1 algorithm twice, feeding it input “1” on one run, and feeding it input “2” on the other run. I’ll call these respective runs “Run1” and “Run2”.
The set of inputs for the UTD1 algorithm is X = {1, 2}.
The set of outputs for the UTD1 algorithm is Y = {A, B}.
There are four possible execution histories for P:
E, in which Run1 outputs A, Run2 outputs A, and each gets $0.
F, in which Run1 outputs A, Run2 outputs B, and each gets $10.
G, in which Run1 outputs B, Run2 outputs A, and each gets $10.
H, in which Run1 outputs B, Run2 outputs B, and each gets $0.
The utility function U for the UTD1 algorithm is defined as follows:
U(E) = 0.
U(F) = 20.
U(G) = 20.
U(H) = 0.
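To make the setup concrete, here is a minimal sketch of the problem data in Python (the names X, Y, histories, and U are my own choices for illustration; each history records the pair of outputs from Run1 and Run2):

```python
# Sketch of the problem setup: inputs, outputs, execution histories, utilities.
X = {1, 2}          # inputs to the UTD1 algorithm
Y = {"A", "B"}      # outputs of the UTD1 algorithm

# Each history maps to (Run1's output, Run2's output).
histories = {
    "E": ("A", "A"),
    "F": ("A", "B"),
    "G": ("B", "A"),
    "H": ("B", "B"),
}

# Utility of each history: $10 per run when the outputs differ, $0 otherwise.
U = {"E": 0, "F": 20, "G": 20, "H": 0}
```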
Now we want to choose a mathematical intuition function M so that Run1 and Run2 don’t give the same output. This mathematical intuition function does have to satisfy a couple of constraints:
For each choice of input x in X and output y in Y, the function M(x, y, –) must be a normalized probability distribution on {E, F, G, H}.
The mathematical intuition needs to meet certain minimal standards to deserve its name. For example, we need to have M(1, B, E) = 0. The algorithm should know that P isn’t going to execute according to E if the algorithm returns B on input 1.
But these constraints still leave us with enough freedom in how we set up the mathematical intuition. In particular, we can set
M(1, A, F) = 1, and all other values of M(1, A, –) equal to zero;
M(1, B, H) = 1, and all other values of M(1, B, –) equal to zero;
M(2, A, H) = 1, and all other values of M(2, A, –) equal to zero;
M(2, B, F) = 1, and all other values of M(2, B, –) equal to zero.
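The mathematical intuition function above can be sketched as a lookup table keyed on (input, output) pairs; the final loop checks the normalization constraint (the representation as nested dicts is my own, not part of the UTD1 write-up):

```python
# M(x, y, -) as a distribution over the histories {E, F, G, H}.
M = {
    (1, "A"): {"E": 0.0, "F": 1.0, "G": 0.0, "H": 0.0},
    (1, "B"): {"E": 0.0, "F": 0.0, "G": 0.0, "H": 1.0},
    (2, "A"): {"E": 0.0, "F": 0.0, "G": 0.0, "H": 1.0},
    (2, "B"): {"E": 0.0, "F": 1.0, "G": 0.0, "H": 0.0},
}

# Constraint 1: each M(x, y, -) is a normalized probability distribution.
for dist in M.values():
    assert abs(sum(dist.values()) - 1.0) < 1e-9

# Constraint 2 (sanity check): M(1, B, E) = 0, since E has Run1 outputting A.
assert M[(1, "B")]["E"] == 0.0
```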
Thus, in Run1, the algorithm computes that, if it outputs A, then execution history F would transpire, so the agent would get utility U(F) = 20. But if Run1 outputs B, then H would transpire, yielding utility U(H) = 0. Therefore, Run1 outputs A.
Similarly, Run2 computes that its outputting A would result in H, with utility 0, while outputting B would result in F, with utility 20. Therefore, Run2 outputs B.
Hence, execution history F transpires, and the algorithm reaps $20.
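The two runs’ computations can be sketched as a single expected-utility maximization under M (the function name decide is my own; the data repeats the tables above for self-containedness):

```python
# How each run picks its output: maximize expected utility under M.
U = {"E": 0, "F": 20, "G": 20, "H": 0}
M = {
    (1, "A"): {"E": 0.0, "F": 1.0, "G": 0.0, "H": 0.0},
    (1, "B"): {"E": 0.0, "F": 0.0, "G": 0.0, "H": 1.0},
    (2, "A"): {"E": 0.0, "F": 0.0, "G": 0.0, "H": 1.0},
    (2, "B"): {"E": 0.0, "F": 1.0, "G": 0.0, "H": 0.0},
}

def decide(x):
    """Return the output y maximizing sum over h of M(x, y, h) * U(h)."""
    return max(("A", "B"), key=lambda y: sum(M[(x, y)][h] * U[h] for h in U))

run1 = decide(1)  # "A": outputting A points to F with utility 20, B to H with 0
run2 = decide(2)  # "B": outputting B points to F with utility 20, A to H with 0
```

With run1 = "A" and run2 = "B", history F indeed transpires, for a total utility of 20.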
I think that Vladimir is right if he is saying that UTD1 can handle the problem in your Explicit Optimization of Global Strategy post.