shminux comments on Causal decision theory is unsatisfactory

shminux 13 Sep 2014 19:28 UTC
15 points
I suspect that CDT seems not suitable for Newcomb-like problems because it tends to be applied to non-existent outcomes. If the outcome is not in the domain, you should not be calculating its utility. In the PD example CD and DC are not valid outcomes for clones. Similarly, two-boxing and getting $1001k is not a valid outcome in Newcomb. If you prune the decision tree of imaginable but non-existing branches before applying a decision theory, many differences between CDT and EDT tend to go away.
- lackofcheese 14 Sep 2014 2:13 UTC
  14 points
  Parent
  Moreover, if you prune the decision tree of all branches bar one then all decision algorithms will give the same (correct) answer!
  
  It’s totally OK to add a notion of pruning in, but you can’t really say that your decision algorithm of “CDT with pruning” makes sense unless you can specify which branches ought to be pruned, and which ones should not. Also, outright pruning will often not work; you may only be able to rule out a branch as highly improbable rather than altogether impossible.
  
  In other words, “pruning” as you put it is simply the same thing as “recognizing logical connections” in the sense that So8res used in the above post.
  - private_messaging 14 Sep 2014 12:45 UTC
    3 points
    Parent
    Well, a decision theory presumably is applied to some model of the physics, so that your agent can for example conclude that jumping out of a 100th floor window would result in it hitting ground at a high velocity. Finding that a hypothetical outcome is physically impossible would fall within the purview of the model of physics.
- cousin_it 16 Sep 2014 16:18 UTC
  4 points
  Parent
  As you know, I’m interested in decision theories that work in completely deterministic worlds. What does “pruning” mean if only one outcome is logically possible?
  - shminux 16 Sep 2014 16:38 UTC
    2 points
    Parent
    Not one, multiple. For example In Newcomb’s you can still choose to one-box (you get $1M) or two-box (you get $1k). However, “two-box and $1001000” is not in the problem domain at all, just like killing the predictor and grabbing all its riches isn’t. Similarly, if you play a game of, say, chess, there are valid moves and invalid moves. When designing a chess program you don’t need to worry about an opponent making an invalid move. In the cloned PD example CD and DC are invalid moves. If an algorithm (decision theory) cannot filter them out automatically, you have to prune the list of all moves until only valid moves are left before applying it. I am surprised that this trivial observation is not completely obvious.
    - cousin_it 16 Sep 2014 21:52 UTC
      4 points
      Parent
      The problem is that, for a deterministic decision algorithm running in a deterministic world, only one outcome actually happens. If you want to define a larger set of “logically possible” outcomes, I don’t see a difference in principle between the outcome where your decision algorithm returns something it doesn’t actually return, and the outcome where 1=2 and pumpkins fall from the sky.
      
      You might say that outcomes are “possible” or “impossible” from the agent’s point of view, not absolutely. The agent must run some “pruning” algorithm, and the set of “possible” outcomes will be defined as the result of that. But then the problem is that the set of “possible” outcomes will depend on how exactly the “pruning” works, and how much time the agent spends on it. With all the stuff about self-fulfilling proofs in UDT, it might be possible to have an agent that hurts itself by overzealous “pruning”.
      - shminux 16 Sep 2014 22:18 UTC
        5 points
        Parent
        I must be missing something. Suppose you write a chess program. The part of it which determines which moves are valid is separate from the part which decides which moves are good. Does a chess bot not qualify as a “deterministic decision algorithm running in a deterministic world”?
        
        Or is the issue that there is an uncertainty introduced by the other player? Then how about a Rubik cube solver? Valid moves are separate from the moves which get you close to the final state. You never apply your optimizer to invalid moves, which is exactly what CDT does in Newcomb’s.