Injecting some keywords: this field of study is known as program equilibrium. Previous LW post on the subject, with links.
Edit: Can you explain how you decided on the parts of the payoff matrix involving “other”? These seem quite important as they affect the viability of strategies based on either convincing your opponent not to halt or not halting yourself.
The payoffs for “other” were designed so that neither failing to halt, nor convincing the other player not to halt, should ever be a worthwhile strategy. If you don’t halt, it gives you the same payoff as if you had cooperated, and gives the other player the same payoff as if you had defected. That way, not halting should be strictly dominated by defecting, since you are better off if you defect, and the other player should react the same way to each threat. And tricking the other player into not halting is also a bad idea, since the payoff you get from it is the same as if they defected.
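As a rough sketch of the rule described above (the concrete payoff numbers T=5, R=3, P=1, S=0 are my assumption; the thread doesn't give values):

```python
# Standard PD payoffs -- assumed values, not taken from the tournament spec.
T, R, P, S = 5, 3, 1, 0

PD = {  # (my move, their move) -> my payoff
    ("C", "C"): R, ("C", "D"): S,
    ("D", "C"): T, ("D", "D"): P,
}

def payoff(me, them):
    """My payoff when I play `me` and the opponent plays `them`.
    'O' marks a player that failed to halt."""
    me_eff = "C" if me == "O" else me        # a non-halter is scored as if it cooperated...
    them_eff = "D" if them == "O" else them  # ...and is treated as a defector by the other player
    return PD[(me_eff, them_eff)]
```

With these (assumed) numbers the two claims check out: defecting strictly dominates not halting (`payoff("D", x) > payoff("O", x)` for every opponent move `x`), and an opponent who fails to halt pays you exactly what a defector would (`payoff(m, "O") == payoff(m, "D")`).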
Your game world implements an “enthusiastic consent” policy
(Defect, non-halt) is actually better than (Defect, Defect) for you, since it gives you a relative advantage over competitors in the tournament.
True, but I still don’t expect it will be a big problem. If there are a lot of submissions, the effect will be small, and if it is paying enough attention to your source code for it to be possible to trick it into not halting, then it is probably looking for a way to achieve mutual cooperation, so tricking it is still not a good strategy.
If you trust all sufficiently smart players to try to induce (Defect, non-halt) if (Defect, Defect) is otherwise inevitable, the effect adds up over a hopefully significant portion of the submissions.
Handy introductory article: Computation and the Prisoner’s Dilemma.
TIL that ethnic hatred and tribalism is a Nash (folk) equilibrium.
Make sure the equality comparison only depends on things that affect functionality—i.e. it will declare any functionally equivalent programs equal even if they use different variable names or something.
(Yes, I know that’s reducible to the halting problem; in practice, you’ll use a computable, polynomial-time approximation for it that will inevitably have to throw out equivalent programs that are too complex or otherwise too ‘clever’.)
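One crude under-approximation of that kind of comparison — catching only renamed variables, nowhere near full functional equivalence — could canonicalize identifiers before comparing syntax trees. A hypothetical sketch using Python's `ast` module:

```python
import ast

class Canonicalize(ast.NodeTransformer):
    """Rename every variable and argument to a canonical name in order of
    first appearance, so alpha-equivalent programs compare equal."""
    def __init__(self):
        self.names = {}

    def _canon(self, name):
        if name not in self.names:
            self.names[name] = f"v{len(self.names)}"
        return self.names[name]

    def visit_Name(self, node):
        return ast.copy_location(ast.Name(id=self._canon(node.id), ctx=node.ctx), node)

    def visit_arg(self, node):
        node.arg = self._canon(node.arg)
        return node

def alpha_equal(src_a, src_b):
    """True iff the two programs are identical up to variable renaming.
    Only a tiny slice of functional equality -- any structural difference,
    however semantically irrelevant, makes this return False."""
    dump = lambda src: ast.dump(Canonicalize().visit(ast.parse(src)))
    return dump(src_a) == dump(src_b)
```

Anything cleverer — reordered statements, different but equivalent control flow — falls through to "not equal," which is exactly the kind of conservative failure the parenthetical predicts.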
Patrick discusses this issue here in some depth.
It’s quite likely that the optimal behaviour should be different in case the program is functionally equal but not exactly equal.
If you’re playing yourself, then you want to cooperate.
If you’re playing someone else, then you’d want to cooperate if and only if that someone else is smart enough to check whether you’ll cooperate; but if its decision doesn’t depend on yours, then you should defect.
You only need to evaluate the equivalence of the first two lines of the program, by the way. It cooperates with those who can’t not cooperate with it if it goes through that branch of the logic, and does something else to everyone else.
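The shape of the program being described is the classic "clique" strategy. A minimal sketch (function and variable names are mine, and the real submission would be in Scheme):

```python
def clique_bot(my_source: str, opponent_source: str) -> str:
    # Branch 1: the mutual-cooperation clique. Any opponent whose source
    # passes this same test will take this same branch against us, so it
    # only matches programs that "can't not cooperate" with us here.
    if opponent_source == my_source:  # or a normalized/functional comparison
        return "C"
    # Branch 2: everyone else -- defect, or substitute any other fallback.
    return "D"
```

Hence the point above: only the test-and-cooperate branch needs to be checked for equivalence; what a candidate program does in its "everyone else" branch never fires against a fellow clique member.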
Can you write that in Scheme so I can submit this? Thanks
So as not to duplicate efforts: I have emailed Moshe Tennenholtz and Michael Wooldridge with invitations to play.