Rohin Shah comments on Equilibrium and prior selection problems in multipolar deployment

Rohin Shah 6 Apr 2020 3:19 UTC
LW: 2 AF: 2
0
AF
I think there are theorems to be proven, just not of the form “there is an optimal thing to do”
I meant one thing and wrote another; I just meant to say that there weren’t theorems in this post.
If the CAIS view multi-agent setups like this could be inevitable.
My point is just that “prior / equilibrium selection problem” is a subset of the “you don’t know everything about the other player” problem, which I think you agree with?
It’s also, to a first approximation, the strategy society takes in lots of situations, this happens whenever people form teams with a common goal. There are usually processes of re-negotiating the goal, but between these times of conflict people gain a lot of efficiency by working together and punishing deviation.
I’m not sure how this relates to the thing I’m saying (I’m also not sure if I understood it).
- MichaelDennis 8 Apr 2020 1:22 UTC
  LW: 1 AF: 1
  0
  AF Parent
  My point is just that “prior / equilibrium selection problem” is a subset of the “you don’t know everything about the other player” problem, which I think you agree with?
  I see two problems: one of trying to coordinate on priors, and one of trying to deal with having not successfully coordinated. I think that which is easier depends on the problem: if we’re applying it to CAIS, HRI or a multipolar scenario. Sometimes it’s easier to coordinate on a prior before hand, sometimes it’s easier to be robust to differing priors, and sometimes you have to go for a bit of both. I think it’s reasonable to call both solution techniques to the “prior / equilibrium selection problem”, but the framings shoot for different solutions, both of which I view as necessary sometimes.
  
  The strategy of agreeing on a joint welfare function is already a heuristic and isn’t an optimal strategy; it feels very weird to suppose that initially a heuristic is used and then we suddenly switch to pure optimality.
  I don’t really know what you mean by this. Specifically I don’t know from who’s perspective it isn’t optimal and under what beliefs.
  A few things to point out:
  - The strategy of agreeing on a joint welfare function and optimizing it is an optimal strategy for some belief in infinitely iterated settings (because there is a folk theorem so almost everything is an optimal strategy for some belief)
  - Since we’re currently making norms for these interactions, we are currently designing these beliefs. This means that we can make it be the case that having that belief is justified in future deployments.
  - If we want to talk about “optimality” in terms of “equilibria selection procedures” or “coordination norms” we have to have a metric to say some outcomes are “better” than others. This is not a utility function for the agents, but for us as the norm designers. Social welfare seems good for this.