I have already presented this to Abram Demski, and he and I have been working together on trying to prove my conjecture. (We are both in Los Angeles and happen to be interested in the same question, so it is likely to be the direction that the MIRIxLosAngeles workshop continues to focus on.)
Your proposal is equivalent to Abram’s proposal, and we believe the two distributions are not the same. I think we checked this on some small finite analogue.
Your “general” setting does not seem that much more general to me; it seems pretty much identical, only reworded in terms of set theory instead of logic. There is one way in which it is more general: in my system, the collection of subsets must be closed under union, intersection, and complement, which need not hold for a general collection of subsets. However, my construction does not work without this assumption. I actually use the fact that \mu is nowhere zero, and not being closed under union, intersection, and complement is kind of like having some sets with measure 0.
I think the language of subsets instead of logic makes things a little easier for some people to think about. I think I prefer the logic language. (However, when I wanted to think about small finite examples, I would sometimes switch to the subset language instead.)
Firstly, I’d love to see the counterexample showing that the two distributions are not the same.
Secondly, are you sure that \mu being nowhere zero is essential? Intuitively, your uniqueness result should work whenever, for every two models M1 and M2, there is a sentence \phi separating them with \mu(\phi) non-zero. But I haven’t checked this formally.
At the very least, my conjecture is not true if \mu is not nowhere zero, which was enough for me to ignore that case, because (see my response to cousin_it) what I actually believe is that there are three very different definitions that all give the same distribution, which I think makes the distribution stand out a lot more as a good idea. Also, if \mu is sometimes zero, we lose uniqueness, because we don’t know what to do with sets that our Bayes score does not care about. The fact that we can do whatever we want with these sets also takes away coherence (although maybe we could artificially require coherence). I really don’t want to do that, because the whole reason I like this approach so much is that it didn’t require coherence; coherence came out for free.
Well, for example, if Si contains only one set A, Abram’s procedure will conclude we are in that set with probability 1, while I will think we are in it with probability 1/2. Now, you could require that every sentence has the same \mu as its negation (corresponding to putting the sentence or its negation in with probability 1/2 in Abram’s procedure). In that case, partition X into three sets A, B, and C, where the “in A or not in A” question is given weight muA, and similarly define muB and muC.
Let muA = 1/2, muB = 1/4, and muC = 1/4.
Abram’s procedure will: with probability 1/4, choose A first; with probability 1/8, choose B first; with probability 1/8, choose C first; with probability 1/8, choose not-A first and then choose B; with probability 1/8, choose not-A first and then choose C; with probability 1/16, choose not-C first and end up with B; with probability 1/16, choose not-B first and end up with C; and with probability 1/8, choose not-B or not-C first and end up with A. In the end, P(A) = 1/4 + 1/16 + 1/16 = .375.
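The enumeration above can be checked exactly with a short script. This is just my sketch, assuming Abram’s procedure works as described here: repeatedly sample one of the three “in A/B/C?” questions with probability proportional to its mu, each polarity getting half that weight; resample claims that are contradictory or already settled; stop once a single cell remains.

```python
from fractions import Fraction

# Weights from the example: muA = 1/2, muB = muC = 1/4.
mu = {"A": Fraction(1, 2), "B": Fraction(1, 4), "C": Fraction(1, 4)}

def final_dist(possible):
    """Exact distribution over which cell the procedure ends in,
    starting from the given set of still-possible cells."""
    if len(possible) == 1:
        return {next(iter(possible)): Fraction(1)}
    # Collect claims ("in S" / "not in S") that actually shrink the set;
    # contradictory or redundant claims are resampled, so we renormalize.
    moves = []  # (weight, resulting possible-set)
    for cell, w in mu.items():
        inside = frozenset({cell}) & possible
        outside = possible - {cell}
        if inside and inside != possible:     # "in cell" is informative
            moves.append((w / 2, inside))
        if outside and outside != possible:   # "not in cell" is informative
            moves.append((w / 2, outside))
    total = sum(p for p, _ in moves)
    dist = {}
    for p, nxt in moves:
        for cell, q in final_dist(nxt).items():
            dist[cell] = dist.get(cell, Fraction(0)) + (p / total) * q
    return dist

dist = final_dist(frozenset({"A", "B", "C"}))
print(dist)  # P(A) = 3/8 = .375, P(B) = P(C) = 5/16
```

Running this reproduces the case analysis above exactly: P(A) = 3/8 and P(B) = P(C) = 5/16.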
Notice that Abram’s solution gives a different Bayes score when in set A than when in the other two sets. Mine will not: my Bayes score gives P(A) the probability p for which the Bayes score is constant across models:
1/2 log p + 1/4 log(1 - (1-p)/2) + 1/4 log(1 - (1-p)/2) = 1/2 log(1-p) + 1/4 log((1-p)/2) + 1/4 log(1 - (1-p)/2)
2 log p + log(1 - (1-p)/2) = 2 log(1-p) + log((1-p)/2)
p^2 (1 - (1-p)/2) = (1-p)^2 ((1-p)/2)
p^2 (1+p) = (1-p)^3
p ≈ .39661
If you check this value of p, you should see that the Bayes score is independent of the model.
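As a numerical sanity check (my own sketch, not part of the original argument), the script below solves p^2(1+p) = (1-p)^3 by bisection and verifies that the Bayes score when the true cell is A matches the score when it is B (C is symmetric):

```python
import math

# f(p) = p^2 (1+p) - (1-p)^3 is increasing on (0, 1) with f(0) < 0 < f(1),
# so bisection finds the unique root.
def f(p):
    return p**2 * (1 + p) - (1 - p)**3

lo, hi = 0.0, 1.0
for _ in range(60):
    mid = (lo + hi) / 2
    lo, hi = (lo, mid) if f(mid) > 0 else (mid, hi)
p = (lo + hi) / 2
print(round(p, 5))  # 0.39661

# Bayes score when the true cell is A versus when it is B,
# with P(A) = p and P(B) = P(C) = (1 - p)/2.
score_A = 0.5 * math.log(p) + 0.5 * math.log(1 - (1 - p) / 2)
score_B = (0.5 * math.log(1 - p)
           + 0.25 * math.log((1 - p) / 2)
           + 0.25 * math.log(1 - (1 - p) / 2))
print(abs(score_A - score_B) < 1e-9)  # True
```

The two scores agree at the root, confirming that this p makes the Bayes score model-independent.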