Vanessa Kosoy comments on Vanessa Kosoy’s Shortform

Vanessa Kosoy 5 Nov 2021 10:24 UTC
LW: 3 AF: 1
0
AF
One of the postulates of infra-Bayesianism is the maximin decision rule. Given a crisp infradistribution $Θ$ , it defines the optimal action to be:

$a^{*} (Θ) := arg max a min μ \in Θ E_{μ} [U (a)]$

Here $U$ is the utility function.

What if we use a different decision rule? Let $t \in [0, 1]$ and consider the decision rule

$a_{t}^{*} (Θ) := arg max a (t min μ \in Θ E_{μ} [U (a)] + (1 - t) max μ \in Θ E_{μ} [U (a)])$

For $t = 1$ we get the usual maximin (“pessimism”), for $t = 0$ we get maximax (“optimism”) and for other values of $t$ we get something in the middle (we can call “ $t$ -mism”).

It turns out that, in some sense, this new decision rule is actually reducible to ordinary maximin! Indeed, set

$μ_{t}^{*} := arg max μ E_{μ} [U (a_{t}^{*})]$

$Θ_{t} := t Θ + (1 - t) μ_{t}^{*}$

Then we get

$a^{*} (Θ_{t}) = a_{t}^{*} (Θ)$

More precisely, any pessimistically optimal action for $Θ_{t}$ is $t$ -mistically optimal for $Θ$ (the converse need not be true in general, thanks to the arbitrary choice involved in $μ_{t}^{*}$ ).

To first approximation it means we don’t need to consider $t$ -mistic agents since they are just special cases of “pessimistic” agents. To second approximation, we need to look at what the transformation of $Θ$ to $Θ_{t}$ does to the prior. If we start with a simplicity prior then the result is still a simplicity prior. If $U$ has low description complexity and $t$ is not too small then essentially we get full equivalence between “pessimism” and $t$ -mism. If $t$ is small then we get a strictly “narrower” prior (for $t = 0$ we are back at ordinary Bayesianism). However, if $U$ has high description complexity then we get a rather biased simplicity prior. Maybe the latter sort of prior is worth considering.
What links here?
- Cole Wyeth 24 Mar 2025 18:08 UTC
  LW: 4 AF: 3
  0
  AF Parent
  This called a Hurwicz decision rule / criterion (your t is usually alpha).
  I think the content of this argument is not that maxmin is fundamental, but rather that simplicity priors “look like” or justify Hurwicz-like decision rules. Simple versions of this are easy to prove but (as far as I know) do not appear in the literature.
  - Vanessa Kosoy 25 Mar 2025 8:23 UTC
    LW: 2 AF: 2
    0
    AF Parent
    Thanks for this!
    What I was saying up there is not a justification of Hurwicz’ decision rule. Rather, it is that if you already accept the Hurwicz rule, it can be reduced to maximin, and for a simplicity prior the reduction is “cheap” (produces another simplicity prior).
    Why accept the Hurwicz’ decision rule? Well, at least you can’t be accused of a pessimism bias there. But if you truly want to dig deeper, we can start instead from an agent making decisions according to an ambidistribution, which is a fairly general (assumption-light) way of making decisions. I believe that a similar argument (easiest to see in the LF-dual cramble set representation) would allow reducing that to maximin on infradistributions (credal sets).
    To make such an approach even more satisfactory, it would be good to add a justification for a simplicity ambi/infra-prior. I think this should be possible by arguing from “opinionated agents”: the ordinary Solomonoff prior is the unique semicomputable one that dominates all semicomputable measures, which decision-theoretically corresponds to something like “having preferences about as many possible worlds as we can”. Possibly, the latter principle formalized can be formalized into something which ends up picking out an infra-Solomonoff prior (and, replacing “computability” by a stronger condition, some other kind of simplicity infra-prior).