Should we maximize the Geometric Expectation of Utility?
Consequentialists (including utilitarians) claim that the goodness of an action should be judged based on the goodness of its consequences. The word utility is often used to refer to the quantified goodness of a particular outcome. When the consequences of an action are uncertain, it is often taken for granted that consequentialists should choose the action which has the highest expected utility. The expected utility is the sum of the utilities of each possible outcome, weighted by their probability. For a lottery which gives outcome utilities with respective probabilities , the expected utility is:
There are several good reasons to use the maximization of expected utility as a normative rule. I’ll talk about some of them here, but I recommend Joe Carlsmith’s series of posts ‘On Expected Utility’ as a good survey.
Here, I’m going to consider what ethical decisions might look like if we instead chose to maximize the geometric expectation of utility (which I’ll also refer to as the geometric average), as given by the formula:
I’m going to look at a few reasons why the maximizing the geometric expectation of utility is appealing and some other reasons why it is less appealing.
Geometric Expectation Logarithmic Utility
I want to to get this out of the way before starting.
Maximizing the geometric expectation is mathematically equivalent to maximizing the expected value of the logarithm of utility[1]. This leads some people to use ‘geometric averaging’ and ‘logarithmic utility’ interchangeably. I don’t like this and I’ll explain why. First: just because they are equivalent mathematically, this doesn’t mean that they encode the same intuitions (as Scott Garrabrant writes: “you wouldn’t define as ” even though they give the same result). Writing the geometric expectation emphasises that wherever two terms are added in the expected value, they are multiplied in the geometric expectation.
Second: there are two ‘variables’ at play here: the utility function (which assigns a utility to each outcome) and the averaging method (which is used to decide between lotteries with uncertain outcomes). If Alice and Bob agree on the utilities of each outcome (ie. they have the same utility function) but Alice chooses to maximize expected utility and Bob chooses to maximize the geometric expectation, they will behave differently. It seems weird to say that Bob is really just maximizing logarithmic utility, since he and Alice both agreed on their utility functions beforehand. Choosing a utility function and choosing an averaging method (or another way of deciding in uncertain situations) are two different decisions that shouldn’t be smuggled together.
Finally: I prefer the fact that geometric averaging can more easily deal with outcomes of zero utility. Log(0) is not well-defined, but is. There are ways around this, but I find the geometric averaging approach more intuitive.
While reading this, I encourage you to view the geometric expectation as a different way of averaging and deciding between uncertain outcomes, not just a different utility function. If you find yourself thinking about the differences between geometric expected utility and expected utility in terms of utility functions, remind yourself that, for any non-negative utility function, one can choose either averaging method.
I’ll now survey some arguments for and against using the geometric expectation in ethical decision-making, compared to the expected value.
Arguments for using the Geometric Expectation
The Time-averaged Growth rate
Maximizing the geometric average is the same a maximizing the time-averaged multiplicative growth rate of utility.
If your initial utility is then getting a utility is the same as multiplying your utility by a factor . If a lottery has a set of utility payoffs with corresponding probabilities then you can equivalently view it as a lottery where the payoffs are given in terms of multipliers of your initial utility. Imagine repeating this lottery so that each time it is repeated, your current utility is multiplied by with probability . If is your utility after N repetitions of the lottery, then the average factor that your utility grows by each repetition is . If is the number of times that outcome occurs, then in the limit of large : . The time-averaged growth rate in the limit of an infinite number of repetitions is therefore:
Thus, maximizing the geometric average of utility can be view maximizing the time-averaged growth rate of your utility, if a lottery is repeated multiplicatively. Sometimes, a lottery might have a positive expected value, but a time-averaged growth rate of less than one, (see this footnote[2] for Ole Peters oft-repeated coin toss example of this phenomenon).
The Kelly Criterion
The Kelly Criterion is a strategy for sizing bets in gambles which is widely used by professional sports bettors and investors. It is equivalent to sizing bets such that they maximize the geometric expectation. It has been shown that a Kelly-bettor will, in the long-run, outperform bettors using any other ‘essentially different’ strategy (including expected value maximization) for sizing their bets. In particular the ratio between the bankroll of an agent using the Kelly strategy and the bankroll of an agent using a different strategy, this ratio will tend to infinity as the number of repeated bets tends to infinity. This holds even when the odds of the gambles change each repetition. See Kelly’s paper here or this paper by Edward Thorpe for proof of this, and some other, similar claims.
If you assume utility is unbounded, these arguments in favour of Kelly betting also count as arguments in favour of geometric expectation maximization. If you care about utility, then choosing the strategy which will, in the long run get you more utility than other strategies, then this is a pretty compelling reason to use that strategy. If utility is bounded, then the proofs are not as strong , but they still approximately hold in situations where the current utility is small compared to the maximum possible utility.
Intuitions around Extinction
Suppose you are a total utilitarian and you believe that the earth and the lives of its inhabitants is net positive. You also believe that there is no other life in the universe. Would you accept a 51% chance of creating a new, fully populated earth if there was a 49% chance of destroying this earth[3]? Setting aside considerations about the suffering involved if earth disappeared, an expected value-maximizer is obliged to accept this gamble. A geometric expectation maximizer is not. Personally, geometric expectation maximization fits my intuitions better in this situation.
Pascal’s Mugging
Consider the following lottery: there is a small probability of receiving a large utility payoff and a large probability of having to pay a small utility cost . The expected value of this lottery is . An expected utility maximizer will accept this lottery if this expression is positive, regardless of how small is made. When is very small, is very large and relatively small, this situation is sometimes called Pascal’s mugging.
Expected utility maximization compels one to accept Pascal’s mugging, but some find it unappealing [4]. Geometric expectation maximizers can also be Pascal-mugged, but are generally more reluctant to accept the gamble. For a starting utility , a geometric expectation maximizer will accept the Pascal-mugging if
Note that this diverges as (the cost of losing) approaches (your utility before the gamble). If , then there is no payoff which would justify risking accepting the gamble. It is harder to Pascal-mug a geometric utility maximizer.
Arguments Against using the Geometric Expectation
Violates Von Neumann-Morgenstern Rationality
A geometric utility maximizer rejects the VNM axiom of Continuity, which states that for any three lotteries with preference ordering , there must exist a probability such that . In words: there is some probability with which you can ‘mix’ and such that the resulting lottery is equally preferable to .
Geometric utility maximization rejects this axiom, since, if is a zero utility outcome, then the geometric expected utility of any lottery involving will also be zero, regardless of how large you make the payoff of . In terms of money: a geometric expectation maximizer will never accept the tiniest risk of absolute bankruptcy, even if it comes with an arbitrarily large probability of an arbitrarily large payoff.
Violating the Continuity Axiom is bad because it allows you to be money pumped. Violations of the other VNM axioms allow you to be money pumped (ie. accept a series of lotteries which are guaranteed to make you lose utility) with certainty, but violations of the continuity axiom can only make you worse off with arbitrarily high probability. If you refuse and instead pick , then you will end up worse off with probability , including when is really high. Furthermore, if is zero utility, only needs to be the tiniest bit above zero in order to get a geometric utility maximizer to choose it.
This is pretty bad, but is it much worse than accepting Pascal’s mugging? In Pascal’s mugging, you also accept a situation which is almost guaranteed (with arbitrarily high probability) to make you worse off. But people don’t refer to this as a money pump, as they think the small probability of very high utility compensates for this.
Expected value maximizers fanatically pursue high utility, geometric utility maximizers fanatically avoid low/zero utility. Both are willing to accept almost guaranteed losses in order to pursue these preferences. These seem unappealing in symmetrical ways. Neither (to my mind) comes out better in this comparison.
Another way of highlighting the violation of VNM rationality is point out that any lottery with any nonzero probability of zero utility has geometric expectation of zero, meaning that all such lotteries are equivalently desirable, according to geometric utility maximization. This is especially concerning if we, as good Bayesians, refuse to assign a zero probability to any event, including zero utility ones. This would make all real world lotteries indistinguishable.
There is simple workaround to this which is to treat a zero utility outcome as a finite utility , compare different lotteries by taking their ratio in the limit . This amounts to the rule “choose the lottery with the lowest probability of zero utility, if they have the same probability of zero utility, choose the lottery with the highest geometric expectation of the remaining nonzero utility outcomes”. Its a bit hacky but it works.
The Veil of Ignorance (aka the Original Position)
Arguments going back to Harsanyi[5] consider situations where rational humans (for some definition of ‘rational’) have to choose a course of action affecting a group, without knowing which position in the group they will occupy. Acting self-interestedly, they will choose the option with the highest expected utility. Carlsmith has pointed out that, if a large number of people are drowning and there are several lotteries with payoffs involving saving different numbers of people chosen at random, the lottery which gives each individual the best chance of surviving is also the lottery with the highest expected value of lives saved. Thus, if each person (self-interestedly) voted on a course of action, they would vote to maximize the expected value.
There are situations where maximizing the geometric expectation of lives saved will go against the votes of people behind the veil of ignorance. In many situations this is bad, however, the veil of ignorance is an intuition pump, not an infallible guide which applies in all situations.
Suppose there are only 1000 people left on earth and they are given a choice between two lotteries. In lottery A there is a 51% chance that they all survive and a 49% chance they all die. In lottery B, 500 of them are randomly chosen to survive and the others will die. Under the veil of ignorance, lottery A gives better individual odds of survival, but the geometric expectation favours lottery B. Intuitions pull differently for different people, but it is not clear to me that lottery B is obviously wrong.
Ensemble Averaging
If a large ‘ensemble’ of people all independently accept a lottery and agree to share any profits/losses between them equally, then the amount they will each receive will approach the expected value of the lottery (as the size of the ensemble approaches infinity and the law of large numbers applies). The geometric expectation of the lottery does not provide a guide to your utility in this situation. If people in the ensemble used the geometric expectation to choose their lotteries, they would all end up worse off.
In some ways, this is the counterpart to multiplicative time-averaging we encountered above. Both repeat the gamble many times independently, either sequentially or in parallel. When repeated multiplicatively in sequence, the geometric expectation is the best guide for your wealth, when repeated in parallel, in an ensemble, the expected value is better.
Background Independence
If you take a lottery with outcomes and you add a constant utility to each of them, the expected value of this new lottery is simply plus the expected value of the original lottery. If you are comparing multiple lotteries, the lottery with the highest expected utility doesn’t change if you add a constant to each outcome of each lottery. This property sometimes comes under the umbrella term of ‘background independence’. The preference ordering over lotteries (as decided by the expected value) is not affected by things that are unchanged by the outcome of the lotteries.
This is not the case for the geometric expectation, which is said to reject background independence. What matters for geometric maximization is the proportional change in utility, not the absolute change. For an expected utility maximizer, saving 10 lives is equivalent whether they are 10 people among 8 billion others, of whether they are the last 10 people in the universe. For a geometric utility maximizer, the latter situation represents a larger proportional change.
Rejecting background independence has been criticised by Wilkinson, in his paper ‘In Defence of Fanaticism’, using a variation of Parfit’s Egyptology argument. Roughly, if you reject background independence then there are some lotteries for which choosing between them entirely depends on your knowledge of the ‘background’ which is unaffected by the lotteries, rather than the outcomes that are actually at stake. For example, if you believe in assigning moral weight to aliens on the other side of the universe, then whether or not they exist would not affect the decisions made by an expected utility maximizer, but would affect the decisions made by a geometric utility maximizer. Thus a geometric utility maximizer could conceivably spend large amounts of resources researching astrobiology and distant galaxies in order to make a decision which only affects people on earth.
This is often taken to just be absurd, in the same way that Parfit’s average utilitarian who researches Egyptology in order to decide whether to have children is absurd. But to me its not so obvious. If we’re making decisions about a particular reference class, its not crazy that knowing more about that reference class will change the kind of decisions we make. Accepting a lottery which has a 10% chance of killing 10 people is more significant if there are only 10 people left on earth. Also, rejecting background independence is not unique to geometric averaging: it also applies to any expected utility maximizer who has a utility function which is nonlinear in any quantity (this applies to, for example, any VNM utility function, which by definition must be bounded and therefore nonlinear in some quantity).
Conclusion
When I started this piece, I was hoping that geometric utility maximization would prove to be a satisfactory replacement for expected utility maximization, about which I have some lingering dissatisfaction. Instead, like pushing around a lump under the carpet, it seems to resolve issues in some situations, which then pop up in a different form somewhere else. Geometric utility maximization fits some of my some of my intuitions regarding ethical decision making but not others. The same is true of expected utility maximization. Maybe searching for a version of consequentialism that fits all intuitions is hopeless. But I find viewing expected utility as the default ‘obviously correct’ option unappealing. I can imagine a world where people thought more in terms of the geometric expectation and geometric utility maximization was considered the default model of rational behaviour, as opposed to expected utility. It’s a bit weird but this imaginary world doesn’t look too crazy.
- ^Since log is a monotonically increasing function, and thus encodes the same preference ordering as . 
- ^Imagine that a fair coin will be tossed. If it lands heads, your utility will be multiplied by a factor 1.5. If tails, it will be multiplied by a factor of 0.6. Imagine that this lottery is repeated many times. What is the average factor that your utility will be multiplied by, each time? If your initial utility is and your utility after N repetitions is , then, on average your utility has be multiplied by a factor of each repetition. Call the number of times the coin lands heads and the number of times it lands tails. In the limit that N goes to infinity, invoking the law of large numbers we can say that will approach the 1⁄2 (ie. the probability that the coin lands heads). Thus, in the limit, the average factor that utility is multiplied by is . We call this the ‘time-averaged growth rate’. Note that this expression is the geometric expectation of the utility of the initial lottery, divided by the initial value of your utility. Thus, the geometric expectation of the lottery tells us the time averaged growth rate of the gamble if it is repeated multiplicatively. In this case, while the expected utility gain is positive (), if we repeat this gamble enough times, we are almost guaranteed to end up with lower utility. This example is often given by Ole Peters when discussing his ‘ergodicity economics’. 
- ^This is the question that Tyler Cowen asked Sam Bankman-Fried to understand his famously ‘risk-neutral’ approach to utilitarianism. SBF, as an expected utility maximizer, bit the bullet and said he would accept the gamble. 
- ^Expected utility advocates normally get around Pascal muggings by advocating for bounded utility functions. However, provided that utility is currently low compared to the upper bound, one can always come up with a lottery with a very small probability of a large utility payoff. The only way to avoid this is to say that utility is currently at a significant fraction of the upper bound (eg. you could say that it is impossible to increase utility from its current by more than a factor of 100x). To me, this seems to indicate a lack of imagination regarding how much better the world could be. 
- ^Cardinal Utility in Welfare Economics and in the Theory of Risk-taking (1953) paywalled link here. 
I think this misses out on the fact that utility is always indirect—there is a function from world-state to utility that each rational agent has, so there can never be a lottery that directly awards utility. Meaning you can model the utility valuation linearly, but the mapping of resources to utility with logarithmic declining marginal utility.
Thanks for this, I always thought this is a quite fundamental/important issue. I hope Scott Garrabrant chimes in.
Maximizing the geometric expectation makes a lot of sense when we interpret “utility” as measuring wealth or money. Losing all your wealth is obviously much worse than doubling your wealth is good. The geometric expectation accounts for this by making doubling and halving your wealth (at equal odds) cancel out in expectation.
But more often we mean with “utility” the degree of goodness or badness of an outcome (“welfare”), or how strongly we want it to be true or false (“degree of desire”). These values can arguably be both positive and negative. There seems to be no a priori lower bound on badness of an outcome (or how strongly we disvalue the outcome), just as there is no upper bound on its goodness (or how strongly we value it being true).
But the geometric expectation requires that utility is non-negative. Perhaps even positive, as the problems with zero utility show. Usually the geometric mean is only used for positive numbers.
Eric Neyman also made this point a while ago.
So my current take: Geometric expectation seems correct on utility as wealth, arithmetic expectation seems correct on utility as welfare, or utilities as degrees of desire (“values”). Though I haven’t yet checked how many of the issues you mention are solved by this.
By the way, you should crosspost this to the EA Forum because of its obvious application to ethics. There should be an option for that in the post options, though perhaps you need to link your EA Forum account first.
This seems misguided.
The normal VNM approach is to start with an agent whose behavior satisfies some common sense conditions: can’t be money pumped and so on. From that we can prove that the agent behaves as if maximizing the expectation of some function on outcomes, which we call the “utility function”. That function is not unique, you can apply an affine transform and obtain another utility function describing the same behavior. The behavior is what’s real; utility functions are merely our descriptions of it.
From that perspective, it makes no sense to talk about “maximizing the geometric expectation of utility”. Utility is, by definition, the function whose (ordinary, not geometric) expectation is maximized by your behavior. That’s the whole reason for introducing the concept of utility.
The mistake is a bit similar to how people talk about “caring about other people’s utility, not just your own”. You cannot care about other people’s utility at the expense of your own, it’s a misuse of terms. If your behavior is consistent, then the function that describes it is called “your utility”.
The word ‘utility’ can be used in two different ways: normative and descriptive.
You are describing ‘utility’ in the descriptive sense. I am using it in the normative sense. These are explained in the first paragraph of the Wikipedia page for ‘utility’.
As I explained in the opening paragraph, I’m using the word ‘utility’ to mean the goodness/desirability/value of an outcome. This is normative: if an outcome is ‘good’ then there is the implication that you ought to pursue it.
That makes me even more confused. Are you arguing that we ought to (1) assign some “goodness” values to outcomes, and then (2) maximize the geometric expectation of “goodness” resulting from our actions? But then wouldn’t any argument for (2) depend on the details of how (1) is done? For example, if “goodnesses” were logarithmic in the first place, then wouldn’t you want to use arithmetic averaging? Is there some description of how we should assign goodnesses in (1) without a kind of firm ground that VNM gives?
Without wishing to be facetious: how much (if any) of the post did you read? If you disagree with me, that’s fine, but I feel like I’m answering questions which I already addressed in the post!
I’m not arguing that we ought to maximize the geometric expectation of “goodness” resulting from our actions. I’m exploring what it might look like if we did. In the conclusion, (and indeed, many other parts of the post) I’m pretty ambivalent.
I don’t think so. I think you could have a preference ordering over ‘certain’ world states and the you are still left with choosing a method for deciding between lotteries where the outcome is uncertain. I describe that this is my position in the section titled ‘Geometric Expectation ≠ Logarithmic Utility’.
This is what philosophers of normative ethics do! People disagree on the how exactly to do it, but that doesn’t stop them from trying! My post tries to be agnostic as to what exactly it is we care about and how we assign utility to different world states, since I’m focusing on the difference between averaging methods.
Guilty as charged—I did read your post as arguing in favor of geometric averaging, when it really wasn’t. Sorry.
The main point still seems strange to me, though. Suppose you were programming a robot to act on my behalf, and you asked me to write out some goodness values for outcomes, to program them into the robot. Then before writing out the goodnesses I’d be sure to ask you: which method would the robot use for evaluating lotteries over outcomes? Depending on that, the goodness values I’d write for you (to achieve the desired behavior from the robot) would be very different.
To me it suggests that the goodness values and the averaging method are not truly independent degrees of freedom. So it’s simpler to nail down the averaging method, to use ordinary arithmetic averaging, and then assign the goodness values. We don’t lose any ability to describe behavior (as long as it’s consistent), and we remain with only the degree of freedom that actually matters.
(apologies for taking a couple of days to respond, work has been busy)
I think your robot example nicely demonstrates the difference between our intuitions. As cubefox pointed out in another comment, what representation you want to use depends on what you take as basic.
There are certain types of preferences/behaviours which cannot be expressed using arithmetic averaging. These are the ones which violate VNM, and I think violating VNM axioms isn’t totally crazy. I think its worth exploring these VNM-violating preferences and seeing what they look like when more fleshed out. That’s what I tried to do in this post.
If I wanted a robot that violated one of the VNM axioms, then I wouldn’t be able to describe it by ‘nailing down the averaging method to use ordinary arithmetic averaging and assigning goodness values’. For example, if there were certain states of the world which I wanted to avoid at all costs (and thus violate the continuity axiom), I could assign zero utility to it and use geometric averaging. I couldn’t do this with arithmetic averaging and any finite utilities [1].
A better example is Scott Garrabrant’s argument regarding abandoning the VNM axiom of independence. If I wanted to program a robot which sometimes preferred lotteries to any definite outcome, I wouldn’t be able to program the robot using arithmetic averaging over goodness values.
I think that these examples show that there is at least some independence between averaging methods and utility/goodness.
(ok, I guess you could assign ‘negative infinity’ utility to those states if you wanted. But once you’re doing stuff like that, it seems to me that geometric averaging is a much more intuitive way to describe these preferences. )
Well, you can’t have some states as “avoid at all costs” and others as “achieve at all costs”, because having them in the same lottery leads to nonsense, no matter what averaging you use. And allowing only one of the two seems arbitrary. So it seems cleanest to disallow both.
But geometric averaging wouldn’t let you do that either, or am I missing something?
Fine. But the purpose of exploring different averaging methods is to see whether it expands the richness of the kind of behaviour we want to describe. The point is that using arithmetic averaging is a choice which limits the kind of behaviour we can get. Maybe we want to describe behaviours which can’t be described under expected utility. Having an ‘avoid at all costs state’ is one such behaviour which finds natural description using a non-arithmetic averaging which can’t be described in more typical VNM terms.
If your position is ‘I would never want to describe normative ethics using anything other than expected utility’ then that’s fine, but some people (like me) are interested in looking at what alternatives to expected utility might be. That’s why I wrote this post. As it stands, I didn’t find geometric averaging very satisfactory (as I wrote in the post), but I think things like this are worth exploring.
You are right. Geometric averaging on its own doesn’t give allow violations of independence. But some other protocol for deciding over lotteries does. It’s described more in the Garrabrant post linked above.
Nitpicks: (1) the vNM theorem is arguably about preference, not choice and behavior; and (2) “can’t be money pumped” is not one of the conditions in the theorem.
There are several different representation theorems, not just the one by VNM. They differ in what they take to be basic. See the table here in section 2.2.5. As the article emphasizes, nothing can be concluded from direction of representation about what is more fundamental:
E.g. you could also trivially “represent” preferences in terms of utilities by defining
x≻y:=U(x)>U(y).
This case isn’t mentioned in the table because a representation proof based on it would be too trivial to label it a “theorem” (for example, preferences are automatically transitive because utilities are represented by real numbers and the “larger than” relation on the real numbers is transitive).
If we want to argue what is more fundamental, we need independent arguments; formal representation relations alone are too arbitrary.
There are indeed a few such arguments. For example, it makes both semantic and psychological sense to interpret “I prefer x to y” as “I want x more than I want x”, but it doesn’t seem possible to interpret (semantically and psychologically) plausible statements like “I want x much more than I want y” or “I want x about twice as much as I want y” in terms of preferences, or preferences and probabilities. The reason is that the latter force you to interpret utility functions as invariant under addition of arbitrary constants, which can make utility levels arbitrarily close to each other. So we can interpret preferences as being explained by relations between degrees of desire (strength of wanting), but we can’t interpret desires as being explained by preference relations, or both preferences and probabilities.
No, you can only use the geometric expected utility for nonnegative utility functions.
If bounded below, you can just shift up to make it positive. But the geometric expected utility order is not preserved under shifts.
Thanks for pointing this out, I missed a word. I have added it now.
Violations of continuity aren’t really vulnerable to proper/standard money pumps. The author calls it “arbitrarily close to pure exploitation” but that’s not pure exploitation. It’s only really compelling if you assume a weaker version of continuity in the first place, but you can just deny that.
I think transitivity (+independence of irrelevant alternatives) and countable independence (or the countable sure-thing principle) are enough to avoid money pumps, and I expect give a kind of expected utility maximization form (combining McCarthy et al., 2019 and Russell & Isaacs, 2021).
Against the requirement of completeness (or the specific money pump argument for it by Gustafsson in your link), see Thornley here.
To be clear, countable independence implies your utilities are “bounded” in a sense, but possibly lexicographic. See Russell & Isaacs, 2021.
I feel that since people don’t care ultimately about money, all-nonzero probabilities will make all events have nonzero utility as well.