Benya comments on Pascal’s Mugging for bounded utility functions

Benya 25 Feb 2013 5:20 UTC
0 points
0

What you need to avoid the mugging is that P(more than X years of fun iff I pay up | deal for X years offered) goes to zero faster than log(X) grows

No.

Okay, you’re saying that as X goes up, the probability of getting X years of fun even if you don’t pay up also goes up, because any program that offers a deal of X years has to include a specification of the number X? So the expected utility of not paying up doesn’t stay constant as we vary X, but increases with X (at least asymptotically, for very large X)?

Well, you’re right on that, and that’s in fact a point I hadn’t considered, thanks. But I was replying to this:

In particular, if Solomonoff induction assigns to models where starting from now you get X years of fun a total probabilty p(X) that asymptotically decreases fast enough with X so that U(X) * p(X) also decreases, it will not be subject to Pascal’s mugging.

If by this you mean something else than that P(more than X years of fun iff I pay up | deal for X years offered) log(X) → 0, then I don’t understand what you mean by p(X). If you mean e.g. p(X) = P(deal for X years offered), then why would p(X) U(X) → 0 avoid Pascal’s mugging? (Not that it does go to zero.)

As you can see this posterior doesn’t depend on n^^^n or even on n, which is clearly inconsistent with the notion (formalized in a theorem by Solomonoff) that Solomonoff induction learns an accurate model of the environment.

That theorem says, roughly (actually I’m just giving a particular consequence), that given a particular world program, after seeing a certain finite number of bits produced by this program, Solomonoff induction will predict all future bits correctly. The particular number of bits needed initially depends, of course, on the program. (More generally: Given a computable probability distribution over world programs, with probability one there is some number of bits after which Solomonoff induction’s conditional distribution over subsequent bits will equal the “true” conditional distribution.) Of course, Solomonoff’s theorem only allows the agent to observe the environment, not interact with it, but that doesn’t seem to be the issue here (we can consider Hutter’s variants instead).

You are not keeping the world program fixed, and for each world program considered, you only talk about what happens after the agent has a certain number of bits (which is fixed given the world program, i.e. you don’t let it tend to infinity), so the theorem does not apply.

What antecedent do you want to deny in your argument, anyway? If your argument worked, it would still work if we replaced “grant X years of fun” by “write the symbol FUN on the tape X times”. But there is certainly a program B that reads X from the internal tape, writes X to the output tape, reads a symbol from the input tape, and writes FUN X times iff the input symbol is ACCEPT, and similarly for C and W(n^^^n).