Wes_W

Karma: 620

Wes_W 14 Feb 2017 18:48 UTC
2 points
0
in reply to: Pimgd’s comment on: Is Willpower a Finite Resource, or a Myth?
“Willpower is not exhaustible” is not necessarily the same claim as “willpower is infallible”. If, for example, you have a flat 75% chance of turning down sweets, then avoiding sweets still makes you more likely to not eat them. You’re not spending willpower, it’s just inherently unreliable.

Wes_W 29 Nov 2016 3:20 UTC
3 points
0
in reply to: thrawnca’s comment on: Counterfactual Mugging

I’m pretty sure that decision theories are not designed on that basis.

You are wrong. In fact, this is a totally standard thing to consider, and “avoid back-chaining defection in games of fixed length” is a known problem, with various known strategies.

Wes_W 31 Oct 2016 20:18 UTC
3 points
0
in reply to: thrawnca’s comment on: Counterfactual Mugging
Yes, that is the problem in question!

If you want the payoff, you have to be the kind of person who will pay the counterfactual mugger, even once you no longer can benefit from doing so. Is that a reasonable feature for a decision theory to have? It’s not clear that it is; it seems strange to pay out, even though the expected value of becoming that kind of person is clearly positive before you see the coin. That’s what the counterfactual mugging is about.

If you’re asking “why care” rhetorically, and you believe the answer is “you shouldn’t be that kind of person”, then your decision theory prefers lower expected values, which is also pathological. How do you resolve that tension? This is, once again, literally the entire problem.

Wes_W 28 Oct 2016 5:30 UTC
3 points
0
in reply to: thrawnca’s comment on: Counterfactual Mugging
Your decision is a result of your decision theory, and your decision theory is a fact about you, not just something that happens in that moment.

You can say—I’m not making the decision ahead of time, I’m waiting until after I see that Omega has flipped tails. In which case, when Omega predicts your behavior ahead of time, he predicts that you won’t decide until after the coin flip, resulting in hypothetically refusing to pay given tails, so—although the coin flip hasn’t happened yet and could still come up heads—your yet-unmade decision has the same effect as if you had loudly precommitted to it.

You’re trying to reason in temporal order, but that doesn’t work in the presence of predictors.

Wes_W 16 Oct 2016 6:49 UTC
3 points
0
in reply to: thrawnca’s comment on: Counterfactual Mugging
You’re fundamentally failing to address the problem.

For one, your examples just plain omit the “Omega is a predictor” part, which is key to the situation. Since Omega is a predictor, there is no distinction between making the decision ahead of time or not.

For another, unless you can prove that your proposed alternative doesn’t have pathologies just as bad as the Counterfactual Mugging, you’re at best back to square one.

It’s very easy to say “look, just don’t do the pathological thing”. It’s very hard to formalize that into an actual decision theory, without creating new pathologies. I feel obnoxious to keep repeating this, but that is the entire problem in the first place.

Wes_W 14 Sep 2016 23:29 UTC
6 points
0
in reply to: thrawnca’s comment on: Counterfactual Mugging

But in the single-shot scenario, after it comes down tails, what motivation does an ideal game theorist have to stick to the decision theory?

That’s what the problem is asking!

This is a decision-theoretical problem. Nobody cares about it for immediate practical purpose. “Stick to your decision theory, except when you non-rigorously decide not to” isn’t a resolution to the problem, any more than “ignore the calculations since they’re wrong” was a resolution to the ultraviolet catastrophe.

Again, the point of this experiment is that we want a rigorous, formal explanation of exactly how, when, and why you should or should not stick to your precommitment. The original motivation is almost certainly in the context of AI design, where you don’t HAVE a human homunculus implementing a decision theory, the agent just is its decision theory.

Wes_W 18 Aug 2016 17:07 UTC
2 points
0
in reply to: thrawnca’s comment on: Counterfactual Mugging

There will never be any more 10K; there is no motivation any more to give 100. Following my precommitment, unless it is externally enforced, no longer makes any sense.

This is the point of the thought experiment.

Omega is a predictor. His actions aren’t just based on what you decide, but on what he predicts that you will decide.

If your decision theory says “nah, I’m not paying you” when you aren’t given advance warning or repeated trials, then that is a fact about your decision theory even before Omega flips his coin. He flips his coin, gets heads, examines your decision theory, and gives you no money.

But if your decision theory pays up, then if he flips tails, you pay $100 for no possible benefit.

Neither of these seems entirely satisfactory. Is this a reasonable feature for a decision theory to have? Or is it pathological? If it’s pathological, how do we fix it without creating other pathologies?

Wes_W 17 Aug 2016 15:36 UTC
2 points
0
in reply to: thrawnca’s comment on: Counterfactual Mugging
Decision theory is an attempt to formalize the human decision process. The point isn’t that we really are unsure whether you should leave people to die of thirst, but how we can encode that in an actual decision theory. Like so many discussions on Less Wrong, this implicitly comes back to AI design: an AI needs a decision theory, and that decision theory needs to not have major failure modes, or at least the failure modes should be well-understood.

If your AI somehow assigns a nonzero probability to “I will face a massive penalty unless I do this really weird action”, that ideally shouldn’t derail its entire decision process.

The beggars-and-gods formulation is the same problem. “Omega” is just a handy abstraction for “don’t focus on how you got into this decision-theoretic situation”. Admittedly, this abstraction sometimes obscures the issue.

Wes_W 15 Aug 2016 23:49 UTC
2 points
0
in reply to: thrawnca’s comment on: Counterfactual Mugging
Precommitments are used in decision-theoretic problems. Some people have proposed that a good decision theory should take the action that it would have precommitted to, if it had known in advance to do such a thing. This is an attempt to examine the consequences of that.

Wes_W 11 Jul 2016 2:52 UTC
1 point
0
in reply to: Yosarian2’s comment on: Fake Explanations
I’m not sure you’ve described a different mistake than Eliezer has?

Certainly, a student with a sufficiently incomplete understanding of heat conduction is going to have lots of lines of thought that terminate in question marks. The thesis of the post, as I read it, is that we want to be able to recognize when our thoughts terminate in question marks, rather than assuming we’re doing something valid because our words sound like things the professor might say.

Wes_W 8 Jun 2016 20:31 UTC
2 points
0
in reply to: SquirrelInHell’s comment on: Morality of Doing Simulations Is Not Coherent [SOLVED, INVALID]
No part of his objection hinged on reversibility, only the same linearity assumption you rely on to get a result at all.

Wes_W 7 Jun 2016 21:44 UTC
1 point
0
in reply to: SquirrelInHell’s comment on: Morality of Doing Simulations Is Not Coherent [SOLVED, INVALID]
OK. I think I see what you are getting at.

First, one could simply reject your conclusion:

However at no point did I do anything that could be described as “simulating you”.

The argument here is something like “just because you did the calculations differently doesn’t mean your calculations failed to simulate a consciousness”. Without a real model of how computation gives rise to consciousness (assuming it does), this is hard to resolve.

Second, one could simply accept it: there are some ways to do a given calculation which are ethical, and some ways that aren’t.

I don’t particularly endorse either of these, by the way (I hold no strong position on simulation ethics in general). I just don’t see how your argument establishes that simulation morality is incoherent.

Wes_W 7 Jun 2016 6:24 UTC
9 points
0
on: Morality of Doing Simulations Is Not Coherent [SOLVED, INVALID]

From the point of view of physics, it contains garbage,

But a miracle occurs, and your physics simulation still works accurately for the individual components...?

I get that your assumption of “linear physics” gives you this. But I don’t see any reason to believe that physics is “linear” in this very weird sense. In general, when you do calculations with garbage, you get garbage. If I time-evolve a simulation of (my house plus a bomb) for an hour, then remove all the bomb components at the end, I definitely do not get the same result as running a simulation with no bomb.

Wes_W 2 Apr 2016 5:28 UTC
10 points
0
on: What makes buying insurance rational?

And apparently insurance companies can make money because the expected utility of buying insurance is lower than it’s price.

No, the expected monetary value of insurance is lower than its price. (Assuming that the insurance company’s assessment of your risk level is accurate.) You’re equivocating between money and utility, which is the source of your confusion.

Suppose I offered a simple wager: we flip a coin, and if it comes up heads, I give you a million dollars. But if it comes up tails, you owe me a million dollars, and I get every cent you earn until that debt is paid. Is this bet fair?

Monetarily, yes. But even if I skew the odds in your favor a little, maybe ⁶⁰⁄₄₀, I’ll bet you still don’t want to take it. Why not? Isn’t an expected return of $200,000 wildly in your favor?

Yeah, but that doesn’t matter. An extra million dollars would make your life somewhat better; spending the next twenty years flat broke would make your life drastically worse. Expected utility is very negative.

The utility of money is sometimes claimed to be logarithmic. For small amounts of money you can use a linear approximation, but if the outcome can shift you to a totally different region of the curve, the concavity becomes very important.

Wes_W 19 Nov 2015 4:40 UTC
0 points
0
in reply to: TheAncientGeek’s comment on: Reflexive self-processing is literally infinitely simpler than a many world interpretation
Non-locality, surely? Or “would violate locality”?

Wes_W 15 Nov 2015 2:27 UTC
2 points
0
in reply to: mgin’s comment on: Reflexive self-processing is literally infinitely simpler than a many world interpretation
Because we can’t actually get infinite information, but we still want to calculate things.

And in practice, we can in fact calculate things to some level of precision, using a less-than-infinite amount of information.

Wes_W 22 Oct 2015 15:41 UTC
0 points
0
in reply to: gjm’s comment on: The mystery of Brahms
You’re right, I missed that line.

Wes_W 22 Oct 2015 0:56 UTC
2 points
0
in reply to: gjm’s comment on: The mystery of Brahms
If I were making music in the style of someone who died six years before I was born, people would probably think I was out of style. I’m not sure if this is the historical fallacy I don’t have a name for, where we gloss over differences in a few decades because they’re less salient to us than the differences between the 1990s and the 1960s, or if musical styles just change more quickly now.

Wes_W 3 Oct 2015 18:52 UTC
0 points
0
in reply to: Good_Burning_Plastic’s comment on: The Infinity Project
I spent a long time associating Amazon with “something in South America, so it’s probably not accessible to me” before the company was as ultra-famous as it is now.

Wes_W 21 Aug 2015 0:55 UTC
0 points
0
in reply to: Vaniver’s comment on: Help Build a Landing Page for Existential Risk?
On the other hand, asteroid mining technologies have some risks of their own, although this only reaches “existential” if somebody starts mining the big ones.

The largest nuclear weapon was the Tsar Bomba: 50 megatonnes of TNT, roughly equivalent to a 3.3-million-tonne impactor. Asteroids larger than this are thought to number in the tens of millions, and at the time of writing only 1.1 million had been provisionally identified. Asteroid shunting at or beyond this scale is by definition a trans-nuclear technology, which means a point comes where the necessary level of trust is unprecedented.