Yes. Which is to say, clearly you fall into the second class of people (those who have studied decision theory a lot) and hence my explanation was not meant to apply to you.
Which isn’t to say I agree with everything you say.
From the standpoint of reflective consistency, there should not be a divergence between rational decisions and rational algorithms; the rational algorithm should search for and output the rational decision,
Decisions can have different causal impacts from the decision theories that produce them, so there seems to be no reason to accept this claim. Insofar as the rational decision is the decision which wins (which depends on the causal effects of the decision) and the rational algorithm is the algorithm which wins (which depends on the causal effects of the algorithm), there seems to be no reason to think the two should coincide. Plus, I like being able to draw distinctions that can’t be drawn using your terminology.
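To make the divergence concrete, here is a minimal sketch (the payoffs are the standard illustrative ones; the 0.99 predictor accuracy is a hypothetical stand-in): holding the prediction fixed, two-boxing always gains $1,000, yet agents whose decision-type is one-boxing walk away far richer in expectation.

```python
# Newcomb's Problem with the standard illustrative payoffs: the opaque
# box contains $1,000,000 iff one-boxing was predicted; the transparent
# box always contains $1,000.

ACCURACY = 0.99  # hypothetical predictor accuracy

def payoff(prediction, act):
    opaque = 1_000_000 if prediction == "one-box" else 0
    transparent = 1_000 if act == "two-box" else 0
    return opaque + transparent

# Decision-level ("causal") comparison: hold the prediction fixed.
# Two-boxing dominates, gaining $1,000 whichever prediction was made.
for prediction in ("one-box", "two-box"):
    gain = payoff(prediction, "two-box") - payoff(prediction, "one-box")
    print(prediction, gain)  # -> 1000 either way

# Algorithm-level comparison: the prediction tracks the agent's
# decision-type, so we take expectations over the predictor's accuracy.
def expected_payoff(decision_type):
    other = "two-box" if decision_type == "one-box" else "one-box"
    return (ACCURACY * payoff(decision_type, decision_type)
            + (1 - ACCURACY) * payoff(other, decision_type))

print(expected_payoff("one-box"))  # 990000.0
print(expected_payoff("two-box"))  # 11000.0
```

The decision that wins in the causal sense and the algorithm that wins come apart here, which is exactly the distinction I want to be able to draw.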
and the rational decision should be to adopt the rational algorithm.
Agreed (if you are faced with a decision of which algorithm to follow). Of course, this is not the decision that you’re faced with in Newcomb’s Problem (and adding more options is just to deny the hypothetical).
Suppose you regard Newcomb’s Problem as rewarding an agent with a certain decision-type, namely the sort of agent who one-boxes. TDT can be viewed as an algorithm which searches a space of decision-types and always decides to have the decision-type such that this decision-type has the maximal payoff. (UDT and other extensions of TDT can be viewed as maximizing over spaces broader than decision-types, such as sensory-info-dependent strategies or (in blackmail) maximization vantage points).
Yes. I think this is an impressive achievement, and I find TDT/UDT to be elegant, useful theories. The fact that I draw a distinction between rational theories and rational decisions does not mean I cannot value the answers to both questions.
once you realize that a rational algorithm can just as easily maximize over its own decision-type as the physical consequences of its acts, there is just no reason to regard two-boxing as a winning decision or winning action in any sense, nor regard yourself as needing to occupy a meta-level vantage point in which you maximize over theories.
Well... perhaps. Obviously, just because you can maximise over algorithms, it doesn’t follow that you can’t still talk about maximising over causal consequences. So either we have a (boring) semantic debate about what we mean by “decisions” or a debate about practicality: that is, the argument would be that talk of maximising over algorithms is clearly more useful than talk of maximising over causal consequences, so why care about the latter? For the most part, I buy this argument about practicality (but it doesn’t mean that two-boxing philosophers are wrong, just that they’re playing a game that both you and I feel little concern for).
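To put the point in code (a sketch reusing payoff and expected_payoff from the snippet above): both maximisations are perfectly well defined over the same problem; they simply take different arguments and return different answers.

```python
# Maximise over acts, holding the prediction causally fixed.
def act_maximiser(prediction):
    return max(("one-box", "two-box"),
               key=lambda act: payoff(prediction, act))

# Maximise over decision-types, letting the prediction covary with
# the agent's type (roughly the TDT/UDT picture).
def type_maximiser():
    return max(("one-box", "two-box"), key=expected_payoff)

print(act_maximiser("one-box"), act_maximiser("two-box"))  # two-box two-box
print(type_maximiser())                                    # one-box
```

Neither maximisation renders the other incoherent; the practical question is just which one is worth talking about.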
This seems akin to precommitment, and precommitment means dynamic inconsistency means reflective inconsistency. Trying to maximize over theories means you have not found the single theory which directly maximizes without any recursion or metaness, and that means your theory is not maximizing the right thing.
I know what all these phrases mean but don’t see why it follows that your theory is not maximising the “right” thing. Perhaps it is not maximising a thing that you find to be useful or interesting (particularly for self-modifying AIs). If this is what you mean, then fine. If you mean, however, that two-boxers are wrong on their own terms, then I would need a more compelling argument (I’ve read your TDT paper btw, so reference to that won’t resolve things here).
Claiming that TDTers are maximizing over decision theories, then, is very much a CDT standpoint which is not at all how someone who sees logical decision theories as natural would describe it. From our perspective we are just picking the winning algorithm output (be the sort of agent who picks one box) in one shot, and without any retreat to a meta-level. The output of the winning algorithm is the winning decision, that’s what makes the winning algorithm winning.
Sure, the distinction is from the CDT perspective. You use words differently to proponents of CDT (at which point, the whole difference between LWers’ views and philosophers’ views should be unsurprising). I’m not really interested in getting into a semantic debate, though. I think that LWers are too quick to conclude that philosophers are playing the game wrong, when the view should actually be that they’re playing the wrong game.
My guess is that a large part of the divergence comes down to the fact that LWers and philosophers are focused on different questions. Philosophers (two-boxing philosophers, at least) are focused on the question of which decision “wins”, whereas LWers are focused on the question of which theory “wins” (or, at least, this is what a large group of LWers seems to me to be doing; more on which soon).
So philosophical proponents of CDT will almost all (all, in my experience) agree that, if choosing a decision theory to follow, it is rational to choose a one-boxing theory; but they will say that, if choosing a decision, it is rational to two-box.
A second part of the divergence seems to me to relate to the toolsets available to the two groups: LWers have TDT and UDT; philosophers have Parfit on rational irrationality (and a whole massive literature on this sort of issue).
I actually think that LWers need to be divided into two distinct groups: those who have done a lot of research into decision theory and those who haven’t.
For those who haven’t, I suspect that the “disagreement” with philosophers is mostly apparent rather than actual: these people don’t distinguish the two questions above, don’t realise that philosophers are answering the winning-decision question and not the winning-theory question, and don’t realise that philosophers haven’t simply ignored these issues but have a whole literature on rational irrationality, winning decisions vs winning theories, and so on.
This impression is especially powerful because the story is often told on LW as if LW takes rationality to be about “winning” whereas philosophers are interested in analysing our concept of rationality (and surely it’s easy to pick between these). More accurately, though, the disagreement is between two divergent views of “winning”.
For those who have studied more decision theory, the story is a different one, and I don’t know the views of these people in enough depth to comment on them.