Prase, Chris, I don’t understand. Eliezer’s example is set up in such a way that, regardless of what the paperclip maximizer does, defecting gains one billion lives and loses two paperclips.
Basically, we’re being asked to choose between a billion lives and two paperclips (paperclips in another universe, no less, so we can’t even put them to good use).
The only argument for cooperating would be if we had reason to believe that the paperclip maximizer will somehow do whatever we do. But I can’t imagine how that could be true. Being a paperclip maximizer, it’s bound to defect, unless it has reason to believe that we would somehow do whatever it does. I can’t imagine how that could be true either.
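To make the dominance argument concrete, here’s a sketch with an illustrative payoff matrix (the specific numbers are my assumption, chosen only to match the “one billion lives, two paperclips” figures above; entries are billions of human lives saved and paperclips produced):

```python
# Hypothetical payoffs, (billions of lives saved, paperclips), indexed by
# (our move, paperclip maximizer's move); "C" = cooperate, "D" = defect.
payoffs = {
    ("C", "C"): (2, 2),
    ("C", "D"): (0, 3),
    ("D", "C"): (3, 0),
    ("D", "D"): (1, 1),
}

def lives_gained_by_defecting(clippy_move):
    """Extra lives saved if we switch from C to D, holding Clippy's move fixed."""
    return payoffs[("D", clippy_move)][0] - payoffs[("C", clippy_move)][0]

def paperclips_lost_to_clippy(clippy_move):
    """Paperclips Clippy loses when we switch from C to D, holding its move fixed."""
    return payoffs[("C", clippy_move)][1] - payoffs[("D", clippy_move)][1]

for move in ("C", "D"):
    print(move, lives_gained_by_defecting(move), paperclips_lost_to_clippy(move))
# Either way Clippy moves: defecting gains 1 billion lives, costs it 2 paperclips.
```

With any payoffs of this shape, defecting strictly dominates for us, which is exactly why the only counterargument has to route through the other player’s move somehow depending on ours.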
Or am I missing something?
This post hits me far more strongly than the previous ones on this subject.
I think your main point is that it’s positively dangerous to believe in an objective account of morality if you’re trying to build an AI, because you will then falsely believe that a sufficiently intelligent AI will be able to determine the correct morality—so you won’t worry about programming it to be friendly (or Friendly).
I’m sure you’ve mentioned this before, but this is more forceful, at least to me. Thanks.
Personally, even though I’ve mentioned that I thought there might be an objective basis for morality, I’ve never believed that every mind (or even a large fraction of minds) would be able to find it. So I’m in total agreement that we shouldn’t just assume a superintelligent AI would do good things.
In other words, this post drives home to me that, pragmatically, the view of morality you propose is the best one to have, from the point of view of building an AI.