Thank you. This is exactly what I needed right now.
Eliezer, I hope you will take it as a form of high praise rather than insult that I stopped reading your article halfway through, typed this short comment, and am now going back to do some much-needed work.
(Hopefully I’ll get back to reading the rest later.)
It took me a week to think about it. Then I read all the comments, and thought about it some more. And now I think I have this “problem” well in hand. I also think that, incidentally, I arrived at Eliezer’s answer as well, though since he never spelled it out I can’t be sure.
To be clear: a lot of people have said that the decision depends on the problem parameters, so I'll explain just what it is I'm solving. See, Eliezer wants our decision theory to WIN. That only makes sense if we have all the relevant information: we can imagine plenty of situations where we make the wisest decision possible given the available information and it still turns out to be wrong; the universe is not fair, and we know this already. So I will assume we have all the relevant information needed to win. We will also assume that Omega does have the capability to accurately predict my actions, and that causality is not violated (rationality cannot be expected to win if causality is violated!).
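Under these assumptions, the payoff structure of the problem can be written down as a toy sketch. (The $1,000 figure for the transparent box is the standard Newcomb amount, not something stated above, and the function name is mine.)

```python
# Toy Newcomb payoff sketch, assuming a perfect predictor.
# Standard payoffs: the opaque box holds $1,000,000 iff Omega
# predicts one-boxing; the transparent box always holds $1,000.

def payoff(predicted_one_box: bool, actually_one_box: bool) -> int:
    opaque = 1_000_000 if predicted_one_box else 0
    transparent = 1_000
    # A one-boxer takes only the opaque box; a two-boxer takes both.
    return opaque if actually_one_box else opaque + transparent

# With a perfect predictor, prediction always matches the actual choice:
assert payoff(True, True) == 1_000_000   # one-boxer
assert payoff(False, False) == 1_000     # two-boxer
```

The perfect-prediction assumption is what collapses the four cells of the payoff table down to the two asserted ones; the other two cells (where prediction and action disagree) simply never occur.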
Assuming this, I can have a conversation with Omega before it leaves. Mind you, it’s not a real conversation, but having sufficient information about the problem means I can simulate its part of the conversation even if Omega itself refuses to participate and/or there isn’t enough time for such a conversation to take place. So it goes like this...
Me: “I do want to gain as much as possible in this problem. To that end, I want you to put as much money in the box as possible. How do I do that?”
Omega: “I will put 1M$ in the box if you take only it; and nothing if you take both.”
Me: “Ah, but we’re not violating causality here, are we? That would be cheating!”
Omega: “True, causality is not violated. To rephrase, my decision on how much money to put in the box will depend on my prediction of what you will do. Since I have this capacity, we can consider these synonymous.”
Me: “Suppose I’m not convinced that they are truly synonymous. All right then. I intend to take only the one box”.
Omega: “Remember that I have the capability to predict your actions. As such, I know whether you are sincere or not.”
Me: “You got me. Alright, I’ll convince myself really hard to take only the one box.”
Omega: “Though you are sincere now, in the future you will reconsider this decision. As such, I will still place nothing in the box.”
Me: “And you are predicting all this from my current state, right? After all, this is one of the parameters in the problem—that after you’ve placed money in the boxes, you are gone and can’t come back to change it”.
Omega: “That is correct; I am predicting a future state from information on your current state”.
Me: “Aha! That means I do have a choice here, even before you have left. If I change my state so that I am unable or unwilling to two-box once you’ve left, then your prediction of my future “decision” will be different. In effect, I will be hardwired to one-box. And since I still want to retain my rationality, I will make sure that this hardwiring is strictly temporary.”
fiddling with my own brain a bit
Omega: “I have now determined that you are unwilling to take both boxes. As such, I will put the 1,000,000$ in the box.”
Omega departs
I walk unthinkingly toward the boxes and take just the one
Voila. Victory is achieved.
My main conclusion here is that any decision theory that does not allow for changing strategies is a poor decision theory indeed. This IS essentially the Friendly AI problem: you can rationally one-box, but you need to have access to your own source code in order to do so. Not having that would make you so inflexible as to be the equivalent of an Iterated Prisoner’s Dilemma program that can only defect or only cooperate; that is, a very bad one.
The reason this is not obvious is that the way the problem is phrased is misleading. Omega supposedly leaves “before you make your choice”, but in fact there is not a single choice here (one-box or two-box). Rather, there are two decisions to be made, if you can modify your own thinking process:
1. Whether or not to have the ability and inclination to make decision #2 “rationally” once Omega has left, and
2. Whether to one-box or two-box.
...Where decision #1 can and should be made prior to Omega’s leaving, and obviously DOES influence what’s in the box. Decision #2 does not influence what’s in the box, but the state in which I approach that decision does. This is very confusing initially.
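To make the two-decision structure concrete, here is a small sketch in which the box contents depend only on the agent's state at prediction time (decision #1), never on the later action (decision #2). The class and method names are my own illustration, and the $1,000 transparent-box amount is the standard Newcomb figure:

```python
# Sketch: Omega predicts from the agent's state *before* leaving
# (decision #1); the later action (decision #2) follows from that state.

class Agent:
    def __init__(self, hardwired_to_one_box: bool):
        # Decision #1: whether to self-modify before Omega predicts.
        self.hardwired = hardwired_to_one_box

    def choose(self) -> str:
        # Decision #2: made after Omega has left. A hardwired agent
        # cannot two-box; an unmodified agent two-boxes, since at this
        # point taking both boxes strictly dominates.
        return "one-box" if self.hardwired else "two-box"

def omega_fills_box(agent: Agent) -> int:
    # Omega predicts the future action from the agent's current state.
    predicted = agent.choose()
    return 1_000_000 if predicted == "one-box" else 0

def play(agent: Agent) -> int:
    opaque = omega_fills_box(agent)  # boxes fixed; Omega departs
    action = agent.choose()          # decision #2, made afterwards
    return opaque if action == "one-box" else opaque + 1_000

assert play(Agent(hardwired_to_one_box=True)) == 1_000_000
assert play(Agent(hardwired_to_one_box=False)) == 1_000
```

Note that `play` never lets the action influence `omega_fills_box`; causality runs only from the agent's state to both the prediction and the action, which is exactly why decision #1 is where the money is won.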
Now, I don’t really know CDT too well, but it seems to me that presented as these two decisions, even it would be able to correctly one-box on Newcomb’s problem. Am I wrong?
Eliezer—if you are still reading these comments so long after the article was published—I don’t think it’s an inconsistency in the AI’s decision making if the AI’s decision making is influenced by its internal state. In fact I expect that to be the case. What am I missing here?