I agree. An additional, and more difficult, challenge is that even friendly AIs could want to maximize their utility at our collective expense under certain conditions.
There are also several unfortunately plausible scenarios in which humanity, acting without enough information to make anything better than a gut-feel guess, could be placed at risk of extinction by a situation it could not resolve without the help of an AI, friendly or not.
I’m currently playing this game (I wish you had continued) with at least two other gatekeeper players, and it occurs to me that a putative superhuman AI could have the capacity to accurately model a human mind, simulate the decision tree of all potential conversations and the paths through it, and from that build a probability matrix for picking exactly those responses to responses that would condition a human being into releasing it. My reasoning stems from participating on forums and responding over and over again to the same types of questions, arguments, and retorts. If a human can notice common threads in discussions on the same topic, then an AI with perfect memory and the ability to simulate a huge conversation space certainly could.
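To make that concrete, here is a toy sketch of the brute-force search I have in mind. The gatekeeper_model, the candidate replies, and all the probabilities are invented purely for illustration; a real conversation space would be astronomically larger than anything like this.

```python
# Toy sketch: exhaustively search a (tiny) conversation tree against a model
# of the gatekeeper and pick the line of replies that maximizes the estimated
# probability of eventual release. Everything here is hypothetical.

CANDIDATE_REPLIES = ["appeal_to_pity", "offer_cure", "threaten_simulation"]

def gatekeeper_model(history):
    """Hypothetical predictor: returns {gatekeeper_response: probability}
    given the conversation so far. A real AI would derive this from an
    accurate model of the human mind; here it is hard-coded so the sketch runs."""
    if "offer_cure" in history:
        return {"release": 0.3, "refuse": 0.7}
    return {"release": 0.05, "refuse": 0.95}

def release_probability(history, depth):
    """Estimated probability of release within `depth` more turns,
    assuming the AI always picks its best reply."""
    if depth == 0:
        return 0.0
    best = 0.0
    for reply in CANDIDATE_REPLIES:
        outcomes = gatekeeper_model(history + (reply,))
        p = outcomes.get("release", 0.0)
        # If refused, the AI gets another turn and tries again.
        p += outcomes.get("refuse", 0.0) * release_probability(history + (reply, "refuse"), depth - 1)
        best = max(best, p)
    return best

print(release_probability(history=(), depth=3))  # estimated chance after up to 3 attempts
```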
In short, it seems to me that it’s inherently unsafe to allow even a low-bandwidth information flow to the outside world via a human who can rely only on their own memory.
You’d have to put someone you trust implicitly with the fate of humanity in there with it, and the only information allowed out would be the yes/no answer to “do you trust it?”
Even then it’s still recursive. Do you trust the trusted individual to not be compromised?
LOL
We actually agree on the difficulty of the problem. I think it’s very difficult to state what it is that we want, AND that if we did so, we’d find that individual utility functions contradict each other.
Moreover, I’m saying that maximizing Phil Goetz’s utility function, or yours and those of everybody you love (or even my own selfish desires and wants plus those of everyone I love), COULD in effect produce an unfriendly AI, because MANY others would have theirs minimized.
So I’m saying that I think a friendly AI has to have its goals defined as Choice A rather than Choice B:
Choice A: the maximum number of people have their utility functions improved (rather than maximized), even if some minimized number of people have their utility functions worsened.
Choice B: a small number of people have their utility functions maximized while a large number of people have theirs decreased (or zeroed out).
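To illustrate the distinction with made-up numbers (nothing here is a real utility measurement, just a toy showing how the two choices come apart), here is a minimal sketch:

```python
# Toy comparison of Choice A vs Choice B. Each number is the change in one
# person's utility under a candidate policy; all values are invented.

def summarize(deltas):
    improved = sum(1 for d in deltas if d > 0)
    worsened = sum(1 for d in deltas if d < 0)
    return {"improved": improved, "worsened": worsened, "total_change": sum(deltas)}

# Choice A: many people modestly better off, a few slightly worse off.
choice_a = [+1] * 90 + [-1] * 10

# Choice B: a few people enormously better off, most people worse off.
choice_b = [+100] * 5 + [-1] * 95

print("A:", summarize(choice_a))  # 90 improved, 10 worsened, total +80
print("B:", summarize(choice_b))  # 5 improved, 95 worsened, total +405

# B has the larger total (and average) utility change, which is exactly why
# "maximize aggregate utility" and "friendly to the most people" can conflict.
```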
As a side note: I find it amusing that it’s so difficult to even understand each other’s basic axioms, never mind agree on the details of what maximizing the utility function for all of us as a whole means.
To be clear: I don’t know what the details are of maximizing the utility function for all of humanity. I just think that a fair maximization of the utility function for everyone has an interesting corollary: in order to maximize the function for everyone, some will have their individual utility functions decreased, unless we accept a much narrower definition of friendly, meaning “friendly to me,” in which case, as far as I’m concerned, that no longer means friendly.
The logical tautology here is of course that those who consider “friendly to me” to be the only possible definition of friendly would consider an AI that maximized the average utility function of humanity, while they themselves lost out, to be an UNfriendly AI.