whpearson comments on Recommended Reading for Friendly AI Research

whpearson 15 Oct 2010 20:04 UTC
1 point
0
Here is a more complex variant that I can’t see how to dismiss easily.

If you can build a “predict how humans feel in situation x” function, you can do some interesting things. Lets call this function feel(x). Now as well as first order happiness, you can also predict how they will feel when told about situation X, so feel(“told about X”).

You might be able to recover something like preference if you can calculate feel(“the situation where X is suggested and, told about feel(X) and told about all other possible situations”) , for all possible situations, as long as you can rank the output of feel(X) in some way.

Well as long as the human simulator predictor can cope with holding in all possible situations, and not return “worn out” for all situations.

Anyway it is an interesting riff off the idea. Anyone see any holes that I am missing?
- Vladimir_Nesov 15 Oct 2010 20:09 UTC
  0 points
  0
  Parent
  Try to figure out what maximizes this estimate method. It won’t be anything you’d want implemented, it will be a wireheading stimulus. Plus, FAI needs to valuate (and work to implement) whole possible worlds, not verbal descriptions. And questions about possible worlds involve quantifies of data that a mere human can’t handle.
  - whpearson 15 Oct 2010 21:11 UTC
    1 point
    0
    Parent
    
    Try to figure out what maximizes this estimate method. It won’t be anything you’d want implemented, it will be a wireheading stimulus.
    
    I’m not sure that there is a verbal description of a possible world that is also a wirehead stimulus for me. There might be, which might be enough to discount this method.
    
    And questions about possible worlds involve quantifies of data that a mere human can’t handle.
    
    True.
    - JohnDavidBustard 16 Oct 2010 10:29 UTC
      0 points
      0
      Parent
      I’m not sure I understand the distinction between an answer that we would want and a wireheading solution. Are not all solutions wireheading with an elaborate process to satisfy our status concerns. I.e. is there a real difference between a world that satisfies what we want and directly altering what we want? If the wire in question happens to be an elaborate social order rather than a direct connection why is that different? What possible goal could we want pursued other than the one which we want?
      - whpearson 16 Oct 2010 12:27 UTC
        0 points
        0
        Parent
        
        is there a real difference between a world that satisfies what we want and directly altering what we want?
        
        From an evolutionary point of view those things that manage to procreate will out compete those things that change themselves to not care about that and just wirehead.
        
        So in non-singleton situations, alien encounters and any form of resource competition it matters whether you wirehead or not. Pleasure, in an evolved creature, can be seen as the giving (very poor) information on the map to the territory of future influence for the patterns that make up you.
        JohnDavidBustard 16 Oct 2010 15:37 UTC
        0 points
        0
        Parent
        So, assuming survival is important, a solution that maximises survival plus wireheading would seem to solve that problem. Of course it may well just delay the inevitable heat death ending but if we choose to make that important, then sure, we can optimise for survival as well. I’m not sure that gets around the issue that any solution we produce (with or without optimisation for survival) is merely an elaborate way of satisfying our desires (in this case including the desire to continue to exist) and thus all FAI solutions are a form of wireheading.