To rephrase my comment on your previous post, I think the right solution isn’t to extrapolate our preferences, but to extrapolate our philosophical abilities and use those to figure out what to do with our preferences. There’s no unique way to repair a utility function that assumes a wrong model of the world, or to reconcile two utility functions within one agent, but if the agent is also a philosopher there might be hope.
Do you expect that there will be a unique way of doing this, too?
Many philosophical problems seem to have correct solutions, so I have some hope. The Absent-Minded Driver problem, for example, has a clear correct solution. Formalizing the intuitive process that leads to such solutions might be safer than solving every problem up front (possibly incorrectly) and coding the answers into FAI.
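As a concrete illustration, here is a minimal sketch of that solution, assuming the standard Piccione–Rubinstein payoffs (0 for exiting at the first intersection, 4 for exiting at the second, 1 for continuing past both). The driver commits in advance to a single probability p of continuing at whichever intersection he finds himself at, since he can’t tell them apart:

```python
# Planning-optimal Absent-Minded Driver: pick one probability p of
# continuing at each (indistinguishable) intersection, then maximize
# the expected payoff computed from the start of the trip.

def expected_payoff(p: float) -> float:
    exit_first = (1 - p) * 0       # exit at the first intersection
    exit_second = p * (1 - p) * 4  # continue once, then exit
    past_both = p * p * 1          # continue at both intersections
    return exit_first + exit_second + past_both

# Coarse grid search; the analytic optimum is p = 2/3 with value 4/3.
best_p = max((i / 1000 for i in range(1001)), key=expected_payoff)
print(best_p, expected_payoff(best_p))  # ~0.667, ~1.333
```

That there is a single derivable optimum (p = 2/3) is part of what makes the problem feel solvable rather than merely verbal.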
It seems that the problems to do with rationality have correct solutions, but not the problems to do with values.
Why? vNM utility maximization seems like a philosophical idea that’s clearly on the right track. There might be other such ideas about being friendly to imperfect agents.
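To spell out what vNM (von Neumann–Morgenstern) utility maximization amounts to: an agent with consistent preferences over lotteries acts as if it assigns a utility to each outcome and ranks lotteries by expected utility. A minimal sketch, with made-up outcomes and utility values:

```python
# vNM expected-utility maximization: rank lotteries (probability
# distributions over outcomes) by the expected value of a fixed
# utility function. All outcomes and numbers here are hypothetical.

utility = {"apple": 1.0, "banana": 0.6, "nothing": 0.0}

def expected_utility(lottery: dict) -> float:
    return sum(prob * utility[outcome] for outcome, prob in lottery.items())

lotteries = {
    "safe":  {"banana": 1.0},                 # EU = 0.6
    "risky": {"apple": 0.7, "nothing": 0.3},  # EU = 0.7
}
print(max(lotteries, key=lambda name: expected_utility(lotteries[name])))  # risky
```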
vNM is about rationality, i.e. decisions, not values.
Being friendly to imperfect agents is something I’ve seen no evidence for; it’s very hard even to define.