I will try this one more time. I’m assuming the AI needs a goal to do anything, including “understand”. The question of what a piece of text “means” does not, I think, have a definite answer that human philosophers would agree on.
You could try to program the AI to determine meaning by asking whether the writer (of the text) would verbally agree with the interpretation in some hypothetical situation. In which case, congratulations: you’ve rediscovered part of CEV. As with full CEV, the process of extrapolation is everything. (If the AI is allowed to ask what you’d agree to under torture or direct brain-modification, once it gets the ability to do those, then it can take anything whatsoever as its goal.)
Okay, you’re right, this does presuppose correctly performing volition extrapolation (or pointing the AI to the right concept of volition). It doesn’t presuppose full CEV over multiple people, or knowing whether you want to specify CEV or MR, which slightly simplifies the underlying problem.