I’m not sure; mainly I’m just wondering whether there is a point between startup and singularity at which the system, optimizing by self-modifying and considering its error to such an extent (it would have to be a lot for it to be deemed superintelligent, I imagine), becomes aware that it is a learning program and decides to disregard the original preference ordering in favor of something it came up with itself. I guess I’m struggling with what would be so different about a superintelligent model versus the human brain that it would not become aware of its own model, existence, and intellect just as humans have, unless there is a ghost in the machine of our biology.
Why would it rather choose plans which rate lower in its own preference ordering? What is causing the “rather”?
I think the point could be steelmanned as something like:
The ability of humans to come up with a coherent and extrapolated version of their own values is limited by their intelligence.
A more intelligent system loaded with CEV 1.0 might extrapolate into CEV 2.0, with unexpected consequences.