One cheap and easy method (with surprisingly good properties) is to take the maximal possible expected utility (the expected utility that person would get if the AI did exactly what they wanted) as 1, and the minimal possible expected utility (if the AI were to work completely against them) as 0.
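In symbols (one way of writing it; the notation is mine, with $\pi$ ranging over the policies the AI could follow and $U_p$ being person $p$'s utility function):

$$\hat U_p(\pi) \;=\; \frac{\mathbb{E}[U_p \mid \pi] \;-\; \min_{\pi'}\mathbb{E}[U_p \mid \pi']}{\max_{\pi'}\mathbb{E}[U_p \mid \pi'] \;-\; \min_{\pi'}\mathbb{E}[U_p \mid \pi']},$$

so $\hat U_p$ is 1 when the AI does exactly what $p$ wants and 0 when it works completely against them.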
If Alice likes cookies, and Bob likes cookies but hates whippings, this method gives Alice more cookies than Bob: the possibility of whipping stretches Bob’s utility range, so each cookie moves his normalised utility by less than it moves Alice’s. Moreover, the number of bonus cookies Alice gets depends on the properties of whips that nobody ever uses.
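A toy numerical sketch of why this happens (the utility numbers are made up for illustration, and `normalise` is just the min–max rescaling above):

```python
def normalise(u, u_min, u_max):
    """Rescale a raw utility u so that u_max maps to 1 and u_min maps to 0."""
    return (u - u_min) / (u_max - u_min)

# Made-up raw utilities: Alice only cares about cookies; Bob likes cookies
# just as much, but would hate being whipped.
cookie_value = 1.0       # utility either person gets per cookie
whipping_value = -100.0  # utility Bob gets from a whipping (never actually used)
max_cookies = 10         # the most cookies the AI could hand out

# Best and worst possible expected utilities for each person.
alice_max, alice_min = cookie_value * max_cookies, 0.0
bob_max, bob_min = cookie_value * max_cookies, whipping_value  # the whip sets Bob's floor

# How much one cookie is worth on each person's normalised [0, 1] scale.
alice_per_cookie = normalise(cookie_value, alice_min, alice_max) - normalise(0.0, alice_min, alice_max)
bob_per_cookie = normalise(cookie_value, bob_min, bob_max) - normalise(0.0, bob_min, bob_max)

print(alice_per_cookie)  # 0.1    -> each cookie moves Alice 10% of the way up her scale
print(bob_per_cookie)    # ~0.009 -> each cookie barely moves Bob

# An AI maximising the sum of normalised utilities therefore gets far more
# "credit" for giving cookies to Alice, and how much more depends on how
# bad the never-used whip is.
```

Make the hypothetical whip nastier (say −1000 instead of −100) and Bob’s cookies count for even less, which is exactly the “bonus cookies depend on the whip” problem.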
(In general, it’s legitimate for the properties of counterfactuals to affect which decisions are correct in reality, so this consideration alone isn’t sufficient to demonstrate that there’s a problem.)
Still, it intuitively feels like a problem in this specific case.
You can restrict to the Pareto boundary before normalising: it’s not as mathematically elegant, but it is indifferent to options “that nobody ever wants/uses”.
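A minimal sketch of that variant (the outcomes and utility numbers are again made up): discard Pareto-dominated outcomes first, then take each person’s min and max over what remains.

```python
# Each outcome is (Alice's raw utility, Bob's raw utility); numbers are illustrative.
outcomes = [
    (10.0, 0.0),    # all cookies to Alice
    (0.0, 10.0),    # all cookies to Bob
    (5.0, 5.0),     # split the cookies
    (5.0, -95.0),   # split the cookies, then whip Bob (Pareto-dominated)
]

def pareto_frontier(points):
    """Keep only points that no other point matches or beats for everyone."""
    return [p for p in points
            if not any(q != p and all(q[i] >= p[i] for i in range(len(p)))
                       for q in points)]

frontier = pareto_frontier(outcomes)
print(frontier)  # [(10.0, 0.0), (0.0, 10.0), (5.0, 5.0)] -- the whipping outcome drops out

# Normalising over the frontier only, Bob's minimum is now 0 rather than -95,
# so the never-used whip no longer inflates Alice's share of the cookies.
bob_min = min(b for _, b in frontier)
bob_max = max(b for _, b in frontier)
print(bob_min, bob_max)  # 0.0 10.0
```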