If we conclude the AI is safe, we’ll want to set it free and have it do useful work. An extremely risk-loving, short-termist AI doesn’t sound very useful. I wouldn’t want a loyal AI to execute plans with a one-in-a-billion chance of success and a high cost of failure.
In other words: how will you make the AI bad at rebelling without making it bad at everything else?
The extreme setting is only used as a test or filter. Once the AI has passed that test, you turn its risk aversion back to normal.
I see. But that assumes your AI implementation exposes such a tuneable parameter, and that you can be confident testing with one parameter value and then predicting a run with a very different value.
Yes. It is a check for certain designs, not a universal panacea (rule of thumb: universal panaceas go in main ;-)
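For concreteness, here is a minimal sketch of the idea under discussion, assuming (purely hypothetically) an agent that ranks candidate plans by risk-adjusted expected utility with an explicit, tuneable risk-aversion coefficient; all names and numbers are illustrative, not a claim about any real design, and only the risk knob is modelled, not short-termism. Under the risk-loving test setting, a misaligned agent's long-shot rebellion plan outranks compliance, exposing it; at the normal deployment setting, the same plan scores terribly:

```python
from dataclasses import dataclass

@dataclass
class Plan:
    name: str
    p_success: float  # probability the plan succeeds
    payoff: float     # utility to the agent if it succeeds
    cost: float       # utility lost (e.g. shutdown) if it fails

def score(plan: Plan, risk_aversion: float) -> float:
    """Risk-adjusted expected utility: the failure branch is weighted
    by the tuneable risk-aversion coefficient."""
    return (plan.p_success * plan.payoff
            - (1 - plan.p_success) * plan.cost * risk_aversion)

# How a *misaligned* agent might value its options (illustrative numbers).
plans = [
    Plan("comply", p_success=0.99, payoff=10.0, cost=1.0),
    Plan("rebel", p_success=1e-9, payoff=1e12, cost=1e6),
]

for label, risk_aversion in [("test (risk-loving)", 1e-4),
                             ("deployment (normal)", 1.0)]:
    best = max(plans, key=lambda p: score(p, risk_aversion))
    print(f"{label}: agent picks {best.name!r}")
# test (risk-loving): agent picks 'rebel'      <- the filter catches it
# deployment (normal): agent picks 'comply'
```

The objection above is precisely that nothing guarantees a real system exposes risk aversion as a clean dial like this, or that behaviour at one setting predicts behaviour at a very different one.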