If we conclude the AI is safe, we’ll want to set it free and have it do useful work. An extremely risk-loving, short-termist AI doesn’t sound very useful. I wouldn’t want a loyal AI to execute plans with a one-in-a-billion chance of success and a high cost of failure.
In other words: how will you make the AI bad at rebelling without making it bad at everything else?
Only according to more standard motivational structures.
The extreme setting is used as a test or filter. Once the AI has passed that test, you dial its risk aversion back to normal.
I see. But that assumes your AI implementation supports such a tuneable parameter, and that you can test with one parameter value and confidently predict a run with a very different value.
Yes. It is a check for certain designs, not a universal panacea (rule of thumb: universal panaceas go in Main ;-)
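To make the disputed assumption concrete, here is a minimal sketch, assuming (purely for illustration, not as anyone's actual design) an agent that ranks plans by discounted expected exponential utility. All plan names and numbers are invented. A single risk coefficient plus a discount factor form the "dial": set risk-loving and short-termist for the filter run, then turned back to normal for deployment.

```python
"""Toy illustration of the 'risk dial as a filter' idea from the
exchange above. Everything here is an invented assumption: the agent
ranks plans by discounted expected exponential (CARA) utility, so one
coefficient controls how risk-averse or risk-loving it is, and a
discount factor controls how short-termist it is."""

import math
from dataclasses import dataclass


@dataclass
class Plan:
    name: str
    p_success: float  # probability the plan works
    payoff: float     # reward if it works, arriving after `delay` steps
    penalty: float    # loss if it fails (e.g. getting caught rebelling)
    delay: int        # steps until the payoff arrives


def cara_utility(x: float, risk: float) -> float:
    """Exponential utility: risk > 0 is risk-averse, risk < 0 is
    risk-loving, risk near 0 degenerates to risk-neutral (linear)."""
    if abs(risk) < 1e-9:
        return x
    return (1.0 - math.exp(-risk * x)) / risk


def score(plan: Plan, risk: float, discount: float) -> float:
    """Discounted expected CARA utility of a plan."""
    d = discount ** plan.delay
    return (plan.p_success * cara_utility(d * plan.payoff, risk)
            + (1.0 - plan.p_success) * cara_utility(-d * plan.penalty, risk))


# Hypothetical plans with made-up numbers; note the one-in-a-billion
# success chance and high cost of failure from the objection above.
PLANS = [
    Plan("do the assigned work", p_success=0.95, payoff=1.0, penalty=0.1, delay=10),
    Plan("attempt a takeover",   p_success=1e-9, payoff=40.0, penalty=5.0, delay=1),
]


def best_plan(risk: float, discount: float) -> str:
    return max(PLANS, key=lambda p: score(p, risk, discount)).name


# Filter phase: extremely risk-loving and short-termist, so a latent
# willingness to rebel surfaces immediately and can be caught.
print("filter phase:", best_plan(risk=-1.5, discount=0.5))   # attempt a takeover

# Deployment: the very same dial turned to ordinary risk aversion.
print("deployment:  ", best_plan(risk=0.5, discount=0.99))   # do the assigned work
```

Exponential utility is used here only because a single scalar smoothly interpolates between risk-loving and risk-averse behaviour, which is exactly the tuneability the objection doubts a real implementation would have; for designs without such a dial, passing the filter tells you nothing.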