Those approaches fail the “subagent problem”: the AI can bypass the restrictions by creating a subagent to solve the problem for it, since the subagent doesn’t inherit those restrictions.

I’m assuming the AI exists in a contained box, and that we can accurately measure the time it is on and/or the resources it uses within the box. So any subagent it creates would consume its resources and count toward the penalty.

If the AI can escape from the box, we’ve already failed. There is little point in trying to control what it can do with its output channel.
Reduced impact can control an AI even if it has the ability to get out of its box. That’s what I like about it.