Maybe the easiest way of generalising this is programming B to put one block in the hole; but, because B was trained in a noisy environment, it assigns only a 99.9% probability to the block really being in the hole even when it observes it there. Then six blocks in the hole gives higher expected utility, and we get the same behaviour.
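For concreteness, here is a quick sketch of that expected-utility comparison, on the reading (my assumption, not spelled out above) that B's utility is 1 if at least one block is really in the hole and 0 otherwise, with the 99.9% judgements independent across blocks:

```python
from fractions import Fraction

# Exact arithmetic, so the tiny difference isn't lost to float rounding.
P_REALLY_IN_HOLE = Fraction(999, 1000)  # B's confidence per observed block

def expected_utility(n_blocks):
    """P(at least one of the n observed blocks is really in the hole)."""
    return 1 - (1 - P_REALLY_IN_HOLE) ** n_blocks

for n in (1, 6):
    print(n, expected_utility(n))
# 1 -> 999/1000
# 6 -> 999999999999999999/1000000000000000000
# So six blocks does carry (slightly) higher expected utility than one.
```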
That still involves training it with no negative feedback error term for excess blocks (a term which would overwhelm a mere 0.1% uncertainty).
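A matching sketch of this point, with an illustrative penalty per excess block (the 0.01 value is mine, not from the thread) subtracted from the expected utility:

```python
from fractions import Fraction

P_REALLY_IN_HOLE = Fraction(999, 1000)
PENALTY_PER_EXCESS_BLOCK = Fraction(1, 100)  # illustrative value only

def eu_with_penalty(n_blocks):
    """Expected utility minus a linear penalty on every block beyond the first."""
    p_at_least_one = 1 - (1 - P_REALLY_IN_HOLE) ** n_blocks
    return p_at_least_one - PENALTY_PER_EXCESS_BLOCK * (n_blocks - 1)

for n in (1, 6):
    print(n, float(eu_with_penalty(n)))
# 1 -> 0.999
# 6 -> ~0.95, so any penalty much larger than the 0.1% uncertainty
#      makes "one block" the optimum again.
```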
This is supposed to be a toy model of excessive simplicity. Do you have suggestions for improving it (for purposes of presenting to others)?
Maybe explain how it works when being configured, and then stops working when B gets a better model of the situation/runs more trial-and-error trials?
Ok.