avturchin comments on AI safety via market making

avturchin 27 Jun 2020 11:34 UTC
2 points
0
On possible way how it could go wrong:
M to H: “Run out of the room!”
H runs out.
Adv prints something, but H never reads it. So M reached stable output.
- edoarad 27 Jun 2020 14:42 UTC
  1 point
  0
  Parent
  I think that M only prints something after converging with Adv, and that Adv does not print anything directly to H
  - avturchin 27 Jun 2020 15:31 UTC
    2 points
    0
    Parent
    Yes, but all what I said could be just a convergent prediction of M. Not the real human runs out of the room, but M predicted that its model human of H’ will leave the room.