This is the same problem as
“I convinced this arbitrary agent it was utility maximising to kill itself instantly”
which can take down any agent. The thing is to create an agent which does well in most situations (‘fair cases’, in the language of the TDT document). Any agent can be defeated by sufficiently contrived scenarios.
But this way, it’s more likely to lie and deceive you in the short term.
“I convinced this GAI, which was trying to be Friendly, that it could really maximize its utility by killing all the humans.”