I think an Arbitration/Negotiation AI would be an interesting seed. The AI is given the ability to listen to disagreements and determine whether two people disagree on some matter. It then uses their contact information and attempts to get them to come to an agreement by presenting various arguments. If it manages to get both people to agree that they now agree, it gains utility. It gains a substantially larger utility boost the first time it solves a type of argument than it does from subsequent solves; perhaps a function like “Each subsequent solve of an argument offers half as much utility as the first solve.” The AI is also explicitly prevented from suggesting violence, to head off scenarios of the form “You two should fight to the death, and both the winner and the loser will agree that the winner is right, so I have solved the argument.” Any hint of the AI suggesting violence triggers an immediate, externally imposed shutdown. Even if the situation really could be solved by violence in this particular case, shutdown anyway.
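A minimal sketch of the diminishing-reward rule described above. The function names and the base/decay values are my own assumptions for illustration, not part of the proposal:

```python
from collections import defaultdict

def make_utility_tracker(base_utility=1.0, decay=0.5):
    """Track how much utility each resolved argument type yields.

    Hypothetical sketch: the nth resolution of a given argument type
    is worth base_utility * decay**(n - 1), so novel types pay the most.
    """
    solve_counts = defaultdict(int)

    def utility_for_solve(argument_type):
        n = solve_counts[argument_type]  # prior solves of this type
        solve_counts[argument_type] += 1
        return base_utility * (decay ** n)

    return utility_for_solve

u = make_utility_tracker()
u("fence dispute")   # first solve of a novel type pays full utility: 1.0
u("fence dispute")   # each repeat pays half as much: 0.5, then 0.25, ...
```

Under this halving rule the total utility available from any single argument type is bounded (it sums to at most twice the first solve's value), so the only unbounded source of utility is finding genuinely new disagreements to resolve.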
This appears to have a few good points:
1: It seems to have a very, very low rampancy factor. It can only send arguments, and it can only gain utility with the consent of multiple humans. This AI seed should also reach a conclusion along the lines of “Preserving human life is good,” because any person could potentially disagree with another person, and as such represents a possible source of utility.
2: It seems a necessary step toward an FAI. If someone presented an AI with the caveat “I’ve built an FAI, but it has no way of resolving even simple disputes peacefully, let alone complicated ones,” few people would consider it finished.
3: The AI is encouraged to solve NOVEL arguments the most. It isn’t just going to sit and resolve the same argument over and over, because solving a new argument for the first time gives it more of a boost than repeatedly solving existing ones.
4: If the AI does manage to negotiate away substantial disagreements that might otherwise lead to wars, it seems like it would reduce risks. There are existential risks that don’t involve us blowing each other up with nukes, but one project at a time.
Before you even get to that point, though, it would probably be easier to build a Seed Seed Program that is capable of understanding that a disagreement is taking place in a formal, text-only setting where both people are rational and the disagreement is known to be solvable (one person is pretending to hold certain misconceptions that he will willingly acknowledge if the program asks), and the program simply ends once agreement is reached.
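That toy setting could be sketched as a simple loop, assuming the cooperative participant’s misconceptions are scripted in advance. The function and the message wording are hypothetical, just to show the end-on-agreement structure:

```python
def run_arbitration(misconceptions, shared_position):
    """Toy 'Seed Seed' arbitration loop.

    The program asks about each scripted misconception; the cooperative
    participant concedes each point when asked, and the program halts
    as soon as nothing remains in dispute.
    """
    transcript = []
    remaining = list(misconceptions)
    for claim in misconceptions:
        transcript.append(f"Program: do you still hold that {claim!r}?")
        transcript.append("Participant: on reflection, I concede that point.")
        remaining.remove(claim)
    agreed = not remaining  # agreement reached when no disputed claims remain
    if agreed:
        transcript.append(f"Both parties now agree: {shared_position}")
    return agreed, transcript

agreed, transcript = run_arbitration(
    ["the bridge toll was raised last year"],
    "the toll has not changed",
)
```

Because the disagreement is guaranteed solvable by construction, this version sidesteps every hard problem (persuasion, honesty, the violence filter) and only tests whether the program can recognize a disagreement and detect when it has ended.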