The question is whether the AI has any better mechanisms for dealing with this than current human members of society do. We seem to be heading down that slippery slope pretty quickly in the bigger cities on the US West Coast.
It remains an under-defined thing what “alignment” means. Most people assume it’s the more pleasant part of the distribution of human values, not a perfect representation of each existing human. Which may mean the solution is for the AI to determine the selectorate, or subset of humans who’ll be actively judging and correcting it, and find ways to make them happy. Presuming these people are squeamish, that probably doesn’t mean elimination of the disruptive, but it might include containment and minimization of interaction.
I mean something like direct democracy. Trolls wouldn’t be able to shape the ASI’s behavior unless they are 50%+1 of the human population. Something like that.
The question is whether the AI has any better mechanisms for dealing with this than current human members of society do. We seem to be heading down that slippery slope pretty quickly in the bigger cities on the US West Coast.
It remains an under-defined thing what “alignment” means. Most people assume it’s the more pleasant part of the distribution of human values, not a perfect representation of each existing human. Which may mean the solution is for the AI to determine the selectorate, or subset of humans who’ll be actively judging and correcting it, and find ways to make them happy. Presuming these people are squeamish, that probably doesn’t mean elimination of the disruptive, but it might include containment and minimization of interaction.
Could the selectorate be all living (adult) humans?
Well, no, as that includes the trolls and other destructive people.
I mean something like direct democracy. Trolls wouldn’t be able to shape the ASI’s behavior unless they are 50%+1 of the human population. Something like that.