Dagon comments on How could a friendly AI deal with humans trying to sabotage it? (like how present day internet trolls introduce such problems)

Dagon 28 Nov 2021 2:47 UTC
4 points
0
The question is whether the AI has any better mechanisms for dealing with this than current human members of society do. We seem to be heading down that slippery slope pretty quickly in the bigger cities on the US West Coast.
It remains an under-defined thing what “alignment” means. Most people assume it’s the more pleasant part of the distribution of human values, not a perfect representation of each existing human. Which may mean the solution is for the AI to determine the selectorate, or subset of humans who’ll be actively judging and correcting it, and find ways to make them happy. Presuming these people are squeamish, that probably doesn’t mean elimination of the disruptive, but it might include containment and minimization of interaction.
- Lone Pine 29 Nov 2021 6:00 UTC
  1 point
  0
  Parent
  Could the selectorate be all living (adult) humans?
  - Dagon 29 Nov 2021 14:28 UTC
    2 points
    0
    Parent
    Well, no, as that includes the trolls and other destructive people.
    - Lone Pine 30 Nov 2021 1:32 UTC
      1 point
      0
      Parent
      I mean something like direct democracy. Trolls wouldn’t be able to shape the ASI’s behavior unless they are 50%+1 of the human population. Something like that.