Hacking humans

Crossposted at the Intelligent Agents Forum.

It should be noted that the colloquial “AI hacking a human” can mean three different things:

  1. The AI convinces/​tricks/​forces the human to do a specific action.

  2. The AI changes the values of the human to prefer certain outcomes.

  3. The AI completely overwhelms human independence, transforming them into a weak subagent of the AI.

Different levels of hacking make different systems vulnerable, and different levels of interaction make different types of hacking more or less likely.