Tao Lin comments on Cosmopolitan values don’t come free

Tao Lin 1 Jun 2023 18:57 UTC
6 points
4
AIs could learn to cooperate with perfect selfishness, but humans and AIs usually learn easier to compute heuristics / “value shards” early in training, which persist to some extent after the agent discovers the true optimal policy, although reflection or continued training could stamp out the value shards later.
- the gears to ascension 1 Jun 2023 19:21 UTC
  4 points
  0
  Parent
  maybe, but if the ai is playing a hard competitive game it will directly learn to be destructively ruthless