A core part of Paul’s argument is that devoting 1/million of your values to humans applies only a minute amount of selection pressure against you. It could be that coordination causes less kindness, because without coordination it’s more likely that some fraction of agents retain small vestigial values that were never selected against or intentionally removed.
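To make “minute selection pressure” concrete, here is a toy replicator-dynamics sketch. The model and numbers are my own illustration, not Paul’s: it treats spending 1/million of your resources on kindness as a pure relative fitness penalty s = 1e-6 and asks how long selection takes to wash the trait out.

```python
# Toy model (my own illustration): kindness costs a relative fitness
# penalty s, and two types compete under pure replicator dynamics.
import math

s = 1e-6   # assumed fitness penalty for keeping 1/million of kindness
p = 0.5    # initial fraction of kind agents (arbitrary starting point)

for t in (1_000, 100_000, 1_000_000, 10_000_000):
    # After t generations, the odds of the kind type shrink by (1 - s)^t.
    odds = (p / (1 - p)) * (1 - s) ** t
    freq = odds / (1 + odds)
    print(f"{t:>10,} generations: kind fraction ~ {freq:.5f}")

# The trait's half-life is roughly ln(2)/s ~ 693,000 generations,
# which is the sense in which the selection pressure is "minute".
print(f"half-life ~ {math.log(2) / s:,.0f} generations")
```

On these assumptions it takes on the order of a million generations of competition for the kind fraction to fall appreciably, which is why mutation-and-selection alone is such a slow route to removing the trait.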
I think this is not particularly relevant if entities are deliberately adopting competitive personas in order to win contests. It might take a lot of mutation to drop the 1/million penalty, but dropping it probably takes very little cognition for an agent that believes a meme like “winning isn’t everything, it’s the only thing.”