If my sole terminal value is “I want to go on a rollercoaster”, then an agent who is aligned to me would have the value “I want Tamsin Leake to go on a rollercoaster”, not “I want to go on a rollercoaster myself”. The former necessarily has the same ordering over worlds; the latter doesn’t.
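A minimal sketch of that ordering claim, using made-up toy world states (the dictionaries and function names here are illustrative assumptions, not anything from the comment): the “aligned” utility ranks worlds exactly as mine does, while the “first-person” one doesn’t.

```python
# Toy worlds: who ends up riding the rollercoaster.
worlds = [
    {"tamsin_rides": True,  "agent_rides": False},
    {"tamsin_rides": False, "agent_rides": True},
    {"tamsin_rides": False, "agent_rides": False},
]

def my_utility(w):
    # My sole terminal value: I (Tamsin) go on a rollercoaster.
    return 1 if w["tamsin_rides"] else 0

def aligned_agent_utility(w):
    # "I want Tamsin Leake to go on a rollercoaster."
    return 1 if w["tamsin_rides"] else 0

def first_person_agent_utility(w):
    # "I want to go on a rollercoaster myself."
    return 1 if w["agent_rides"] else 0

def same_ordering(u, v, worlds):
    # Same ordering iff every pair of worlds is ranked the same way.
    return all(
        (u(a) > u(b)) == (v(a) > v(b)) and (u(a) == u(b)) == (v(a) == v(b))
        for a in worlds for b in worlds
    )

print(same_ordering(my_utility, aligned_agent_utility, worlds))       # True
print(same_ordering(my_utility, first_person_agent_utility, worlds))  # False
```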
Quite. We don’t hear enough about individuality and competitive/personal drives when talking about alignment. I worry a lot that the abstraction and aggregation of “human” values completely misses the point of what most humans actually do.