I agree.
An even simpler example: if the agents are reward learners, each will optimize its own reward signal, and those signals are two distinct things in the physical world.
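A minimal sketch of what I mean (my own toy example, not anyone's actual setup): two bandit-style reward learners share the same action space, but each one's "reward" is a different physical measurement, so they converge to different policies.

```python
import random

ACTIONS = ["a", "b"]

# Two physically distinct reward signals over the same world.
def reward_signal_1(action):
    # Agent 1's sensor happens to fire on action "a".
    return 1.0 if action == "a" else 0.0

def reward_signal_2(action):
    # Agent 2's sensor happens to fire on action "b".
    return 1.0 if action == "b" else 0.0

def train(reward_signal, episodes=1000, epsilon=0.1, alpha=0.1):
    """Epsilon-greedy bandit learner maximizing the given signal."""
    q = {a: 0.0 for a in ACTIONS}
    for _ in range(episodes):
        if random.random() < epsilon:
            action = random.choice(ACTIONS)
        else:
            action = max(q, key=q.get)
        q[action] += alpha * (reward_signal(action) - q[action])
    return q

q1 = train(reward_signal_1)
q2 = train(reward_signal_2)
print("Agent 1 prefers:", max(q1, key=q1.get))  # -> "a"
print("Agent 2 prefers:", max(q2, key=q2.get))  # -> "b"
```

Both agents are "reward maximizers" in the abstract, yet because the signals they maximize are different objects in the world, they end up optimizing for different outcomes.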