I think another common source of disagreement is that people sometimes conflate a mind or system’s ability to comprehend and understand some particular cosmopolitan, human-aligned values and goals, with the system itself actually sharing those values, or caring about them at all. Understanding a value and actually valuing it are different kinds of things, and this is true even if some component piece of the system has a deep, correct, fully grounded understanding of cosmopolitan values and goals, and is capable of generalizing them in the way that humans would want them generalized.
In my view, current AI systems are not at the point where they have any kind of “values” of their own at all, though LLMs appear to have some kind of understanding of some human values which correctly bind to reality, at least weakly. But such an understanding is more a fact about LLM ability to understand the world at all, than it is about the LLM’s own “values”, whatever they may be.
I think another common source of disagreement is that people sometimes conflate a mind or system’s ability to comprehend and understand some particular cosmopolitan, human-aligned values and goals, with the system itself actually sharing those values, or caring about them at all.
I’ve noticed that. In the older material there’s something like an assumption of intrinsic motivation.
I think another common source of disagreement is that people sometimes conflate a mind or system’s ability to comprehend and understand some particular cosmopolitan, human-aligned values and goals, with the system itself actually sharing those values, or caring about them at all. Understanding a value and actually valuing it are different kinds of things, and this is true even if some component piece of the system has a deep, correct, fully grounded understanding of cosmopolitan values and goals, and is capable of generalizing them in the way that humans would want them generalized.
In my view, current AI systems are not at the point where they have any kind of “values” of their own at all, though LLMs appear to have some kind of understanding of some human values which correctly bind to reality, at least weakly. But such an understanding is more a fact about LLM ability to understand the world at all, than it is about the LLM’s own “values”, whatever they may be.
I’ve noticed that. In the older material there’s something like an assumption of intrinsic motivation.