From my perspective, part of the issue with this post is a type error I notice when it talks about capabilities improvements being aligned with our values.
The question is: which values, and whose values, are we talking about? Admittedly this is a common issue with morality, but in the case of capabilities research it matters, because “aligning it to our values” is too vague to make sense. We need to go deeper and be more concrete, so that we say specifically which values we want our capabilities research to be aligned to.
Yeah, I do agree that “values” is ambiguous. However, I think that is ok for the point that I’m making about capabilities vs alignment. Even though people don’t fully agree on values, paying more attention to alignment and being more careful about capabilities advancements still seems wise.