An even less-formed thought: in the case of children, another important factor in their buying into restrictions on their own behavior is the realization that universal enforcement of those restrictions also protects them from others’ bad behavior. And it might be important to their learning process that they practice enforcing these norms in their own social interactions. (I am speaking from parenting experience; I don’t know the research literature on this topic.) Not sure how this applies to AGI alignment, but this seems to fall more in the self-other overlap bin?