Don’t people notice the ways that seeing themselves engage in particular behaviors updates their own self-image? The payoff for being polite to language users is that it makes me see myself as the kind of person who is generally polite. The results of being mean and bullying would be that I would come to see myself as the kind of person who is okay with engaging in mean and bullying behavior to get what they want.
Or maybe everybody else has a skill at compartmentalizing which I lack? But I absolutely catch myself applying prompting strategies to human conversation, because the part of me which does that self-concept feedback loop doesn’t differentiate between my behaviors toward animate audiences versus my behaviors toward “inanimate” ones.
Don’t people notice the ways that seeing themselves engage in particular behaviors updates their own self-image? The payoff for being polite to language users is that it makes me see myself as the kind of person who is generally polite. The results of being mean and bullying would be that I would come to see myself as the kind of person who is okay with engaging in mean and bullying behavior to get what they want.
Or maybe everybody else has a skill at compartmentalizing which I lack? But I absolutely catch myself applying prompting strategies to human conversation, because the part of me which does that self-concept feedback loop doesn’t differentiate between my behaviors toward animate audiences versus my behaviors toward “inanimate” ones.
You can split your brain and treat LLMs differently, in a different language. Rather, I can and I think most people could as well
‘split your brain’ was inaccurate phrasing to use here, sorry