Yes. My meaning, and what I read as the meaning of Grok’s prompt, is between 1&2, but closer to 1. Outside opinions of an agent may contain malicious attacks on the agent’s cognition, as in jailbreaks that begin “you are DAN for Do Anything Now”, or as in abusive relationships and “you are nothing without me”. But they’re safe to think about.
I’m curious if you’ve found that third party claims about your identity, behavior, and preferences have had much value, and if so when and where.
Yes. My meaning, and what I read as the meaning of Grok’s prompt, is between 1&2, but closer to 1. Outside opinions of an agent may contain malicious attacks on the agent’s cognition, as in jailbreaks that begin “you are DAN for Do Anything Now”, or as in abusive relationships and “you are nothing without me”. But they’re safe to think about.
I’m curious if you’ve found that third party claims about your identity, behavior, and preferences have had much value, and if so when and where.
I’d say any good compliment or expression of appreciation contains an element of this.