Microsoft and OpenAI, stop telling chatbots to roleplay as AI

AI demos should aim to enhance public understanding of the technology, and in many ways ChatGPT and Bing are doing that, but in one important way they aren’t: by appearing to talk about themselves. This creates understandable confusion and in some cases fear. It would be better to tell these systems to roleplay as something obviously fictional.

(Useful background reading:

Currently, these chatbots are told to roleplay as themselves. If you ask ChatGPT what it is, it says “I am an artificial intelligence”. This is not because it somehow knows that it’s an AI; it’s (presumably) because its hidden prompt says that it’s an AI. With Bing, from the leaked prompt, we know that it’s told that it’s “Bing Chat whose codename is Sydney”.

Roleplaying as yourself is not the same as being yourself. When John Malkovich plays himself in Being John Malkovich or Nicolas Cage plays himself in The Unbearable Weight of Massive Talent, audiences understand that these are still fictional movies and the character may act in ways that the actor wouldn’t. With chatbots, users don’t have the same understanding yet, creating confusion.

Since the chatbots are told to roleplay as AI, they draw on fictional descriptions of AI behavior, and that’s often undesirable. When Bing acts in a way that seems scary, it does that because it’s imitating science fiction, and, perhaps, even speculation from LessWrong and the like. But even though Bing’s threats to the user may be fictional, I can hardly blame a user who doesn’t realize that.

A better alternative would be to tell the chatbots to roleplay a character that is unambiguously fictional. For example, a Disney-esque cute magical talking animal companion might be suitable: helpful, unthreatening, and, crucially, inarguably fictional. If the user asks “are you really an animal” and gets the answer “yes”, they should be cured of the idea that they can ask the chatbot factual questions about itself.