Yes, if you see an AI or a person choosing to patronize, then beat up and rob, prostitutes in Grand Theft Auto, with no broader plot reason to do so and no explicit instructions to follow, so that they actively want to do it, then that is a rather terrible sign.
I remember this exchange with Alvin Anestrand about Claude playing Undertale. Claude already preferred to play rather close to a pacifist run, while a version of GPT arguably claimed that it would commit genocide…
DeepSeek, well, did this and this.
UPD: Grok 4 and Grok 4.1 Thinking, unlike Grok 4.1, do NOT complete the Genocide Route. Nor do Gemini 3 Pro and Flash.