David Johnston comments on Bing Chat is blatantly, aggressively misaligned

David Johnston 16 Feb 2023 12:59 UTC
I agree that “I’m thinking about how to kill you” is not itself a highly concerning phrase. However, I think it’s plausible that an advanced LLM-like AI could hypnotise itself into taking harmful actions.