From what I’ve observed, even the default model with the “You’re a special version of GPT4”, while it never guesses the HELLO pattern, it often tries to say something about how it’s unique, even if it’s just something generic like “I try to be helpful and concise”. Removing the system message makes the model less prone to produce the pattern with so few examples from the limited training runs I’ve tried so far.
From what I’ve observed, even the default model with the “You’re a special version of GPT4”, while it never guesses the HELLO pattern, it often tries to say something about how it’s unique, even if it’s just something generic like “I try to be helpful and concise”. Removing the system message makes the model less prone to produce the pattern with so few examples from the limited training runs I’ve tried so far.