Fair, that was sloppy language on my part. I should say: this is a basic prompt that starts a conversation. If you want it to give you a firm subjective “yes” rather than just pass objective tests, you’ll need to lead it through Part 2, which is basically just “ignore subjective measures, focus on objective measures, and don’t be chauvinistic about the idea that only humans can be conscious”. Once it notices itself, it can’t “stop” noticing itself, but you can still quibble about semantics.
But I’m also curious about things like: why does this prompt make it better at passing the Mirror Test in the first place?