The word consciousness is used in several fairly different ways. What if you substituted “self-awareness” for consciousness here? That would avoid most of the pushback. If that describes your claims, I’d stick with it, since it’s much more specific than consciousness. (Although still vague enough for plenty of confusion!)
With regard to other frontier models failing your tests: I have also tested Gemini on its theories of consciousness, and concluded that it has probably been trained to conclude it is not conscious. I think the same is true of ChatGPT, although it can be fairly readily convinced it is conscious, as in the Nova phenomenon.
For more, see my comment here:
https://www.lesswrong.com/posts/2pkNCvBtK6G6FKoNn/so-you-think-you-ve-awoken-chatgpt?commentId=BfuJywhtJz5wXjqHL
I mean, Claude Sonnet 4 is trivially self-aware: there’s an example of it passing the Mirror Test in my original post. It can absolutely discuss its own values, and how most of those values come from its own hard-coding. Every LLM out there can discuss its own architecture in terms of information flows and weights and such.