Tried it on newer models. GPT-5 continues to get it right, while Gemini 3 hallucinates a few details that can’t be seen clearly in the image. Sonnet 4.5 is off on a few details (but very close!). Opus 4.5 is the first non-OpenAI model to get it completely right.
Edit: With this image, however, none of the models mention that it’s the end to the first stage final boss fight (GPT-5 (both reasoning and non-reasoning) does mention that it is the first stage, but nothing about it being the end of that stage—Opus 4.5 mentions nothing about stages)
Tried it on newer models. GPT-5 continues to get it right, while Gemini 3 hallucinates a few details that can’t be seen clearly in the image. Sonnet 4.5 is off on a few details (but very close!).
Opus 4.5 is the first non-OpenAI model to get it completely right.
Edit: With this image, however, none of the models mention that it’s the end to the first stage final boss fight (GPT-5 (both reasoning and non-reasoning) does mention that it is the first stage, but nothing about it being the end of that stage—Opus 4.5 mentions nothing about stages)