I just reran this test with Gemini 3 Pro Preview in my apartment.
It passed with flying colors. No real mistakes or major inefficiencies. Opus 4.5 performed worse, I didn’t end up running that round to completion.
I will note this is impacted by my place being a little smaller and less messy than what’s in the post, and I’m also not at all a coffee snob, but the actual coffee is serviceable imo.
I can share the Gemini chat if anyone’s interested.
Trying to share a chat out of AI Studio has proved annoying, as it turns out. I copied the transcript and took a full page screen capture instead, but the latter also turned out slightly scuffed with my usual tool. Apologies for the quality.
Thanks! It mostly did better at object recognition than me. I imagine I’d have improved with the full res unscuffed images, but I still don’t think I’d have recognized the fridge so quickly (though admittedly it wasn’t helpful). The only place I thought I’d have done better was when it got confused by the door in the mirror.
I just reran this test with Gemini 3 Pro Preview in my apartment.
It passed with flying colors. No real mistakes or major inefficiencies. Opus 4.5 performed worse, I didn’t end up running that round to completion.
I will note this is impacted by my place being a little smaller and less messy than what’s in the post, and I’m also not at all a coffee snob, but the actual coffee is serviceable imo.
I can share the Gemini chat if anyone’s interested.
I’m interested.
https://drive.google.com/drive/folders/1R_0NeKfGvdSpsR1Mh0FkTj50cvxV20Wa?usp=sharing
Trying to share a chat out of AI Studio has proved annoying, as it turns out. I copied the transcript and took a full page screen capture instead, but the latter also turned out slightly scuffed with my usual tool. Apologies for the quality.
Thanks! It mostly did better at object recognition than me. I imagine I’d have improved with the full res unscuffed images, but I still don’t think I’d have recognized the fridge so quickly (though admittedly it wasn’t helpful). The only place I thought I’d have done better was when it got confused by the door in the mirror.