Lovre comments on Putting multimodal LLMs to the Tetris test