Unfortunately, Gemini 3 Pro without the Deep Think optionmanaged to one-shot the problems 1,3,4,5 of the IMO 2025. I doubt that if we prompt the system to solve the problems one by one, then we won’t obtain the result of solving all but the Problem 6.
EDIT: fortunately, solving all the problems was a failure. Unfortunately, prompting the model to solve them one by one saw a great success of solving the problems 1, 3 and 5, failing the problem 2, encountered an error doing Problem 4.
That’s good (assuming no contamination, of course (I don’t expect it to break instructions not to search, but it could have seen them at some of the training phases)).
But this will be possible to double-check in the future with novel problems.
(I assume someone checked the correctness of these versions of solutions; this is just a conversation, but someone needs to assert checking the details.)
Unfortunately, Gemini 3 Pro without the Deep Think option managed to one-shot the problems 1,3,4,5 of the IMO 2025. I doubt that if we prompt the system to solve the problems one by one, then we won’t obtain the result of solving all but the Problem 6.
EDIT: fortunately, solving all the problems was a failure. Unfortunately, prompting the model to solve them one by one saw a great success of solving the problems 1, 3 and 5, failing the problem 2, encountered an error doing Problem 4.
That’s good (assuming no contamination, of course (I don’t expect it to break instructions not to search, but it could have seen them at some of the training phases)).
But this will be possible to double-check in the future with novel problems.
(I assume someone checked the correctness of these versions of solutions; this is just a conversation, but someone needs to assert checking the details.)