StanislavKrym comments on DeepSeek v3.2 Is Okay And Cheap But Slow

StanislavKrym 5 Dec 2025 23:59 UTC
0 points
0
Unfortunately, Gemini 3 Pro without the Deep Think option managed to one-shot the problems 1,3,4,5 of the IMO 2025. I doubt that if we prompt the system to solve the problems one by one, then we won’t obtain the result of solving all but the Problem 6.
EDIT: fortunately, solving all the problems was a failure. Unfortunately, prompting the model to solve them one by one saw a great success of solving the problems 1, 3 and 5, failing the problem 2, encountered an error doing Problem 4.
- mishka 6 Dec 2025 1:09 UTC
  2 points
  0
  Parent
  That’s good (assuming no contamination, of course (I don’t expect it to break instructions not to search, but it could have seen them at some of the training phases)).
  
  But this will be possible to double-check in the future with novel problems.
  
  (I assume someone checked the correctness of these versions of solutions; this is just a conversation, but someone needs to assert checking the details.)