Rauno Arike comments on OpenAI Claims IMO Gold Medal

Rauno Arike 19 Jul 2025 13:15 UTC
12 points
The writing style looks fairly similar to the examples shown in Baker et al. (2025), so it seems plausible that this style is a general consequence of doing a lot of RL training rather than something specific to the methodology used for this model. It’s still concerning, but I’m happy that it doesn’t look noticeably less readable than the examples in the Baker et al. paper.