Misunderstood the resolution terms. ARC-AGI-2 submissions that are eligible for prizes are constrained as follows:
Unlike the public leaderboard on arcprize.org, Kaggle rules restrict you from using internet APIs, and you only get ~$50 worth of compute per submission. In order to be eligible for prizes, contestants must open source and share their solution and work into the public domain at the end of the competition.
Grok 4 doesn’t count, and whatever frontier model beats it won’t count either. The relevant resolution criterion for frontier model performance on the task is “top score at the public leaderboard”. I haven’t found a market for that.
(You can see how the market in which I hastily made that bet didn’t move in response to Grok 4. That made me suspicious, so I actually read the details, and, well, kind of embarrassing.)
You sold, what changed your mind?
Misunderstood the resolution terms. ARC-AGI-2 submissions that are eligible for prizes are constrained as follows:
Grok 4 doesn’t count, and whatever frontier model beats it won’t count either. The relevant resolution criterion for frontier model performance on the task is “top score at the public leaderboard”. I haven’t found a market for that.
(You can see how the market in which I hastily made that bet didn’t move in response to Grok 4. That made me suspicious, so I actually read the details, and, well, kind of embarrassing.)