Expertium comments on AI 2027: What Superintelligence Looks Like

Expertium 19 Apr 2025 22:04 UTC
1 point
−1
they put substantial probability on the trend being superexponential
I think that’s too speculative.
I also think that around 25-50% of the questions are impossible or mislabeled.
I wouldn’t be surprised if 3-5% of questions were mislabeled or impossible to answer, but 25-50%? You’re basically saying that HLE is worthless. I’m curious why. I mean, I don’t know much about the people who had to sift through all of the submissions, but I’d be surprised if they failed that badly. Plus, there was a “bug bounty” aimed at improving the quality of the dataset.
TBC, my median to superhuman coder is more like 2031.
Guess I’m a pessimist then, mine is more like 2034.
- ryan_greenblatt 19 Apr 2025 22:15 UTC
  9 points
  4
  Parent
  
  I wouldn’t be surprised if 3-5% of questions were mislabeled or impossible to answer, but 25-50%? You’re basically saying that HLE is worthless. I’m curious why.
  
  Various people looked at randomly selected questions and found similar numbers.
  
  (I don’t think the dataset is worthless, I think if you filtered down to the best 25-50% of questions it would be a reasonable dataset with acceptable error rate.)