Let’s stop making “Intelligence scale” graphs with humans and AI

You’ve probably seen this:

or this:

Shulman and Yudkowsky on AI progress — LessWrong

Or something similar to these examples.

Let’s stop making and spreading these.

Recently, I asked Gemini 2.5 Pro to write a text with precisely 269 words (and even specified that spaces and punctuation don’t count as words), and it gave me a text with 401 words. Of course, there are lots of other examples where LLMs fail in surprising ways, but I like this one because it’s super simple. At the same time, Gemini can write Python code and speak dozens of languages and can most likely beat me at GeoGuessr. Yet at the same time, it sucks at Pokemon.

This suggests that AI is developing in ways that are deeply inhuman. Can you imagine a human who can write you Python code, then Rust code, then write you a letter in German, then write you a letter in Japanese...and then cannot beat* takes hundreds of hours to beat Pokemon (even when you’re practically holding his hand during every step), can’t count the number of words in the text that he just wrote or write a story without mixing up character names/​ages after the first 10 pages, and can’t order pizza? Can you even imagine a hypothetical environment where a human could grow up to become like that? Even if some comic book crazy scientist wanted to create a human like that on purpose by raising him in a “The Truman Show”-esque dome where everyone is a paid actor, I still don’t think he could succeed.

Nothing like this exists in nature. There is no way to put humans (or animals, for that matter) and AI on the same scale in a coherent way. At least not if the scale has only one dimension.

I think most people, including myself, were expecting that AI (LLMs in particular, I mean) would be progressing at the same rate across all tasks. If that was the case, then putting humans and AI on the same scale would make sense. But we weren’t expecting that AI would be comparable to humans or even better than humans at some tasks while simultaneously being utterly hopeless at other (even closely related!) tasks.

*edit: Gemini has actually finished Pokemon, which I didn’t realize when writing this post. My bad.