I think the benchmarks give a misleading impression of the capabilities of AI. It makes it seem like they’re on the verge of being as smart as humans. It makes it sound like they’re ready to take on a bunch of economically valuable activity that they’re not, leading to the issues currently happening with bosses making their employees use LLMs, for example.
I think the benchmarks give a misleading impression of the capabilities of AI. It makes it seem like they’re on the verge of being as smart as humans. It makes it sound like they’re ready to take on a bunch of economically valuable activity that they’re not, leading to the issues currently happening with bosses making their employees use LLMs, for example.