Lukas_Gloor comments on nikola’s Shortform

Lukas_Gloor 27 Jan 2025 23:23 UTC
5 points
0
In order to submit a question to the benchmark, people had to run it against the listed LLMs; the question would only advance to the next stage once the LLMs used for this testing got it wrong.