The number of different benchmarks and metrics we are using to understand each new model is crazy. I’m so confused. The exec summary helps, but... I don’t think the relative difference between models is big enough to justify switching from the one you’re currently used to.