So how could I have thought that faster might actually be a sensible training trick for reasoning models?