I got the model up to 3,000 tokens/s on a particularly long/easy query.
As an FYI, there has been other work on large diffusion language models, such as this: https://www.inceptionlabs.ai/introducing-mercury
Yep, I know about earlier work, but I think that one of the top 3 labs taking this seriously is a big sign.
A sign of what? That Google is hedging their bets?
It's a big sign that if there is something here, it’s likely to be discovered. We’re likely to find out in the next few years if this is the future of general purpose AI.