The “million token” recurrent memory transformer was first published in July 2022. The new paper is just an investigation of whether the method can also be used for BERT-like encoder models.
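The core mechanism is simple enough to sketch. Below is a rough, made-up illustration of the general recurrent-memory idea (not the authors' code, and it simplifies the layout: the actual RMT for encoder models places memory tokens at both ends of each segment). A long input is split into segments, learned memory tokens are attached to each segment, and the memory slots coming out of one segment are fed into the next, so full attention only ever runs over one window at a time.

```python
import torch
import torch.nn as nn

class RecurrentMemorySketch(nn.Module):
    """Toy sketch of segment-level recurrence with memory tokens (sizes are arbitrary)."""
    def __init__(self, d_model=256, n_memory=16, n_layers=2, n_heads=4):
        super().__init__()
        self.memory = nn.Parameter(torch.randn(n_memory, d_model))  # initial learned memory tokens
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)
        self.n_memory = n_memory

    def forward(self, segments):
        # segments: list of tensors, each (batch, seg_len, d_model)
        batch = segments[0].size(0)
        mem = self.memory.unsqueeze(0).expand(batch, -1, -1)
        outputs = []
        for seg in segments:
            x = torch.cat([mem, seg], dim=1)       # memory tokens + current segment
            h = self.encoder(x)                    # attention is quadratic only in the window size
            mem = h[:, :self.n_memory]             # updated memory carried to the next segment
            outputs.append(h[:, self.n_memory:])   # per-token outputs for this segment
        return torch.cat(outputs, dim=1), mem
```

So the “million tokens” comes from chaining many such windows through the memory, not from attending over a million tokens at once.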
Given that there have been a ton of papers that “solved” the quadratic bottleneck, I wouldn’t hold my breath.