benjamin ar comments on Tensor-Transformer Variants are Surprisingly Performant