philip_b comments on Tensor-Transformer Variants are Surprisingly Performant