Algon comments on Paper: Transformers learn in-context by gradient descent