Here’s another paper on small / non-robust features, but rather specific to patch-based vision transformers: Understanding and Improving Robustness of Vision Transformers through Patch-based Negative Augmentation^ This work is very specific to patch-based methods. Whether patches are here to stay and for how long is unclear to me, but right now they seem to be on an ascendancy (?).
Here’s another paper on small / non-robust features, but rather specific to patch-based vision transformers:
Understanding and Improving Robustness of Vision Transformers through Patch-based Negative Augmentation
^ This work is very specific to patch-based methods. Whether patches are here to stay and for how long is unclear to me, but right now they seem to be on an ascendancy (?).