Exploring Corruption Robustness: Inductive Biases in Vision Transformers and MLP-Mixers

06/24/2021 ∙ by Katelyn Morrison, et al. ∙ 0

Recently, vision transformers and MLP-based models have been developed in order to address some of the prevalent weaknesses in convolutional neural networks. Due to the novelty of transformers being used in this domain along with the self-attention mechanism, it remains unclear to what degree these architectures are robust to corruptions. Despite some works proposing that data augmentation remains essential for a model to be robust against corruptions, we propose to explore the impact that the architecture has on corruption robustness. We find that vision transformer architectures are inherently more robust to corruptions than the ResNet-50 and MLP-Mixers. We also find that vision transformers with 5 times fewer parameters than a ResNet-50 have more shape bias. Our code is available to reproduce.



There are no comments yet.


page 1

page 2

page 3

page 4

Code Repositories


We investigated corruption robustness across different architectures including Convolutional Neural Networks, Vision Transformers, and the MLP-Mixer.

view repo
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.