Why do deep convolutional networks generalize so poorly to small image transformations?

05/30/2018
by   Aharon Azulay, et al.
4

Deep convolutional network architectures are often assumed to guarantee generalization for small image translations and deformations. In this paper we show that modern CNNs (VGG16, ResNet50, and InceptionResNetV2) can drastically change their output when an image is translated in the image plane by a few pixels, and that this failure of generalization also happens with other realistic small image transformations. Furthermore, the deeper the network the more we see these failures to generalize. We show that these failures are related to the fact that the architecture of modern CNNs ignores the classical sampling theorem so that generalization is not guaranteed. We also show that biases in the statistics of commonly used image datasets makes it unlikely that CNNs will learn to be invariant to these transformations. Taken together our results suggest that the performance of CNNs in object recognition falls far short of the generalization capabilities of humans.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 7

research
06/15/2018

Seeing Neural Networks Through a Box of Toys: The Toybox Dataset of Visual Object Transformations

Deep convolutional neural networks (CNNs) have enjoyed tremendous succes...
research
02/12/2021

A Too-Good-to-be-True Prior to Reduce Shortcut Reliance

Despite their impressive performance in object recognition and other tas...
research
04/09/2015

What Do Deep CNNs Learn About Objects?

Deep convolutional neural networks learn extremely powerful image repres...
research
01/31/2022

Rigidity Preserving Image Transformations and Equivariance in Perspective

We characterize the class of image plane transformations which realize r...
research
06/29/2023

Restore Translation Using Equivariant Neural Networks

Invariance to spatial transformations such as translations and rotations...
research
12/20/2014

Permutohedral Lattice CNNs

This paper presents a convolutional layer that is able to process sparse...
research
04/25/2019

Making Convolutional Networks Shift-Invariant Again

Modern convolutional networks are not shift-invariant, as small input sh...

Please sign up or login with your details

Forgot password? Click here to reset