Understanding when spatial transformer networks do not support invariance, and what to do about it

04/24/2020
by   Lukas Finnveden, et al.
0

Spatial transformer networks (STNs) were designed to enable convolutional neural networks (CNNs) to learn invariance to image transformations. STNs were originally proposed to transform CNN feature maps as well as input images. This enables the use of more complex features when predicting transformation parameters. However, since STNs perform a purely spatial transformation, they do not, in the general case, have the ability to align the feature maps of a transformed image with those of its original. STNs are therefore unable to support invariance when transforming CNN feature maps. We present a simple proof for this and study the practical implications, showing that this inability is coupled with decreased classification accuracy. We therefore investigate alternative STN architectures that make use of complex features. We find that while deeper localization networks are difficult to train, localization networks that share parameters with the classification network remain stable as they grow deeper, which allows for higher classification accuracy on difficult datasets. Finally, we explore the interaction between localization network complexity and iterative image alignment.

READ FULL TEXT

page 6

page 7

research
01/14/2020

The problems with using STNs to align CNN feature maps

Spatial transformer networks (STNs) were designed to enable CNNs to lear...
research
04/30/2020

Inability of spatial transformations of CNN feature maps to support invariant recognition

A large number of deep learning architectures use spatial transformation...
research
06/05/2015

Spatial Transformer Networks

Convolutional Neural Networks define an exceptionally powerful class of ...
research
09/19/2022

On the Shift Invariance of Max Pooling Feature Maps in Convolutional Neural Networks

In this paper, we aim to improve the mathematical interpretability of co...
research
08/05/2021

Rotaflip: A New CNN Layer for Regularization and Rotational Invariance in Medical Images

Regularization in convolutional neural networks (CNNs) is usually addres...
research
05/28/2017

Dilated Residual Networks

Convolutional networks for image classification progressively reduce res...
research
07/18/2020

Volumetric Transformer Networks

Existing techniques to encode spatial invariance within deep convolution...

Please sign up or login with your details

Forgot password? Click here to reset