Appearance invariance in convolutional networks with neighborhood similarity

07/03/2017
by Tolga Tasdizen, et al.

We present a neighborhood similarity layer (NSL) which induces appearance invariance in a network when used in conjunction with convolutional layers. We are motivated by the observation that, even though convolutional networks have low generalization error, their generalization capability does not extend to samples which are not represented by the training data. For instance, while novel appearances of learned concepts pose no problem for the human visual system, feedforward convolutional networks are generally not successful in such situations. Motivated by the Gestalt principle of grouping with respect to similarity, the proposed NSL transforms its input feature map using the feature vectors at each pixel as a frame of reference, i.e. center of attention, for its surrounding neighborhood. This transformation is spatially varying, hence not a convolution. It is differentiable; therefore, networks including the proposed layer can be trained in an end-to-end manner. We analyze the invariance of NSL to significant changes in appearance that are not represented in the training data. We also demonstrate its advantages for digit recognition, semantic labeling and cell detection problems.
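
The transformation described above can be illustrated with a minimal NumPy sketch. The abstract does not specify the similarity measure, so this sketch assumes cosine similarity between the feature vector at each pixel and the feature vectors of its neighbors. The function name `neighborhood_similarity`, the `radius` parameter, and the zero padding at the border are hypothetical choices made for illustration and are not taken from the paper.

```python
import numpy as np


def neighborhood_similarity(features, radius=1, eps=1e-8):
    """Map an (H, W, C) feature map to (H, W, K) similarities,
    where K = (2 * radius + 1) ** 2 - 1 neighbor offsets.

    Assumption: cosine similarity is used as the similarity measure.
    """
    h, w, _ = features.shape
    # Unit-normalize each pixel's feature vector so dot products are cosines.
    norms = np.linalg.norm(features, axis=-1, keepdims=True)
    unit = features / (norms + eps)
    # Zero-pad spatially so border pixels still have a full neighborhood
    # (out-of-bounds neighbors then yield a similarity of 0).
    padded = np.pad(unit, ((radius, radius), (radius, radius), (0, 0)))
    sims = []
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            if dy == 0 and dx == 0:
                continue  # skip the center pixel itself
            # Feature map shifted by the offset (dy, dx).
            shifted = padded[radius + dy:radius + dy + h,
                             radius + dx:radius + dx + w]
            # Each pixel's own vector is the reference for its neighbor.
            sims.append(np.sum(unit * shifted, axis=-1))
    return np.stack(sims, axis=-1)


rng = np.random.default_rng(0)
x = rng.random((5, 6, 3)) + 0.1          # toy 3-channel feature map
y = neighborhood_similarity(x, radius=1)
y_swapped = neighborhood_similarity(x[..., ::-1])  # reversed channel order
```

Because the reference vector changes from pixel to pixel, the operation is spatially varying rather than a convolution with a fixed kernel. Under the cosine assumption, the output depends only on relations between neighboring feature vectors, so reversing the channel order or rescaling all features by a positive constant leaves it essentially unchanged. This is one simple sense in which such a layer can be insensitive to appearance.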
