Understanding image representations by measuring their equivariance and equivalence

11/21/2014
by   Karel Lenc, et al.
0

Despite the importance of image representations such as histograms of oriented gradients and deep Convolutional Neural Networks (CNN), our theoretical understanding of them remains limited. Aiming at filling this gap, we investigate three key mathematical properties of representations: equivariance, invariance, and equivalence. Equivariance studies how transformations of the input image are encoded by the representation, invariance being a special case where a transformation has no effect. Equivalence studies whether two representations, for example two different parametrisations of a CNN, capture the same visual information or not. A number of methods to establish these properties empirically are proposed, including introducing transformation and stitching layers in CNNs. These methods are then applied to popular representations to reveal insightful aspects of their structure, including clarifying at which layers in a CNN certain geometric invariances are achieved. While the focus of the paper is theoretical, direct applications to structured-output regression are demonstrated too.

READ FULL TEXT

page 1

page 5

page 8

research
11/26/2014

Understanding Deep Image Representations by Inverting Them

Image representations, from SIFT and Bag of Visual Words to Convolutiona...
research
12/07/2015

Visualizing Deep Convolutional Neural Networks Using Natural Pre-Images

Image representations, from SIFT and bag of visual words to Convolutiona...
research
11/09/2020

What Does CNN Shift Invariance Look Like? A Visualization Study

Feature extraction with convolutional neural networks (CNNs) is a popula...
research
09/15/2015

Kernelized Deep Convolutional Neural Network for Describing Complex Images

With the impressive capability to capture visual content, deep convoluti...
research
05/14/2014

Return of the Devil in the Details: Delving Deep into Convolutional Nets

The latest generation of Convolutional Neural Networks (CNN) have achiev...
research
11/27/2014

Visual Representations: Defining Properties and Deep Approximations

Visual representations are defined in terms of minimal sufficient statis...

Please sign up or login with your details

Forgot password? Click here to reset