Graph-based Isometry Invariant Representation Learning

03/01/2017
by   Renata Khasanova, et al.
0

Learning transformation invariant representations of visual data is an important problem in computer vision. Deep convolutional networks have demonstrated remarkable results for image and video classification tasks. However, they have achieved only limited success in the classification of images that undergo geometric transformations. In this work we present a novel Transformation Invariant Graph-based Network (TIGraNet), which learns graph-based features that are inherently invariant to isometric transformations such as rotation and translation of input images. In particular, images are represented as signals on graphs, which permits to replace classical convolution and pooling layers in deep networks with graph spectral convolution and dynamic graph pooling layers that together contribute to invariance to isometric transformations. Our experiments show high performance on rotated and translated images from the test set compared to classical architectures that are very sensitive to transformations in the data. The inherent invariance properties of our framework provide key advantages, such as increased resiliency to data variability and sustained performance with limited training sets.

READ FULL TEXT

page 1

page 9

page 11

research
08/21/2018

Isometric Transformation Invariant Graph-based Deep Neural Network

Learning transformation invariant representations of visual data is an i...
research
06/27/2012

Learning Invariant Representations with Local Transformations

Learning invariant representations is an important problem in machine le...
research
02/04/2015

Learning Local Invariant Mahalanobis Distances

For many tasks and data types, there are natural transformations to whic...
research
04/01/2014

A Deep Representation for Invariance And Music Classification

Representations in the auditory cortex might be based on mechanisms simi...
research
11/29/2021

SPIN: Simplifying Polar Invariance for Neural networks Application to vision-based irradiance forecasting

Translational invariance induced by pooling operations is an inherent pr...
research
11/27/2014

Visual Representations: Defining Properties and Deep Approximations

Visual representations are defined in terms of minimal sufficient statis...
research
11/23/2021

ChebLieNet: Invariant Spectral Graph NNs Turned Equivariant by Riemannian Geometry on Lie Groups

We introduce ChebLieNet, a group-equivariant method on (anisotropic) man...

Please sign up or login with your details

Forgot password? Click here to reset