Visual Representations: Defining Properties and Deep Approximations

11/27/2014
by Stefano Soatto, et al.

Visual representations are defined in terms of minimal sufficient statistics of visual data, for a class of tasks, that are also invariant to nuisance variability. Minimal sufficiency guarantees that we can store a representation in lieu of raw data with the smallest complexity and no performance loss on the task at hand. Invariance guarantees that the statistic is constant with respect to uninformative transformations of the data. We derive analytical expressions for such representations and show they are related to feature descriptors commonly used in computer vision, as well as to convolutional neural networks. This link highlights the assumptions and approximations that these methods tacitly make and explains empirical practices such as clamping, pooling and joint normalization.


