"Mental Rotation" by Optimizing Transforming Distance

06/11/2014
by   Weiguang Ding, et al.
0

The human visual system is able to recognize objects despite transformations that can drastically alter their appearance. To this end, much effort has been devoted to the invariance properties of recognition systems. Invariance can be engineered (e.g. convolutional nets), or learned from data explicitly (e.g. temporal coherence) or implicitly (e.g. by data augmentation). One idea that has not, to date, been explored is the integration of latent variables which permit a search over a learned space of transformations. Motivated by evidence that people mentally simulate transformations in space while comparing examples, so-called "mental rotation", we propose a transforming distance. Here, a trained relational model actively transforms pairs of examples so that they are maximally similar in some feature space yet respect the learned transformational constraints. We apply our method to nearest-neighbour problems on the Toronto Face Database and NORB.

READ FULL TEXT
research
06/11/2019

Learning robust visual representations using data augmentation invariance

Deep convolutional neural networks trained for image object categorizati...
research
04/20/2020

Invariant Integration in Deep Convolutional Feature Space

In this contribution, we show how to incorporate prior knowledge to a de...
research
07/05/2017

Improving Content-Invariance in Gated Autoencoders for 2D and 3D Object Rotation

Content-invariance in mapping codes learned by GAEs is a useful feature ...
research
10/04/2021

Learning Online Visual Invariances for Novel Objects via Supervised and Self-Supervised Training

Humans can identify objects following various spatial transformations su...
research
11/23/2020

Learnable Gabor modulated complex-valued networks for orientation robustness

Robustness to transformation is desirable in many computer vision tasks,...
research
02/17/2017

Dataset Augmentation in Feature Space

Dataset augmentation, the practice of applying a wide array of domain-sp...
research
07/03/2017

Appearance invariance in convolutional networks with neighborhood similarity

We present a neighborhood similarity layer (NSL) which induces appearanc...

Please sign up or login with your details

Forgot password? Click here to reset