Discriminate-and-Rectify Encoders: Learning from Image Transformation Sets

03/14/2017
by   Andrea Tacchetti, et al.
0

The complexity of a learning task is increased by transformations in the input space that preserve class identity. Visual object recognition for example is affected by changes in viewpoint, scale, illumination or planar transformations. While drastically altering the visual appearance, these changes are orthogonal to recognition and should not be reflected in the representation or feature encoding used for learning. We introduce a framework for weakly supervised learning of image embeddings that are robust to transformations and selective to the class distribution, using sets of transforming examples (orbit sets), deep parametrizations and a novel orbit-based loss. The proposed loss combines a discriminative, contrastive part for orbits with a reconstruction error that learns to rectify orbit transformations. The learned embeddings are evaluated in distance metric-based tasks, such as one-shot classification under geometric transformations, as well as face verification and retrieval under more realistic visual variability. Our results suggest that orbit sets, suitably computed or observed, can be used for efficient, weakly-supervised learning of semantically relevant image embeddings.

READ FULL TEXT

page 1

page 2

page 5

page 6

page 8

page 9

page 10

page 13

research
10/01/2011

Learning image transformations without training examples

The use of image transformations is essential for efficient modeling and...
research
11/24/2021

Distribution Estimation to Automate Transformation Policies for Self-Supervision

In recent visual self-supervision works, an imitated classification obje...
research
07/27/2017

Learning from Video and Text via Large-Scale Discriminative Clustering

Discriminative clustering has been successfully applied to a number of w...
research
10/07/2021

Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions

We introduce the task of weakly supervised learning for detecting human ...
research
04/10/2022

Deep Embeddings for Robust User-Based Amateur Vocal Percussion Classification

Vocal Percussion Transcription (VPT) is concerned with the automatic det...
research
02/12/2023

Contrastive Learning and the Emergence of Attributes Associations

In response to an object presentation, supervised learning schemes gener...
research
08/22/2017

Representation Learning by Learning to Count

We introduce a novel method for representation learning that uses an art...

Please sign up or login with your details

Forgot password? Click here to reset