Representation Learning by Learning to Count

08/22/2017
by   Mehdi Noroozi, et al.

We introduce a novel method for representation learning that uses an artificial supervision signal based on counting visual primitives. This supervision signal is obtained from an equivariance relation, which does not require any manual annotation. We relate transformations of images to transformations of the representations. More specifically, we look for the representation that satisfies such a relation rather than the transformations that match a given representation. In this paper, we use two image transformations in the context of counting: scaling and tiling. The first transformation exploits the fact that the number of visual primitives should be invariant to scale. The second transformation allows us to equate the sum of the numbers of visual primitives over all tiles to the number in the whole image. These two transformations are combined in one constraint and used to train a neural network with a contrastive loss. The proposed task produces representations that perform on par with or exceed the state of the art in transfer learning benchmarks.
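As an illustration only, the combined constraint described above can be sketched in NumPy: the feature vector of a downscaled image should equal the sum of the feature vectors of its tiles, and a contrastive term pushes the features of a different image away from that sum. The abstract does not give the tiling grid, the scale factor, the margin, or the exact form of the loss, so the 2x2 grid, the factor of 2, the squared-error terms, the value `margin=10.0`, and the names `phi`, `tiles_2x2`, `downsample_2x`, and `counting_loss` are all assumptions made for this sketch, not the authors' implementation.

```python
import numpy as np


def tiles_2x2(image):
    """Split an H x W (x C) array into four non-overlapping tiles (assumed 2x2 grid)."""
    h, w = image.shape[0] // 2, image.shape[1] // 2
    return [image[i * h:(i + 1) * h, j * w:(j + 1) * w]
            for i in range(2) for j in range(2)]


def downsample_2x(image):
    """Downscale by a factor of 2 using 2x2 average pooling (one possible choice)."""
    h, w = image.shape[0] // 2 * 2, image.shape[1] // 2 * 2
    img = image[:h, :w]
    pooled = img.reshape(h // 2, 2, w // 2, 2, *img.shape[2:])
    return pooled.mean(axis=(1, 3))


def counting_loss(phi, x, y, margin=10.0):
    """Hypothetical contrastive counting loss.

    phi    : callable mapping an image to a 1-D feature ("count") vector
    x      : image whose tiles and downscaled version should agree
    y      : a different image, used as the contrastive example
    margin : assumed margin value for the contrastive term
    """
    tile_sum = sum(phi(t) for t in tiles_2x2(x))
    # Scaling + tiling constraint: phi(downscaled x) should equal the tile sum.
    pos = np.sum((phi(downsample_2x(x)) - tile_sum) ** 2)
    # Contrastive term: phi(downscaled y) should differ from the tile sum of x.
    neg = np.sum((phi(downsample_2x(y)) - tile_sum) ** 2)
    return pos + max(0.0, margin - neg)


# A feature map that always outputs zeros satisfies the constraint trivially,
# but the contrastive term still charges it the full margin.
zero_phi = lambda img: np.zeros(3)
rng = np.random.default_rng(0)
x, y = rng.random((8, 8)), rng.random((8, 8))
trivial_loss = counting_loss(zero_phi, x, y)
```

With a downscaling factor of 2 and a 2x2 grid, each tile has the same size as the downscaled image, so a single network `phi` can process both. The zero-output example shows why the constraint alone is not enough: without the contrastive term, a constant representation would achieve zero loss.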


