MultiGrain: a unified image embedding for classes and instances

02/14/2019
by   Maxim Berman, et al.
0

MultiGrain is a network architecture producing compact vector representations that are suited both for image classification and particular object retrieval. It builds on a standard classification trunk. The top of the network produces an embedding containing coarse and fine-grained information, so that images can be recognized based on the object class, particular object, or if they are distorted copies. Our joint training is simple: we minimize a cross-entropy loss for classification and a ranking loss that determines if two images are identical up to data augmentation, with no need for additional labels. A key component of MultiGrain is a pooling layer that allow us to take advantage of high-resolution images with a network trained at a lower resolution. When fed to a linear classifier, the learned embeddings provide state-of-the-art classification accuracy. For instance, we obtain 79.3 accuracy with a ResNet-50 learned on Imagenet, which is a +1.7 improvement over the AutoAugment method. When compared with the cosine similarity, the same embeddings perform on par with the state-of-the-art for image retrieval at moderate resolutions.

READ FULL TEXT

page 1

page 6

research
11/25/2020

Grafit: Learning fine-grained image representations with coarse labels

This paper tackles the problem of learning a finer representation than t...
research
07/19/2016

Information-theoretical label embeddings for large-scale image classification

We present a method for training multi-label, massively multi-class imag...
research
04/18/2016

Selective Convolutional Descriptor Aggregation for Fine-Grained Image Retrieval

Deep convolutional neural network models pre-trained for the ImageNet cl...
research
07/29/2021

A Similarity Measure of Histopathology Images by Deep Embeddings

Histopathology digital scans are large-size images that contain valuable...
research
01/11/2019

Retrieving Similar E-Commerce Images Using Deep Learning

In this paper, we propose a deep convolutional neural network for learni...
research
06/14/2019

Fixing the train-test resolution discrepancy

Data-augmentation is key to the training of neural networks for image cl...
research
09/25/2019

Beyond image classification: zooplankton identification with deep vector space embeddings

Zooplankton images, like many other real world data types, have intrinsi...

Please sign up or login with your details

Forgot password? Click here to reset