Fine-tuning CNN Image Retrieval with No Human Annotation

11/03/2017
by   Filip Radenovic, et al.
0

Image descriptors based on activations of Convolutional Neural Networks (CNNs) have become dominant in image retrieval due to their discriminative power, compactness of the representation, and the efficiency of search. Training of CNNs, either from scratch or fine-tuning, requires a large amount of annotated data, where high quality of the annotation is often crucial. In this work, we propose to fine-tune CNNs for image retrieval on a large collection of unordered images in a fully automatic manner. Reconstructed 3D models, obtained by the state-of-the-art retrieval and structure-from-motion methods, guide the selection of the training data. We show that both hard positive and hard negative examples, selected by exploiting the geometry and the camera positions available from the 3D models, enhance the performance in particular object retrieval. CNN descriptor whitening discriminatively learned from the same training data outperforms the commonly used PCA whitening. We propose a novel trainable Generalized-Mean (GeM) pooling layer that generalizes max and average pooling and show that it boosts retrieval performance. Applying the proposed method on VGG network achieves state-of-the-art performance on standard benchmarks: Oxford Buildings, Paris, and Holidays datasets.

READ FULL TEXT

page 2

page 4

page 5

page 7

page 11

research
04/08/2016

CNN Image Retrieval Learns from BoW: Unsupervised Fine-Tuning with Hard Examples

Convolutional Neural Networks (CNNs) achieve state-of-the-art performanc...
research
11/26/2018

Matchable Image Retrieval by Learning from Surface Reconstruction

Convolutional Neural Networks (CNNs) have achieved superior performance ...
research
11/01/2018

Attention-aware Generalized Mean Pooling for Image Retrieval

It has been shown that image descriptors extracted by convolutional neur...
research
02/05/2020

Enhancing Feature Invariance with Learned Image Transformations for Image Retrieval

Off-the-shelf convolutional neural network features achieve state-of-the...
research
05/08/2022

Adversarial Learning of Hard Positives for Place Recognition

Image retrieval methods for place recognition learn global image descrip...
research
06/22/2018

An accurate retrieval through R-MAC+ descriptors for landmark recognition

The landmark recognition problem is far from being solved, but with the ...
research
06/08/2018

DeepFirearm: Learning Discriminative Feature Representation for Fine-grained Firearm Retrieval

There are great demands for automatically regulating inappropriate appea...

Please sign up or login with your details

Forgot password? Click here to reset