Deep Aggregation of Regional Convolutional Activations for Content Based Image Retrieval

09/20/2019
by Konstantin Schall, et al.

A key challenge in deep-learning-based image retrieval remains the aggregation of convolutional activations into a single, highly representative feature vector. Ideally, this descriptor should encode semantic, spatial and low-level information. Although off-the-shelf pre-trained neural networks already produce good representations when combined with aggregation methods, appropriate fine-tuning for the task of image retrieval has been shown to boost retrieval performance significantly. In this paper, we present a simple yet effective supervised aggregation method built on top of existing regional pooling approaches. In addition to the maximum activation of a given region, we calculate the regional average activations of the extracted feature maps. Weights for each of the pooled feature vectors are then learned, and the pooled vectors are combined by weighted aggregation into a single feature vector. Furthermore, we apply our newly proposed NRA loss function for deep metric learning both to fine-tune the backbone neural network and to learn the aggregation weights. Our method achieves state-of-the-art results on the INRIA Holidays data set and competitive results on the Oxford Buildings and Paris data sets, while significantly reducing training time.
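The aggregation step described in the abstract can be illustrated with a minimal NumPy sketch. It pools the maximum and the average activation of each region and combines the pooled vectors with per-vector weights. Several details are assumptions made only for illustration and are not taken from the paper: the region layout (`grid_regions`, a hypothetical grid plus the whole map), the L2 normalization steps, and the uniform placeholder weights. In the described method the weights are learned with the NRA loss, which is not specified in the abstract and is therefore not implemented here.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors along `axis` to unit L2 norm (assumed post-processing)."""
    norm = np.linalg.norm(x, axis=axis, keepdims=True)
    return x / np.maximum(norm, eps)


def grid_regions(height, width, cells=2):
    """Hypothetical region layout: the whole map plus a cells x cells grid."""
    regions = [(0, height, 0, width)]
    ys = np.linspace(0, height, cells + 1, dtype=int)
    xs = np.linspace(0, width, cells + 1, dtype=int)
    for i in range(cells):
        for j in range(cells):
            regions.append((ys[i], ys[i + 1], xs[j], xs[j + 1]))
    return regions


def aggregate(feature_map, regions, weights):
    """Weighted aggregation of regional max and average activations.

    feature_map: array of shape (channels, height, width)
    regions:     list of (y0, y1, x0, x1) boxes on the feature map
    weights:     array of shape (2 * len(regions),), one weight per pooled vector
    """
    pooled = []
    for y0, y1, x0, x1 in regions:
        patch = feature_map[:, y0:y1, x0:x1]
        pooled.append(patch.max(axis=(1, 2)))   # regional maximum activation
        pooled.append(patch.mean(axis=(1, 2)))  # regional average activation
    pooled = l2_normalize(np.stack(pooled))     # shape (2 * regions, channels)
    descriptor = (weights[:, None] * pooled).sum(axis=0)
    return l2_normalize(descriptor)


rng = np.random.default_rng(0)
fmap = rng.random((8, 6, 6))                    # stand-in for backbone activations
regions = grid_regions(6, 6)
# Placeholder weights; the paper learns these, here they are simply uniform.
weights = np.full(2 * len(regions), 1.0 / (2 * len(regions)))
descriptor = aggregate(fmap, regions, weights)
```

The result is a single vector with one entry per channel, regardless of how many regions are pooled, which is what makes the descriptor usable for nearest-neighbour retrieval.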

