ACTNET: end-to-end learning of feature activations and aggregation for effective instance image retrieval

07/12/2019
by Syed Sameed Husain, et al.

We propose a novel CNN architecture called ACTNET for robust instance image retrieval from large-scale datasets. Our key innovation is a learnable activation layer designed to improve the signal-to-noise ratio (SNR) of deep convolutional feature maps. This layer works in tandem with multi-stream aggregation, in which complementary deep features from different convolutional layers are transformed and balanced by the activation layer before being aggregated into a global descriptor. Importantly, the learnable parameters of the activation blocks are trained jointly with the CNN parameters, end-to-end, by minimising a triplet loss. The network therefore learns both the CNN filters and their optimal aggregation for the retrieval task. To our knowledge, this is the first time parametric functions have been used to control and learn optimal aggregation. We conduct an in-depth experimental study of three non-linear activation functions: Sine-Hyperbolic, Exponential and modified Weibull. While all three bring significant gains, the Weibull function performs best thanks to its ability to equalise strong activations. The results demonstrate that the activation functions significantly enhance the discriminative power of deep features, leading to state-of-the-art retrieval results.
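The pipeline the abstract describes (parametric activation, aggregation into a global descriptor, triplet loss) can be illustrated with a minimal sketch. Everything specific here is an assumption and not taken from the paper: the activation form `exp(alpha * x) - 1`, the use of sum-pooling with L2 normalisation, the squared-Euclidean triplet loss, and the names `exp_activation`, `aggregate`, `triplet_loss`, `alpha` and `margin` are all hypothetical. In the actual network `alpha` would be a parameter learned by backpropagation; here it is simply a fixed number.

```python
import math


def exp_activation(x, alpha):
    # Hypothetical parametric activation (an assumption, not the paper's formula):
    # maps 0 to 0 and boosts larger responses more strongly as alpha grows.
    return math.exp(alpha * x) - 1.0


def aggregate(feature_map, alpha):
    # feature_map: list of channels, each a list of non-negative activations.
    # Apply the activation, sum-pool each channel, then L2-normalise the result.
    pooled = [sum(exp_activation(v, alpha) for v in channel) for channel in feature_map]
    norm = math.sqrt(sum(p * p for p in pooled)) or 1.0  # avoid division by zero
    return [p / norm for p in pooled]


def triplet_loss(anchor, positive, negative, margin=0.1):
    # Standard hinge-style triplet loss on squared Euclidean distances:
    # the negative should be at least `margin` farther from the anchor
    # than the positive is.
    d_ap = sum((a - p) ** 2 for a, p in zip(anchor, positive))
    d_an = sum((a - n) ** 2 for a, n in zip(anchor, negative))
    return max(0.0, d_ap - d_an + margin)
```

As a usage illustration, `aggregate([[0.0, 1.0], [2.0, 0.5]], alpha=0.5)` yields a two-dimensional unit-norm descriptor, and `triplet_loss` is zero whenever the negative descriptor is already farther from the anchor than the positive by at least the margin.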

research
06/15/2019

REMAP: Multi-layer entropy-guided pooling of dense CNN features for image retrieval

This paper addresses the problem of very large-scale image retrieval, fo...
research
09/20/2019

Deep Aggregation of Regional Convolutional Activations for Content Based Image Retrieval

One of the key challenges of deep learning based image retrieval remains...
research
11/21/2019

DeepLABNet: End-to-end Learning of Deep Radial Basis Networks with Fully Learnable Basis Functions

From fully connected neural networks to convolutional neural networks, t...
research
12/04/2020

DenserNet: Weakly Supervised Visual Localization Using Multi-scale Feature Aggregation

In this work, we introduce a Denser Feature Network (DenserNet) for visu...
research
03/03/2019

MILDNet: A Lightweight Single Scaled Deep Ranking Architecture

Multi-scale deep CNN architecture [1, 2, 3] successfully captures both f...
research
03/20/2018

Adaptive Co-weighting Deep Convolutional Features For Object Retrieval

Aggregating deep convolutional features into a global image vector has a...