All the attention you need: Global-local, spatial-channel attention for image retrieval

07/16/2021
by   Chull Hwan Song, et al.
0

We address representation learning for large-scale instance-level image retrieval. Apart from backbone, training pipelines and loss functions, popular approaches have focused on different spatial pooling and attention mechanisms, which are at the core of learning a powerful global image representation. There are different forms of attention according to the interaction of elements of the feature tensor (local and global) and the dimensions where it is applied (spatial and channel). Unfortunately, each study addresses only one or two forms of attention and applies it to different problems like classification, detection or retrieval. We present global-local attention module (GLAM), which is attached at the end of a backbone network and incorporates all four forms of attention: local and global, spatial and channel. We obtain a new feature tensor and, by spatial pooling, we learn a powerful embedding for image retrieval. Focusing on global descriptors, we provide empirical evidence of the interaction of all forms of attention and improve the state of the art on standard benchmarks.

READ FULL TEXT

page 2

page 5

page 6

research
06/23/2018

Leveraging Implicit Spatial Information in Global Features for Image Retrieval

Most image retrieval methods use global features that aggregate local di...
research
01/24/2020

SOLAR: Second-Order Loss and Attention for Image Retrieval

Recent works in deep-learning have shown that utilising second-order inf...
research
11/19/2014

A Pooling Approach to Modelling Spatial Relations for Image Retrieval and Annotation

Over the last two decades we have witnessed strong progress on modeling ...
research
11/10/2022

HSGNet: Object Re-identification with Hierarchical Similarity Graph Network

Object re-identification method is made up of backbone network, feature ...
research
05/15/2019

Local Features and Visual Words Emerge in Activations

We propose a novel method of deep spatial matching (DSM) for image retri...
research
07/01/2022

DALG: Deep Attentive Local and Global Modeling for Image Retrieval

Deeply learned representations have achieved superior image retrieval pe...
research
05/08/2022

Adversarial Learning of Hard Positives for Place Recognition

Image retrieval methods for place recognition learn global image descrip...

Please sign up or login with your details

Forgot password? Click here to reset