Associating Multi-Scale Receptive Fields for Fine-grained Recognition

05/19/2020
by   Zihan Ye, et al.
0

Extracting and fusing part features have become the key of fined-grained image recognition. Recently, Non-local (NL) module has shown excellent improvement in image recognition. However, it lacks the mechanism to model the interactions between multi-scale part features, which is vital for fine-grained recognition. In this paper, we propose a novel cross-layer non-local (CNL) module to associate multi-scale receptive fields by two operations. First, CNL computes correlations between features of a query layer and all response layers. Second, all response features are weighted according to the correlations and are added to the query features. Due to the interactions of cross-layer features, our model builds spatial dependencies among multi-level layers and learns more discriminative features. In addition, we can reduce the aggregation cost if we set low-dimensional deep layer as query layer. Experiments are conducted to show our model achieves or surpasses state-of-the-art results on three benchmark datasets of fine-grained classification. Our codes can be found at github.com/FouriYe/CNL-ICIP2020.

READ FULL TEXT

page 1

page 3

research
10/31/2018

Compact Generalized Non-local Network

The non-local module is designed for capturing long-range spatio-tempora...
research
05/15/2021

One for All: An End-to-End Compact Solution for Hand Gesture Recognition

The HGR is a quite challenging task as its performance is influenced by ...
research
06/14/2018

Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition

Attention-based learning for fine-grained image recognition remains a ch...
research
07/25/2021

Adaptive Recursive Circle Framework for Fine-grained Action Recognition

How to model fine-grained spatial-temporal dynamics in videos has been a...
research
07/14/2023

Complementary Frequency-Varying Awareness Network for Open-Set Fine-Grained Image Recognition

Open-set image recognition is a challenging topic in computer vision. Mo...
research
06/28/2021

Prior-Induced Information Alignment for Image Matting

Image matting is an ill-posed problem that aims to estimate the opacity ...
research
12/16/2022

DQnet: Cross-Model Detail Querying for Camouflaged Object Detection

Camouflaged objects are seamlessly blended in with their surroundings, w...

Please sign up or login with your details

Forgot password? Click here to reset