Cross-layer Attention Network for Fine-grained Visual Categorization

10/17/2022
by   Ranran Huang, et al.
0

Learning discriminative representations for subtle localized details plays a significant role in Fine-grained Visual Categorization (FGVC). Compared to previous attention-based works, our work does not explicitly define or localize the part regions of interest; instead, we leverage the complementary properties of different stages of the network, and build a mutual refinement mechanism between the mid-level feature maps and the top-level feature map by our proposed Cross-layer Attention Network (CLAN). Specifically, CLAN is composed of 1) the Cross-layer Context Attention (CLCA) module, which enhances the global context information in the intermediate feature maps with the help of the top-level feature map, thereby improving the expressive power of the middle layers, and 2) the Cross-layer Spatial Attention (CLSA) module, which takes advantage of the local attention in the mid-level feature maps to boost the feature extraction of local regions at the top-level feature maps. Experimental results show our approach achieves state-of-the-art on three publicly available fine-grained recognition datasets (CUB-200-2011, Stanford Cars and FGVC-Aircraft). Ablation studies and visualizations are provided to understand our approach. Experimental results show our approach achieves state-of-the-art on three publicly available fine-grained recognition datasets (CUB-200-2011, Stanford Cars and FGVC-Aircraft).

READ FULL TEXT

page 6

page 7

research
09/06/2019

Coarse2Fine: A Two-stage Training Method for Fine-grained Visual Classification

Small inter-class and large intra-class variations are the main challeng...
research
10/26/2018

Fine-grained Video Categorization with Redundancy Reduction Attention

For fine-grained categorization tasks, videos could serve as a better so...
research
04/22/2019

Stochastic Region Pooling: Make Attention More Expressive

Global Average Pooling (GAP) is used by default on the channel-wise atte...
research
11/27/2018

Generating Attention from Classifier Activations for Fine-grained Recognition

Recent advances in fine-grained recognition utilize attention maps to lo...
research
10/06/2021

Fully Convolutional Cross-Scale-Flows for Image-based Defect Detection

In industrial manufacturing processes, errors frequently occur at unpred...
research
03/04/2021

Learning Granularity-Aware Convolutional Neural Network for Fine-Grained Visual Classification

Locating discriminative parts plays a key role in fine-grained visual cl...
research
04/25/2018

Cross-media Multi-level Alignment with Relation Attention Network

With the rapid growth of multimedia data, such as image and text, it is ...

Please sign up or login with your details

Forgot password? Click here to reset