Interpretable Attention Guided Network for Fine-grained Visual Classification

03/08/2021
by   Zhenhuan Huang, et al.

Fine-grained visual classification (FGVC) is challenging, yet more critical than traditional classification tasks. It requires distinguishing between subcategories that show inherently subtle intra-class object variations. Previous works focus on enhancing feature representation by using multiple granularities and discriminative regions, located through attention strategies or bounding boxes. However, these methods rely heavily on deep neural networks, which lack interpretability. We propose an Interpretable Attention Guided Network (IAGN) for fine-grained visual classification. The contributions of our method include: i) an attention guided framework that guides the network to extract discriminative regions in an interpretable way; ii) a progressive training mechanism that distills knowledge stage by stage to fuse features of various granularities; iii) the first interpretable FGVC method with competitive performance on several standard FGVC benchmark datasets.

research
03/08/2020

Fine-Grained Visual Classification via Progressive Multi-Granularity Training of Jigsaw Patches

Fine-grained visual classification (FGVC) is much more challenging than ...
research
01/21/2021

Progressive Co-Attention Network for Fine-grained Visual Classification

Fine-grained visual classification aims to recognize images belonging to...
research
12/08/2021

Progressive Multi-stage Interactive Training in Mobile Network for Fine-grained Recognition

Fine-grained Visual Classification (FGVC) aims to identify objects from ...
research
06/04/2021

Improve the Interpretability of Attention: A Fast, Accurate, and Interpretable High-Resolution Attention Model

The prevalence of employing attention mechanisms has brought along conce...
research
12/06/2018

Guided Zoom: Questioning Network Evidence for Fine-grained Classification

We propose Guided Zoom, an approach that utilizes spatial grounding to m...
research
06/07/2023

Manga Rescreening with Interpretable Screentone Representation

The process of adapting or repurposing manga pages is a time-consuming t...
research
02/09/2020

Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained Visual Classification

Classifying the sub-categories of an object from the same super-category...
