Diversified Visual Attention Networks for Fine-Grained Object Classification

06/28/2016
by   Bo Zhao, et al.
0

Fine-grained object classification is a challenging task due to the subtle inter-class difference and large intra-class variation. Recently, visual attention models have been applied to automatically localize the discriminative regions of an image for better capturing critical difference and demonstrated promising performance. However, without consideration of the diversity in attention process, most of existing attention models perform poorly in classifying fine-grained objects. In this paper, we propose a diversified visual attention network (DVAN) to address the problems of fine-grained object classification, which substan- tially relieves the dependency on strongly-supervised information for learning to localize discriminative regions compared with attentionless models. More importantly, DVAN explicitly pursues the diversity of attention and is able to gather discriminative information to the maximal extent. Multiple attention canvases are generated to extract convolutional features for attention. An LSTM recurrent unit is employed to learn the attentiveness and discrimination of attention canvases. The proposed DVAN has the ability to attend the object from coarse to fine granularity, and a dynamic internal representation for classification is built up by incrementally combining the information from different locations and scales of the image. Extensive experiments con- ducted on CUB-2011, Stanford Dogs and Stanford Cars datasets have demonstrated that the proposed diversified visual attention networks achieve competitive performance compared to the state- of-the-art approaches, without using any prior knowledge, user interaction or external resource in training or testing.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

page 7

page 9

page 11

research
09/06/2019

Coarse2Fine: A Two-stage Training Method for Fine-grained Visual Classification

Small inter-class and large intra-class variations are the main challeng...
research
03/22/2016

Fully Convolutional Attention Networks for Fine-Grained Recognition

Fine-grained recognition is challenging due to its subtle local inter-cl...
research
09/25/2019

Attention Convolutional Binary Neural Tree for Fine-Grained Visual Categorization

Fine-grained visual categorization (FGVC) is an important but challengin...
research
07/23/2019

Few-shot Learning for Domain-specfic Fine-grained Image Classfication

Learning to recognize novel visual categories from a few examples is a c...
research
01/18/2019

Multisource Region Attention Network for Fine-Grained Object Recognition in Remote Sensing Imagery

Fine-grained object recognition concerns the identification of the type ...
research
05/11/2023

Salient Mask-Guided Vision Transformer for Fine-Grained Classification

Fine-grained visual classification (FGVC) is a challenging computer visi...
research
09/28/2022

SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval

In this paper, we propose Suppression-Enhancing Mask based attention and...

Please sign up or login with your details

Forgot password? Click here to reset