The Application of Two-level Attention Models in Deep Convolutional Neural Network for Fine-grained Image Classification

11/24/2014
by   Tianjun Xiao, et al.
0

Fine-grained classification is challenging because categories can only be discriminated by subtle and local differences. Variances in the pose, scale or rotation usually make the problem more difficult. Most fine-grained classification systems follow the pipeline of finding foreground object or object parts (where) to extract discriminative features (what). In this paper, we propose to apply visual attention to fine-grained classification task using deep neural network. Our pipeline integrates three types of attention: the bottom-up attention that propose candidate patches, the object-level top-down attention that selects relevant patches to a certain object, and the part-level top-down attention that localizes discriminative parts. We combine these attentions to train domain-specific deep nets, then use it to improve both the what and where aspects. Importantly, we avoid using expensive annotations like bounding box or part information from end-to-end. The weak supervision constraint makes our work easier to generalize. We have verified the effectiveness of the method on the subsets of ILSVRC2012 dataset and CUB200_2011 dataset. Our pipeline delivered significant improvements and achieved the best accuracy under the weakest supervision condition. The performance is competitive against other methods that rely on additional annotations.

READ FULL TEXT

page 1

page 4

research
04/06/2017

Object-Part Attention Model for Fine-grained Image Classification

Fine-grained image classification is to recognize hundreds of subcategor...
research
05/22/2020

Focus Longer to See Better:Recursively Refined Attention for Fine-Grained Image Classification

Deep Neural Network has shown great strides in the coarse-grained image ...
research
11/29/2016

Weakly-supervised Discriminative Patch Learning via CNN for Fine-grained Recognition

Research on fine-grained recognition has recently shifted from multistag...
research
02/04/2021

Mask guided attention for fine-grained patchy image classification

In this work, we present a novel mask guided attention (MGA) method for ...
research
08/16/2021

WikiChurches: A Fine-Grained Dataset of Architectural Styles with Real-World Challenges

We introduce a novel dataset for architectural style classification, con...
research
12/11/2019

Fine-grained Classification of Rowing teams

Fine-grained classification tasks such as identifying different breeds o...
research
08/01/2023

Fine-Grained Sports, Yoga, and Dance Postures Recognition: A Benchmark Analysis

Human body-pose estimation is a complex problem in computer vision. Rece...

Please sign up or login with your details

Forgot password? Click here to reset