Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained Visual Classification

02/09/2020
by   Yifeng Ding, et al.
22

Classifying the sub-categories of an object from the same super-category (e.g. bird species, car and aircraft models) in fine-grained visual classification (FGVC) highly relies on discriminative feature representation and accurate region localization. Existing approaches mainly focus on distilling information from high-level features. In this paper, however, we show that by integrating low-level information (e.g. color, edge junctions, texture patterns), performance can be improved with enhanced feature representation and accurately located discriminative regions. Our solution, named Attention Pyramid Convolutional Neural Network (AP-CNN), consists of a) a pyramidal hierarchy structure with a top-down feature pathway and a bottom-up attention pathway, and hence learns both high-level semantic and low-level detailed feature representation, and b) an ROI guided refinement strategy with ROI guided dropblock and ROI guided zoom-in, which refines features with discriminative local regions enhanced and background noises eliminated. The proposed AP-CNN can be trained end-to-end, without the need of additional bounding box/part annotations. Extensive experiments on three commonly used FGVC datasets (CUB-200-2011, Stanford Cars, and FGVC-Aircraft) demonstrate that our approach can achieve state-of-the-art performance. Code available at <http://dwz1.cc/ci8so8a>

READ FULL TEXT

page 1

page 8

research
06/21/2021

Cross-layer Navigation Convolutional Neural Network for Fine-grained Visual Classification

Fine-grained visual classification (FGVC) aims to classify sub-classes o...
research
11/19/2019

Constrained R-CNN: A general image manipulation detection model

Recently, deep learning-based models have exhibited remarkable performan...
research
06/07/2021

Channel DropBlock: An Improved Regularization Method for Fine-Grained Visual Classification

Classifying the sub-categories of an object from the same super-category...
research
03/08/2021

Interpretable Attention Guided Network for Fine-grained Visual Classification

Fine-grained visual classification (FGVC) is challenging but more critic...
research
03/19/2019

3DCarRecog: Car Recognition Using 3D Bounding Box

We present a novel learning framework for vehicle recognition from a sin...
research
12/12/2020

Fine-grained Classification via Categorical Memory Networks

Motivated by the desire to exploit patterns shared across classes, we pr...
research
03/19/2019

Geometry-constrained Car Recognition Using a 3D Perspective Network

We present a novel learning framework for vehicle recognition from a sin...

Please sign up or login with your details

Forgot password? Click here to reset