
Fine-Grained Visual Recognition with Batch Confusion Norm

by Yen-Chi Hsu, et al.

We introduce a regularization concept based on the proposed Batch Confusion Norm (BCN) to address Fine-Grained Visual Classification (FGVC). The FGVC problem is characterized by two properties, significant inter-class similarity and significant intra-class variation, which make learning an effective FGVC classifier a challenging task. Inspired by the use of pairwise confusion energy as a regularization mechanism, we develop the BCN technique to improve FGVC learning by imposing class prediction confusion on each training batch, thereby alleviating the overfitting that can result from exploiting image features of fine details. In addition, our method is implemented with an attention-gated CNN model, boosted by the incorporation of Atrous Spatial Pyramid Pooling (ASPP) to extract discriminative features and proper attention. To demonstrate the usefulness of our method, we report state-of-the-art results on several benchmark FGVC datasets, along with comprehensive ablation comparisons.
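The abstract does not state the BCN formula, so the sketch below does not implement BCN. It illustrates only the simpler pairwise confusion idea that the abstract cites as inspiration, under the assumption that confusion is measured as the mean squared Euclidean distance between the predicted class distributions of every pair of samples in a batch. The function names (`pairwise_confusion`, `regularized_loss`) and the `weight` parameter are hypothetical and chosen for illustration.

```python
import math
from itertools import combinations


def softmax(logits):
    """Convert one row of logits into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def pairwise_confusion(batch_logits):
    """Mean squared Euclidean distance between the predicted
    distributions of all sample pairs in the batch (assumed form).
    Smaller values mean the predictions are more alike."""
    probs = [softmax(row) for row in batch_logits]
    pairs = list(combinations(probs, 2))
    if not pairs:  # a batch of one sample has no pairs
        return 0.0
    total = sum(
        sum((a - b) ** 2 for a, b in zip(p, q)) for p, q in pairs
    )
    return total / len(pairs)


def cross_entropy(batch_logits, labels):
    """Average cross-entropy of the batch against integer labels."""
    probs = [softmax(row) for row in batch_logits]
    return -sum(math.log(p[y]) for p, y in zip(probs, labels)) / len(labels)


def regularized_loss(batch_logits, labels, weight=0.1):
    """Classification loss plus a weighted batch confusion penalty.
    `weight` is a hypothetical hyperparameter, not a value from the paper."""
    return cross_entropy(batch_logits, labels) + weight * pairwise_confusion(
        batch_logits
    )
```

Minimizing the penalty pulls the predicted distributions within a batch toward one another, which discourages over-confident predictions driven by fine details. The cross-entropy term keeps the classifier discriminative, so `weight` trades off the two effects.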



