Cascade one-vs-rest detection network for fine-grained recognition without part annotations

02/28/2017
by Long Chen, et al.

Fine-grained recognition is a challenging task because of the small inter-category variances. Most top-performing fine-grained recognition methods leverage object parts for better performance, and therefore require part annotations, which are extremely expensive to obtain. In this paper, we propose a novel cascaded deep CNN detection framework for fine-grained recognition that is trained to detect the whole object without considering parts. However, most current top-performing detection networks use an (N+1)-class softmax loss (N object categories plus background), and the background category, which has far more training samples, dominates the feature learning process, so the learned features are poorly suited to object categories with fewer samples. To bridge this gap, we introduce a cascaded structure to eliminate background and exploit a one-vs-rest loss to capture more minute variances among different subordinate categories. Experiments show that our proposed recognition framework achieves performance comparable to state-of-the-art part-free fine-grained recognition methods on the CUB-200-2011 Bird dataset. Moreover, our method even outperforms most part-based methods while requiring no part annotations at the training stage and no annotations of any kind at the test stage.
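To make the contrast between the two losses concrete, here is a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact form of the one-vs-rest loss, so the version below assumes a common formulation (one independent logistic loss per object category, summed), and the function names `softmax_cross_entropy` and `one_vs_rest_loss` are hypothetical.

```python
import math


def softmax_cross_entropy(logits, target):
    """Multi-class softmax loss: all classes share one normalizer,
    so every class (including background, if present) competes."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]


def one_vs_rest_loss(logits, target):
    """Assumed one-vs-rest form: a separate binary logistic loss per
    category, with the target class as positive and the rest negative."""
    total = 0.0
    for k, x in enumerate(logits):
        sign = 1.0 if k == target else -1.0
        z = -sign * x
        # softplus(z) = log(1 + exp(z)) = -log(sigmoid(sign * x)), computed stably
        total += max(z, 0.0) + math.log1p(math.exp(-abs(z)))
    return total
```

In this sketch each category score is judged on its own rather than through a shared normalizer, which is one plausible reading of how a one-vs-rest loss could separate closely related subordinate categories once the cascade has already removed background regions.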

research
07/31/2018

Improving the Annotation of DeepFashion Images for Fine-grained Attribute Recognition

DeepFashion is a widely used clothing dataset with 50 categories and mor...
research
12/01/2022

On Utilizing Relationships for Transferable Few-Shot Fine-Grained Object Detection

State-of-the-art object detectors are fast and accurate, but they requir...
research
10/16/2019

RGB-D Individual Segmentation

Fine-grained recognition task deals with sub-category classification pro...
research
01/17/2021

Improving Apparel Detection with Category Grouping and Multi-grained Branches

Training an accurate object detector is expensive and time-consuming. On...
research
03/09/2020

Cascaded Human-Object Interaction Recognition

Rapid progress has been witnessed for human-object interaction (HOI) rec...
research
06/27/2022

PARTICUL: Part Identification with Confidence measure using Unsupervised Learning

In this paper, we present PARTICUL, a novel algorithm for unsupervised l...
research
12/08/2021

Progressive Multi-stage Interactive Training in Mobile Network for Fine-grained Recognition

Fine-grained Visual Classification (FGVC) aims to identify objects from ...
