Part Localization using Multi-Proposal Consensus for Fine-Grained Categorization

by   Kevin J. Shih, et al.

We present a simple deep learning framework to simultaneously predict keypoint locations and their respective visibilities and use those to achieve state-of-the-art performance for fine-grained classification. We show that by conditioning the predictions on object proposals with sufficient image support, our method can do well without complicated spatial reasoning. Instead, inference methods with robustness to outliers, yield state-of-the-art for keypoint localization. We demonstrate the effectiveness of our accurate keypoint localization and visibility prediction on the fine-grained bird recognition task with and without ground truth bird bounding boxes, and outperform existing state-of-the-art methods by over 2


page 5

page 9

page 10


The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition

Current approaches for fine-grained recognition do the following: First,...

Fine-grained Visual Categorization using PAIRS: Pose and Appearance Integration for Recognizing Subcategories

In Fine-grained Visual Categorization (FGVC), the differences between si...

Hypercolumns for Object Segmentation and Fine-grained Localization

Recognition algorithms based on convolutional networks (CNNs) typically ...

Human 3D keypoints via spatial uncertainty modeling

We introduce a technique for 3D human keypoint estimation that directly ...

Long-Lived Accurate Keypoints in Event Streams

We present a novel end-to-end approach to keypoint detection and trackin...

Neural Activation Constellations: Unsupervised Part Model Discovery with Convolutional Networks

Part models of object categories are essential for challenging recogniti...

Interpretable and Accurate Fine-grained Recognition via Region Grouping

We present an interpretable deep model for fine-grained visual recogniti...