Fine-grained Visual Categorization using PAIRS: Pose and Appearance Integration for Recognizing Subcategories

01/27/2018
by Pei Guo, et al.

In Fine-grained Visual Categorization (FGVC), the differences between similar categories are often highly localized to a small number of object parts, so significant pose variation poses a major challenge for identification. To address this, we propose extracting image patches using pairs of predicted keypoint locations as anchor points. The benefits of this approach are two-fold: (1) it achieves explicit top-down visual attention on object parts, and (2) the extracted patches are pose-aligned and thus contain stable appearance features. We employ the popular Stacked Hourglass Network to predict keypoint locations, reporting state-of-the-art keypoint localization results on the challenging CUB-200-2011 dataset. Anchored by these predicted keypoints, an overcomplete basis of pose-aligned patches is extracted, and a specialized appearance classification network is trained for each patch. An aggregating network is then applied to combine the patch networks' individual predictions, producing a final classification score. Our PAIRS algorithm attains an accuracy of 88.6%. Enhancing the base PAIRS model with single-keypoint patches produces a further improvement, yielding a new state-of-the-art accuracy of 89.2%.
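The idea of a patch anchored by a pair of keypoints can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `extract_pair_patch` and `all_pair_patches`, the patch size, the `margin` parameter, the nearest-neighbour sampling, and the choice to use every unordered keypoint pair are all assumptions made for illustration. The sketch builds a similarity transform (rotation, scale, translation) that places the two keypoints at fixed positions in the output patch, which is one plausible way to obtain pose-aligned patches.

```python
import itertools

import numpy as np


def extract_pair_patch(image, kp_a, kp_b, out_size=(33, 65), margin=0.25):
    """Sample a patch in which kp_a and kp_b land at fixed positions.

    Hypothetical helper: keypoints are (x, y) pixel coordinates, and the
    patch geometry (size, margin) is an assumption, not taken from the paper.
    """
    out_h, out_w = out_size
    # Fixed anchor positions inside the patch, on its horizontal midline.
    # Points are stored as complex numbers x + 1j*y.
    mid_y = 0.5 * (out_h - 1)
    anchor_a = complex(margin * (out_w - 1), mid_y)
    anchor_b = complex((1.0 - margin) * (out_w - 1), mid_y)
    src_a = complex(*kp_a)
    src_b = complex(*kp_b)
    # One complex ratio encodes the rotation and scale of the similarity
    # transform that maps patch coordinates to image coordinates.
    z = (src_b - src_a) / (anchor_b - anchor_a)
    ys, xs = np.mgrid[0:out_h, 0:out_w]
    src_pts = src_a + z * ((xs + 1j * ys) - anchor_a)
    # Nearest-neighbour sampling; coordinates outside the image are clamped.
    src_x = np.clip(np.rint(src_pts.real).astype(int), 0, image.shape[1] - 1)
    src_y = np.clip(np.rint(src_pts.imag).astype(int), 0, image.shape[0] - 1)
    return image[src_y, src_x]


def all_pair_patches(image, keypoints, **kwargs):
    """Extract one patch per unordered keypoint pair (an assumed scheme)."""
    return {
        (i, j): extract_pair_patch(image, keypoints[i], keypoints[j], **kwargs)
        for i, j in itertools.combinations(range(len(keypoints)), 2)
    }
```

Under these assumptions, a set of n keypoints yields n(n-1)/2 pair patches, each of which could be passed to its own appearance classifier before the per-patch predictions are combined.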
