Fine-Grained Visual Classification with Efficient End-to-end Localization

05/11/2020
by   Harald Hanselmann, et al.
0

The term fine-grained visual classification (FGVC) refers to classification tasks where the classes are very similar and the classification model needs to be able to find subtle differences to make the correct prediction. State-of-the-art approaches often include a localization step designed to help a classification network by localizing the relevant parts of the input images. However, this usually requires multiple iterations or passes through a full classification network or complex training schedules. In this work we present an efficient localization module that can be fused with a classification network in an end-to-end setup. On the one hand the module is trained by the gradient flowing back from the classification network. On the other hand, two self-supervised loss functions are introduced to increase the localization accuracy. We evaluate the new model on the three benchmark datasets CUB200-2011, Stanford Cars and FGVC-Aircraft and are able to achieve competitive recognition performance.

READ FULL TEXT

page 1

page 7

research
02/09/2023

Drawing Attention to Detail: Pose Alignment through Self-Attention for Fine-Grained Object Classification

Intra-class variations in the open world lead to various challenges in c...
research
11/17/2019

ELoPE: Fine-Grained Visual Classification with Efficient Localization, Pooling and Embedding

The task of fine-grained visual classification (FGVC) deals with classif...
research
12/06/2018

Guided Zoom: Questioning Network Evidence for Fine-grained Classification

We propose Guided Zoom, an approach that utilizes spatial grounding to m...
research
12/14/2019

Fine-grained Recognition: Accounting for Subtle Differences between Similar Classes

The main requisite for fine-grained recognition task is to focus on subt...
research
10/04/2016

Real Time Fine-Grained Categorization with Accuracy and Interpretability

A well-designed fine-grained categorization system usually has three con...
research
05/15/2021

One for All: An End-to-End Compact Solution for Hand Gesture Recognition

The HGR is a quite challenging task as its performance is influenced by ...
research
04/28/2021

Classification and comparison of license plates localization algorithms

The Intelligent Transportation Systems (ITS) are the subject of a world ...

Please sign up or login with your details

Forgot password? Click here to reset