Cross-layer Navigation Convolutional Neural Network for Fine-grained Visual Classification

06/21/2021
by   Chenyu Guo, et al.
0

Fine-grained visual classification (FGVC) aims to classify sub-classes of objects in the same super-class (e.g., species of birds, models of cars). For the FGVC tasks, the essential solution is to find discriminative subtle information of the target from local regions. TraditionalFGVC models preferred to use the refined features,i.e., high-level semantic information for recognition and rarely use low-level in-formation. However, it turns out that low-level information which contains rich detail information also has effect on improving performance. Therefore, in this paper, we propose cross-layer navigation convolutional neural network for feature fusion. First, the feature maps extracted by the backbone network are fed into a convolutional long short-term memory model sequentially from high-level to low-level to perform feature aggregation. Then, attention mechanisms are used after feature fusion to extract spatial and channel information while linking the high-level semantic information and the low-level texture features, which can better locate the discriminative regions for the FGVC. In the experiments, three commonly used FGVC datasets, including CUB-200-2011, Stanford-Cars, andFGVC-Aircraft datasets, are used for evaluation and we demonstrate the superiority of the proposed method by comparing it with other referred FGVC methods to show that this method achieves superior results.

READ FULL TEXT
research
02/09/2020

Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained Visual Classification

Classifying the sub-categories of an object from the same super-category...
research
05/11/2017

Object-Level Context Modeling For Scene Classification with Context-CNN

Convolutional Neural Networks (CNNs) have been used extensively for comp...
research
05/26/2020

Learning Local Features with Context Aggregation for Visual Localization

Keypoint detection and description is fundamental yet important in many ...
research
04/04/2019

Feature Pyramid Hashing

In recent years, deep-networks-based hashing has become a leading approa...
research
10/05/2022

Toward Knowledge-Driven Speech-Based Models of Depression: Leveraging Spectrotemporal Variations in Speech Vowels

Psychomotor retardation associated with depression has been linked with ...
research
04/17/2019

Image Resizing by Reconstruction from Deep Features

Traditional image resizing methods usually work in pixel space and use v...
research
05/07/2013

High Level Pattern Classification via Tourist Walks in Networks

Complex networks refer to large-scale graphs with nontrivial connection ...

Please sign up or login with your details

Forgot password? Click here to reset