Deep Learning for Fine-Grained Image Analysis: A Survey

by   Xiu-Shen Wei, et al.

Computer vision (CV) is the process of using machines to understand and analyze imagery, which is an integral branch of artificial intelligence. Among various research areas of CV, fine-grained image analysis (FGIA) is a longstanding and fundamental problem, and has become ubiquitous in diverse real-world applications. The task of FGIA targets analyzing visual objects from subordinate categories, , species of birds or models of cars. The small inter-class variations and the large intra-class variations caused by the fine-grained nature makes it a challenging problem. During the booming of deep learning, recent years have witnessed remarkable progress of FGIA using deep learning techniques. In this paper, we aim to give a survey on recent advances of deep learning based FGIA techniques in a systematic way. Specifically, we organize the existing studies of FGIA techniques into three major categories: fine-grained image recognition, fine-grained image retrieval and fine-grained image generation. In addition, we also cover some other important issues of FGIA, such as publicly available benchmark datasets and its related domain specific applications. Finally, we conclude this survey by highlighting several directions and open problems which need be further explored by the community in the future.


page 1

page 2

page 3

page 5


Fine-Grained Image Analysis with Deep Learning: A Survey

Fine-grained image analysis (FGIA) is a longstanding and fundamental pro...

cofga: A Dataset for Fine Grained Classification of Objects from Aerial Imagery

Detection and classification of objects in overhead images are two impor...

Fine-Grained Sports, Yoga, and Dance Postures Recognition: A Benchmark Analysis

Human body-pose estimation is a complex problem in computer vision. Rece...

Three-branch and Mutil-scale learning for Fine-grained Image Recognition (TBMSL-Net)

ImageNet Large Scale Visual Recognition Challenge (ILSVRC) is one of the...

Towards a Scalable and Distributed Infrastructure for Deep Learning Applications

Although recent scaling up approaches to train deep neural networks have...

An Analysis of Action Recognition Datasets for Language and Vision Tasks

A large amount of recent research has focused on tasks that combine lang...

Open Domain Suggestion Mining Leveraging Fine-Grained Analysis

Suggestion mining tasks are often semantically complex and lack sophisti...

Please sign up or login with your details

Forgot password? Click here to reset