Multi-Label Image Classification with Regional Latent Semantic Dependencies

12/04/2016
by   Junjie Zhang, et al.

Deep convolutional neural networks (CNNs) have demonstrated strong performance on single-label image classification, and progress has also been made in applying CNN methods to multi-label image classification, which requires annotating objects, attributes, scene categories, etc. in a single shot. Recent state-of-the-art approaches to multi-label image classification exploit the label dependencies in an image at the global level, largely improving the labeling capacity. However, predicting small objects and visual concepts is still challenging due to the limited discriminative power of global visual features. In this paper, we propose a Regional Latent Semantic Dependencies model (RLSD) to address this problem. The model includes a fully convolutional localization architecture that localizes regions likely to contain multiple highly dependent labels. The localized regions are then passed to recurrent neural networks (RNNs), which characterize the latent semantic dependencies at the regional level. Experimental results on several benchmark datasets show that the proposed model achieves the best performance compared to state-of-the-art models, especially in predicting small objects that occur in the images. In addition, we set up an upper-bound model (RLSD+ft-RPN) that uses bounding-box coordinates during training. The experimental results show that RLSD can approach this upper bound without using bounding-box annotations, which is the more realistic setting in practice.
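
The abstract describes a pipeline in which regions are localized and each region is scored for labels by an RNN, but it does not say how regional predictions are combined into image-level labels. The following is a minimal sketch of that last step only, assuming (this is an assumption, not stated in the abstract) that per-region label scores are max-pooled across regions and then thresholded. The function names, the example scores, and the 0.5 threshold are all hypothetical.

```python
from typing import Dict, List


def aggregate_region_scores(region_scores: List[Dict[str, float]]) -> Dict[str, float]:
    """Combine per-region label scores into image-level scores.

    Assumption: the image-level score of a label is the maximum score
    that label receives in any region (max-pooling across regions).
    """
    image_scores: Dict[str, float] = {}
    for scores in region_scores:
        for label, score in scores.items():
            image_scores[label] = max(score, image_scores.get(label, 0.0))
    return image_scores


def predict_labels(region_scores: List[Dict[str, float]], threshold: float = 0.5) -> List[str]:
    """Return the sorted labels whose image-level score reaches the threshold."""
    image_scores = aggregate_region_scores(region_scores)
    return sorted(label for label, score in image_scores.items() if score >= threshold)


# Hypothetical per-region outputs, standing in for what a regional RNN might emit.
regions = [
    {"person": 0.9, "tennis racket": 0.7, "dog": 0.1},
    {"dog": 0.2, "sports ball": 0.6},
]
labels = predict_labels(regions)
```

With max-pooling, a small object such as the ball only needs to score highly in one region to be predicted for the whole image, which is the intuition behind working at the regional rather than the global level.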


research
04/15/2016

CNN-RNN: A Unified Framework for Multi-label Image Classification

While deep convolutional neural networks (CNNs) have shown a great succe...
research
04/22/2015

Exploit Bounding Box Annotations for Multi-label Object Recognition

Convolutional neural networks (CNNs) have shown great performance as gen...
research
11/27/2020

General Multi-label Image Classification with Transformers

Multi-label image classification is the task of predicting a set of labe...
research
02/21/2018

Learning Image Conditioned Label Space for Multilabel Classification

This work addresses the task of multilabel image classification. Inspire...
research
04/15/2016

Latent Model Ensemble with Auto-localization

Deep Convolutional Neural Networks (CNN) have exhibited superior perform...
research
06/22/2014

CNN: Single-label to Multi-label

Convolutional Neural Network (CNN) has demonstrated promising performanc...
research
11/16/2016

Semantic Regularisation for Recurrent Image Annotation

The "CNN-RNN" design pattern is increasingly widely applied in a variety...
