
Deep Semantic Dictionary Learning for Multi-label Image Classification

by Fengtao Zhou, et al.

Compared with single-label image classification, multi-label image classification is more practical and challenging. Some recent studies have attempted to leverage the semantic information of categories to improve multi-label image classification performance. However, these semantic-based methods only take semantic information as a complement to the visual representation without further exploitation. In this paper, we present an innovative path towards solving multi-label image classification by treating it as a dictionary learning task. A novel end-to-end model named Deep Semantic Dictionary Learning (DSDL) is designed. In DSDL, an auto-encoder is applied to generate the semantic dictionary from class-level semantics, and this dictionary is then used to represent the visual features extracted by a Convolutional Neural Network (CNN) with label embeddings. DSDL provides a simple but elegant way to exploit and reconcile the label, semantic, and visual spaces simultaneously by conducting dictionary learning among them. Moreover, inspired by the iterative optimization of traditional dictionary learning, we further devise a novel training strategy named Alternately Parameters Update Strategy (APUS) for optimizing DSDL, which alternately optimizes the representation coefficients and the semantic dictionary in forward and backward propagation. Extensive experimental results on three popular benchmarks demonstrate that our method achieves promising performance in comparison with the state of the art. Our codes and models are available at
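The abstract says APUS is inspired by the iterative optimization of traditional dictionary learning. As a rough illustration of that classical idea only (not the DSDL model or APUS itself), the sketch below alternates between two closed-form ridge updates in NumPy: one for the representation coefficients with the dictionary fixed, and one for the dictionary with the coefficients fixed. The function name, the regularization weight `lam`, and the squared-error objective are assumptions chosen for this example; the abstract does not specify any of them.

```python
import numpy as np


def alternating_dictionary_learning(X, n_atoms, n_iters=20, lam=0.1, seed=0):
    """Classic alternating minimization for dictionary learning (illustrative).

    Minimizes ||X - D A||^2 + lam * ||A||^2 + lam * ||D||^2, where
    X is (d, n) data, D is a (d, n_atoms) dictionary, and A is the
    (n_atoms, n) matrix of representation coefficients.
    This is a hypothetical stand-in, not the authors' APUS procedure.
    """
    rng = np.random.default_rng(seed)
    d, _ = X.shape
    D = rng.standard_normal((d, n_atoms))
    eye = np.eye(n_atoms)
    losses = []
    for _ in range(n_iters):
        # Coefficient step: dictionary fixed, solve the ridge problem for A.
        A = np.linalg.solve(D.T @ D + lam * eye, D.T @ X)
        # Dictionary step: coefficients fixed, solve the ridge problem for D.
        D = np.linalg.solve(A @ A.T + lam * eye, A @ X.T).T
        # Record the shared objective after both updates.
        loss = (
            np.sum((X - D @ A) ** 2)
            + lam * np.sum(A ** 2)
            + lam * np.sum(D ** 2)
        )
        losses.append(float(loss))
    return D, A, losses


if __name__ == "__main__":
    X = np.random.default_rng(1).standard_normal((8, 30))
    D, A, losses = alternating_dictionary_learning(X, n_atoms=4, n_iters=10)
    print(D.shape, A.shape, losses[0] >= losses[-1])
```

Because each step exactly minimizes the same objective over one block of variables, the recorded loss cannot increase from one iteration to the next. In DSDL, by contrast, the abstract describes the dictionary as generated by an auto-encoder from class-level semantics and the two updates as tied to forward and backward propagation, which this sketch does not attempt to reproduce.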




Class Specific or Shared? A Hybrid Dictionary Learning Network for Image Classification

Dictionary learning methods can be split into two categories: i) class s...

Toward Real-Time Image Annotation Using Marginalized Coupled Dictionary Learning

In most image retrieval systems, images include various high-level seman...

MlTr: Multi-label Classification with Transformer

The task of multi-label image classification is to recognize all the obj...

Semantic Interleaving Global Channel Attention for Multilabel Remote Sensing Image Classification

Multi-Label Remote Sensing Image Classification (MLRSIC) has received in...

Plot2API: Recommending Graphic API from Plot via Semantic Parsing Guided Neural Network

Plot-based Graphic API recommendation (Plot2API) is an unstudied but mea...

Structured Analysis Dictionary Learning for Image Classification

We propose a computationally efficient and high-performance classificati...