Multi-Label Image Recognition with Graph Convolutional Networks

04/07/2019
by   Zhao-Min Chen, et al.
32

The task of multi-label image recognition is to predict a set of object labels that present in an image. As objects normally co-occur in an image, it is desirable to model the label dependencies to improve the recognition performance. To capture and explore such important dependencies, we propose a multi-label classification model based on Graph Convolutional Network (GCN). The model builds a directed graph over the object labels, where each node (label) is represented by word embeddings of a label, and GCN is learned to map this label graph into a set of inter-dependent object classifiers. These classifiers are applied to the image descriptors extracted by another sub-net, enabling the whole network to be end-to-end trainable. Furthermore, we propose a novel re-weighted scheme to create an effective label correlation matrix to guide information propagation among the nodes in GCN. Experiments on two multi-label image recognition datasets show that our approach obviously outperforms other existing state-of-the-art methods. In addition, visualization analyses reveal that the classifiers learned by our model maintain meaningful semantic topology.

READ FULL TEXT

page 1

page 3

page 4

page 7

page 8

research
09/28/2019

Learning Category Correlations for Multi-label Image Recognition with Graph Networks

Multi-label image recognition is a task that predicts a set of object la...
research
12/26/2019

Multi-Label Graph Convolutional Network Representation Learning

Knowledge representation of graph-based systems is fundamental across ma...
research
08/28/2023

GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition

Multi-Label Image Recognition (MLIR) is a challenging task that aims to ...
research
01/11/2023

Multi-label Image Classification using Adaptive Graph Convolutional Networks: from a Single Domain to Multiple Domains

This paper proposes an adaptive graph-based approach for multi-label ima...
research
01/27/2020

An Ontology-Aware Framework for Audio Event Classification

Recent advancements in audio event classification often ignore the struc...
research
02/22/2023

BB-GCN: A Bi-modal Bridged Graph Convolutional Network for Multi-label Chest X-Ray Recognition

Multi-label chest X-ray (CXR) recognition involves simultaneously diagno...
research
10/12/2017

Graph Convolutional Networks for Classification with a Structured Label Space

It is a usual practice to ignore any structural information underlying c...

Please sign up or login with your details

Forgot password? Click here to reset