GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition

08/28/2023
by   Ruijie Yao, et al.
0

Multi-Label Image Recognition (MLIR) is a challenging task that aims to predict multiple object labels in a single image while modeling the complex relationships between labels and image regions. Although convolutional neural networks and vision transformers have succeeded in processing images as regular grids of pixels or patches, these representations are sub-optimal for capturing irregular and discontinuous regions of interest. In this work, we present the first fully graph convolutional model, Group K-nearest neighbor based Graph convolutional Network (GKGNet), which models the connections between semantic label embeddings and image patches in a flexible and unified graph structure. To address the scale variance of different objects and to capture information from multiple perspectives, we propose the Group KGCN module for dynamic graph construction and message passing. Our experiments demonstrate that GKGNet achieves state-of-the-art performance with significantly lower computational costs on the challenging multi-label datasets, MS-COCO and VOC2007 datasets. We will release the code and models to facilitate future research in this area.

READ FULL TEXT

page 7

page 10

page 11

research
04/07/2019

Multi-Label Image Recognition with Graph Convolutional Networks

The task of multi-label image recognition is to predict a set of object ...
research
12/17/2022

Multi-Scale Relational Graph Convolutional Network for Multiple Instance Learning in Histopathology Images

Graph convolutional neural networks have shown significant potential in ...
research
09/28/2019

Learning Category Correlations for Multi-label Image Recognition with Graph Networks

Multi-label image recognition is a task that predicts a set of object la...
research
03/10/2022

AGCN: Augmented Graph Convolutional Network for Lifelong Multi-label Image Recognition

The Lifelong Multi-Label (LML) image recognition builds an online class-...
research
10/10/2021

Transformer-based Dual Relation Graph for Multi-label Image Recognition

The simultaneous recognition of multiple objects in one image remains a ...
research
11/27/2022

Multi-Label Continual Learning using Augmented Graph Convolutional Network

Multi-Label Continual Learning (MLCL) builds a class-incremental framewo...
research
11/08/2017

Multi-label Image Recognition by Recurrently Discovering Attentional Regions

This paper proposes a novel deep architecture to address multi-label ima...

Please sign up or login with your details

Forgot password? Click here to reset