Graph Attention Transformer Network for Multi-Label Image Classification

03/08/2022
by   Jin Yuan, et al.
0

Multi-label classification aims to recognize multiple objects or attributes from images. However, it is challenging to learn from proper label graphs to effectively characterize such inter-label correlations or dependencies. Current methods often use the co-occurrence probability of labels based on the training set as the adjacency matrix to model this correlation, which is greatly limited by the dataset and affects the model's generalization ability. In this paper, we propose a Graph Attention Transformer Network (GATN), a general framework for multi-label image classification that can effectively mine complex inter-label relationships. First, we use the cosine similarity based on the label word embedding as the initial correlation matrix, which can represent rich semantic information. Subsequently, we design the graph attention transformer layer to transfer this adjacency matrix to adapt to the current domain. Our extensive experiments have demonstrated that our proposed methods can achieve state-of-the-art performance on three datasets.

READ FULL TEXT

page 3

page 7

research
12/17/2019

Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification

Multi-label image and video classification are fundamental yet challengi...
research
10/10/2021

Transformer-based Dual Relation Graph for Multi-label Image Recognition

The simultaneous recognition of multiple objects in one image remains a ...
research
11/27/2020

General Multi-label Image Classification with Transformers

Multi-label image classification is the task of predicting a set of labe...
research
11/21/2019

Multi-Label Classification with Label Graph Superimposing

Images or videos always contain multiple objects or actions. Multi-label...
research
01/11/2023

Multi-label Image Classification using Adaptive Graph Convolutional Networks: from a Single Domain to Multiple Domains

This paper proposes an adaptive graph-based approach for multi-label ima...
research
04/28/2023

MASK-CNN-Transformer For Real-Time Multi-Label Weather Recognition

Weather recognition is an essential support for many practical life appl...
research
02/16/2022

Unified smoke and fire detection in an evolutionary framework with self-supervised progressive data augment

Few researches have studied simultaneous detection of smoke and flame ac...

Please sign up or login with your details

Forgot password? Click here to reset