Coarse to Fine: Multi-label Image Classification with Global/Local Attention

12/26/2020
by   Fan lyu, et al.
0

In our daily life, the scenes around us are always with multiple labels especially in a smart city, i.e., recognizing the information of city operation to response and control. Great efforts have been made by using Deep Neural Networks to recognize multi-label images. Since multi-label image classification is very complicated, people seek to use the attention mechanism to guide the classification process. However, conventional attention-based methods always analyzed images directly and aggressively. It is difficult for them to well understand complicated scenes. In this paper, we propose a global/local attention method that can recognize an image from coarse to fine by mimicking how human-beings observe images. Specifically, our global/local attention method first concentrates on the whole image, and then focuses on local specific objects in the image. We also propose a joint max-margin objective function, which enforces that the minimum score of positive labels should be larger than the maximum score of negative labels horizontally and vertically. This function can further improve our multi-label image classification method. We evaluate the effectiveness of our method on two popular multi-label image datasets (i.e., Pascal VOC and MS-COCO). Our experimental results show that our method outperforms state-of-the-art methods.

READ FULL TEXT

page 5

page 6

research
06/11/2021

MlTr: Multi-label Classification with Transformer

The task of multi-label image classification is to recognize all the obj...
research
11/14/2017

Saliency-based Sequential Image Attention with Multiset Prediction

Humans process visual scenes selectively and sequentially using attentio...
research
12/17/2019

Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification

Multi-label image and video classification are fundamental yet challengi...
research
07/03/2020

Multi-Label Image Recognition with Multi-Class Attentional Regions

Multi-label image recognition is a practical and challenging task compar...
research
09/29/2021

Can multi-label classification networks know what they don't know?

Estimating out-of-distribution (OOD) uncertainty is a central challenge ...
research
11/24/2021

Spatial-context-aware deep neural network for multi-class image classification

Multi-label image classification is a fundamental but challenging task i...
research
02/16/2023

An Attention-based Approach to Hierarchical Multi-label Music Instrument Classification

Although music is typically multi-label, many works have studied hierarc...

Please sign up or login with your details

Forgot password? Click here to reset