Recurrent Attentional Reinforcement Learning for Multi-label Image Recognition

12/20/2017
by   Tianshui Chen, et al.
0

Recognizing multiple labels of images is a fundamental but challenging task in computer vision, and remarkable progress has been attained by localizing semantic-aware image regions and predicting their labels with deep convolutional neural networks. The step of hypothesis regions (region proposals) localization in these existing multi-label image recognition pipelines, however, usually takes redundant computation cost, e.g., generating hundreds of meaningless proposals with non-discriminative information and extracting their features, and the spatial contextual dependency modeling among the localized regions are often ignored or over-simplified. To resolve these issues, this paper proposes a recurrent attention reinforcement learning framework to iteratively discover a sequence of attentional and informative regions that are related to different semantic objects and further predict label scores conditioned on these regions. Besides, our method explicitly models long-term dependencies among these attentional regions that help to capture semantic label co-occurrence and thus facilitate multi-label recognition. Extensive experiments and comparisons on two large-scale benchmarks (i.e., PASCAL VOC and MS-COCO) show that our model achieves superior performance over existing state-of-the-art methods in both performance and efficiency as well as explicitly identifying image-level semantic labels to specific object regions.

READ FULL TEXT

page 3

page 7

research
11/08/2017

Multi-label Image Recognition by Recurrently Discovering Attentional Regions

This paper proposes a novel deep architecture to address multi-label ima...
research
04/08/2022

Semantic Representation and Dependency Learning for Multi-Label Image Recognition

Recently many multi-label image recognition (MLR) works have made signif...
research
11/24/2021

Spatial-context-aware deep neural network for multi-class image classification

Multi-label image classification is a fundamental but challenging task i...
research
03/15/2023

Co-Occurrence Matters: Learning Action Relation for Temporal Action Localization

Temporal action localization (TAL) is a prevailing task due to its great...
research
09/20/2020

Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition

Recognizing multiple labels of an image is a practical yet challenging t...
research
08/05/2021

Residual Attention: A Simple but Effective Method for Multi-Label Recognition

Multi-label image recognition is a challenging computer vision task of p...
research
08/20/2019

Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition

Recognizing multiple labels of images is a practical and challenging tas...

Please sign up or login with your details

Forgot password? Click here to reset