Semantic-Aware Scene Recognition

Scene recognition is currently one of the top-challenging research fields in computer vision. This may be due to the ambiguity between classes: images of several scene classes may share similar objects, which causes confusion among them. The problem is aggravated when images of a particular scene class are notably different. Convolutional Neural Networks (CNNs) have significantly boosted performance in scene recognition, albeit it is still far below from other recognition tasks (e.g., object or image recognition). In this paper, we describe a novel approach for scene recognition based on an end-to-end multi-modal CNN that combines image and context information by means of an attention module. Context information, in the shape of semantic segmentation, is used to gate features extracted from the RGB image by leveraging on information encoded in the semantic representation: the set of scene objects and stuff, and their relative locations. This gating process reinforces the learning of indicative scene content and enhances scene disambiguation by refocusing the receptive fields of the CNN towards them. Experimental results on four publicly available datasets show that the proposed approach outperforms every other state-of-the-art method while significantly reducing the number of network parameters. All the code and data used along this paper is available at https://github.com/vpulab/Semantic-Aware-Scene-Recognition

READ FULL TEXT

page 3

page 12

page 13

page 22

page 23

research
05/22/2023

Semantic-guided context modeling for indoor scene recognition

Exploring the semantic context in scene images is essential for indoor s...
research
08/24/2017

A Robust Indoor Scene Recognition Method based on Sparse Representation

In this paper, we present a robust method for scene recognition, which l...
research
05/12/2018

Exploring object-centric and scene-centric CNN features and their complementarity for human rights violations recognition in images

Identifying potential abuses of human rights through imagery is a novel ...
research
07/17/2019

FOSNet: An End-to-End Trainable Deep Neural Network for Scene Recognition

Scene recognition is an image recognition problem aimed at predicting th...
research
08/06/2018

Visual Question Generation for Class Acquisition of Unknown Objects

Traditional image recognition methods only consider objects belonging to...
research
10/07/2021

Design of an Intelligent Vision Algorithm for Recognition and Classification of Apples in an Orchard Scene

Apple is one of the remarkable fresh fruit that contains a high degree o...
research
08/23/2022

RGB-D Scene Recognition based on Object-Scene Relation

We develop a RGB-D scene recognition model based on object-scene relatio...

Please sign up or login with your details

Forgot password? Click here to reset