SRRM: Semantic Region Relation Model for Indoor Scene Recognition

05/15/2023
by   Chuanxin Song, et al.
0

Despite the remarkable success of convolutional neural networks in various computer vision tasks, recognizing indoor scenes still presents a significant challenge due to their complex composition. Consequently, effectively leveraging semantic information in the scene has been a key issue in advancing indoor scene recognition. Unfortunately, the accuracy of semantic segmentation has limited the effectiveness of existing approaches for leveraging semantic information. As a result, many of these approaches remain at the stage of auxiliary labeling or co-occurrence statistics, with few exploring the contextual relationships between the semantic elements directly within the scene. In this paper, we propose the Semantic Region Relationship Model (SRRM), which starts directly from the semantic information inside the scene. Specifically, SRRM adopts an adaptive and efficient approach to mitigate the negative impact of semantic ambiguity and then models the semantic region relationship to perform scene recognition. Additionally, to more comprehensively exploit the information contained in the scene, we combine the proposed SRRM with the PlacesCNN module to create the Combined Semantic Region Relation Model (CSRRM), and propose a novel information combining approach to effectively explore the complementary contents between them. CSRRM significantly outperforms the SOTA methods on the MIT Indoor 67, reduced Places365 dataset, and SUN RGB-D without retraining. The code is available at: https://github.com/ChuanxinSong/SRRM

READ FULL TEXT
research
05/22/2023

Semantic-guided context modeling for indoor scene recognition

Exploring the semantic context in scene images is essential for indoor s...
research
11/25/2021

ContourletNet: A Generalized Rain Removal Architecture Using Multi-Direction Hierarchical Representation

Images acquired from rainy scenes usually suffer from bad visibility whi...
research
11/21/2013

Adaptive Learning of Region-based pLSA Model for Total Scene Annotation

In this paper, we present a region-based pLSA model to accomplish the ta...
research
08/01/2021

BORM: Bayesian Object Relation Model for Indoor Scene Recognition

Scene recognition is a fundamental task in robotic perception. For human...
research
08/01/2021

Object-to-Scene: Learning to Transfer Object Knowledge to Indoor Scene Recognition

Accurate perception of the surrounding scene is helpful for robots to ma...
research
01/05/2018

Semantic-aware Grad-GAN for Virtual-to-Real Urban Scene Adaption

Recent advances in vision tasks (e.g., segmentation) highly depend on th...
research
08/22/2023

Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition

We are concerned with a challenging scenario in unpaired multiview video...

Please sign up or login with your details

Forgot password? Click here to reset