A Deep Learning-based Global and Segmentation-based Semantic Feature Fusion Approach for Indoor Scene Classification

02/13/2023
by   Ricardo Pereira, et al.
1

Indoor scene classification has become an important task in perception modules and has been widely used in various applications. However, problems such as intra-category variability and inter-category similarity have been holding back the models' performance, which leads to the need for new types of features to obtain a more meaningful scene representation. A semantic segmentation mask provides pixel-level information about the objects available in the scene, which makes it a promising source of information to obtain a more meaningful local representation of the scene. Therefore, in this work, a novel approach that uses a semantic segmentation mask to obtain a 2D spatial layout of the object categories across the scene, designated by segmentation-based semantic features (SSFs), is proposed. These features represent, per object category, the pixel count, as well as the 2D average position and respective standard deviation values. Moreover, a two-branch network, GS2F2App, that exploits CNN-based global features extracted from RGB images and the segmentation-based features extracted from the proposed SSFs, is also proposed. GS2F2App was evaluated in two indoor scene benchmark datasets: the SUN RGB-D and the NYU Depth V2, achieving state-of-the-art results on both datasets.

READ FULL TEXT
research
01/26/2021

Global-Local Propagation Network for RGB-D Semantic Segmentation

Depth information matters in RGB-D semantic segmentation task for provid...
research
11/01/2019

Centroid-Based Scene Classification (CBSC): Using Deep Features and Clustering for RGB-D Indoor Scene Classification

This paper contributes a novel method for RGB-D indoor scene classificat...
research
12/30/2013

Constrained Parametric Proposals and Pooling Methods for Semantic Segmentation in RGB-D Images

We focus on the problem of semantic segmentation based on RGB-D data, wi...
research
04/06/2021

3D-to-2D Distillation for Indoor Scene Parsing

Indoor scene semantic parsing from RGB images is very challenging due to...
research
07/23/2023

EnTri: Ensemble Learning with Tri-level Representations for Explainable Scene Recognition

Scene recognition based on deep-learning has made significant progress, ...
research
10/13/2018

CPNet: A Context Preserver Convolutional Neural Network for Detecting Shadows in Single RGB Images

Automatic detection of shadow regions in an image is a difficult task du...
research
01/18/2023

Model-based inexact graph matching on top of CNNs for semantic scene understanding

Deep learning based pipelines for semantic segmentation often ignore str...

Please sign up or login with your details

Forgot password? Click here to reset