ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation

08/24/2021
by   Jinming Cao, et al.
5

RGB-D semantic segmentation has attracted increasing attention over the past few years. Existing methods mostly employ homogeneous convolution operators to consume the RGB and depth features, ignoring their intrinsic differences. In fact, the RGB values capture the photometric appearance properties in the projected image space, while the depth feature encodes both the shape of a local geometry as well as the base (whereabout) of it in a larger context. Compared with the base, the shape probably is more inherent and has a stronger connection to the semantics, and thus is more critical for segmentation accuracy. Inspired by this observation, we introduce a Shape-aware Convolutional layer (ShapeConv) for processing the depth feature, where the depth feature is firstly decomposed into a shape-component and a base-component, next two learnable weights are introduced to cooperate with them independently, and finally a convolution is applied on the re-weighted combination of these two components. ShapeConv is model-agnostic and can be easily integrated into most CNNs to replace vanilla convolutional layers for semantic segmentation. Extensive experiments on three challenging indoor RGB-D semantic segmentation benchmarks, i.e., NYU-Dv2(-13,-40), SUN RGB-D, and SID, demonstrate the effectiveness of our ShapeConv when employing it over five popular architectures. Moreover, the performance of CNNs with ShapeConv is boosted without introducing any computation and memory increase in the inference phase. The reason is that the learnt weights for balancing the importance between the shape and base components in ShapeConv become constants in the inference phase, and thus can be fused into the following convolution, resulting in a network that is identical to one with vanilla convolutional layers.

READ FULL TEXT

page 1

page 7

page 8

research
03/19/2018

Depth-aware CNN for RGB-D Segmentation

Convolutional neural networks (CNN) are limited by the lack of capabilit...
research
06/08/2022

Depth-Adapted CNNs for RGB-D Semantic Segmentation

Recent RGB-D semantic segmentation has motivated research interest thank...
research
02/23/2023

Pixel Difference Convolutional Network for RGB-D Semantic Segmentation

RGB-D semantic segmentation can be advanced with convolutional neural ne...
research
05/25/2021

Review on Indoor RGB-D Semantic Segmentation with Deep Convolutional Neural Networks

Many research works focus on leveraging the complementary geometric info...
research
12/04/2018

SurfConv: Bridging 3D and 2D Convolution for RGBD Images

We tackle the problem of using 3D information in convolutional neural ne...
research
04/09/2020

Spatial Information Guided Convolution for Real-Time RGBD Semantic Segmentation

3D spatial information is known to be beneficial to the semantic segment...
research
10/17/2022

Deformably-Scaled Transposed Convolution

Transposed convolution is crucial for generating high-resolution outputs...

Please sign up or login with your details

Forgot password? Click here to reset