Depth-aware CNN for RGB-D Segmentation

03/19/2018
by Weiyue Wang, et al.

Convolutional neural networks (CNNs) are limited in their ability to handle geometric information because of their fixed grid kernel structure. The availability of depth data enables progress in RGB-D semantic segmentation with CNNs. State-of-the-art methods either use depth as additional images or process spatial information in 3D volumes or point clouds. These methods suffer from high computational and memory costs. To address these issues, we present Depth-aware CNN by introducing two intuitive, flexible and effective operations: depth-aware convolution and depth-aware average pooling. By leveraging depth similarity between pixels in the process of information propagation, geometry is seamlessly incorporated into the CNN. Because neither operator introduces additional parameters, both can be easily integrated into existing CNNs. Extensive experiments and ablation studies on challenging RGB-D semantic segmentation benchmarks validate the effectiveness and flexibility of our approach.
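The two operations named in the abstract can be illustrated with a minimal single-channel sketch in plain Python. It assumes the depth similarity between the window centre and a neighbour is exp(-alpha * |depth difference|). That functional form, the parameter name `alpha`, and all function names here are illustrative assumptions, not details given in the abstract.

```python
import math


def depth_similarity(d_center, d_neighbor, alpha=1.0):
    """Assumed similarity term: exp(-alpha * |depth difference|), in (0, 1]."""
    return math.exp(-alpha * abs(d_center - d_neighbor))


def depth_aware_conv2d(feat, depth, kernel, alpha=1.0):
    """'Valid' single-channel convolution (cross-correlation form).

    Each kernel tap is scaled by the depth similarity between the window
    centre and that tap. The kernel is assumed square with odd size and
    has the same shape as an ordinary kernel, so no parameters are added.
    """
    r = len(kernel) // 2
    h, w = len(feat), len(feat[0])
    out = []
    for i in range(r, h - r):
        row = []
        for j in range(r, w - r):
            acc = 0.0
            for di in range(-r, r + 1):
                for dj in range(-r, r + 1):
                    s = depth_similarity(depth[i][j], depth[i + di][j + dj], alpha)
                    acc += kernel[di + r][dj + r] * s * feat[i + di][j + dj]
            row.append(acc)
        out.append(row)
    return out


def depth_aware_avg_pool(feat, depth, size=3, alpha=1.0):
    """'Valid' average pooling with stride 1, weighted by depth similarity.

    The weights are normalised by their sum; the centre pixel always
    contributes weight 1, so the denominator is never zero.
    """
    r = size // 2
    h, w = len(feat), len(feat[0])
    out = []
    for i in range(r, h - r):
        row = []
        for j in range(r, w - r):
            num, den = 0.0, 0.0
            for di in range(-r, r + 1):
                for dj in range(-r, r + 1):
                    s = depth_similarity(depth[i][j], depth[i + di][j + dj], alpha)
                    num += s * feat[i + di][j + dj]
                    den += s
            row.append(num / den)
        out.append(row)
    return out
```

Under this assumption, a flat depth map makes every similarity equal to 1, so both functions reduce to their ordinary counterparts. Across a large depth discontinuity, the similarity approaches 0, so pixels on the far side of the edge contribute almost nothing to the output.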


research
06/08/2022

Depth-Adapted CNNs for RGB-D Semantic Segmentation

Recent RGB-D semantic segmentation has motivated research interest thank...
research
08/24/2021

ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation

RGB-D semantic segmentation has attracted increasing attention over the ...
research
09/21/2020

Depth-Adapted CNN for RGB-D cameras

Conventional 2D Convolutional Neural Networks (CNN) extract features fro...
research
02/23/2023

Pixel Difference Convolutional Network for RGB-D Semantic Segmentation

RGB-D semantic segmentation can be advanced with convolutional neural ne...
research
06/27/2019

Hard Pixels Mining: Learning Using Privileged Information for Semantic Segmentation

Semantic segmentation has achieved significant progress but is still cha...
research
07/18/2020

Malleable 2.5D Convolution: Learning Receptive Fields along the Depth-axis for RGB-D Scene Parsing

Depth data provide geometric information that can bring progress in RGB-...
research
01/08/2021

HIVE-Net: Centerline-Aware HIerarchical View-Ensemble Convolutional Network for Mitochondria Segmentation in EM Images

Semantic segmentation of electron microscopy (EM) is an essential step t...
