Deep Surface Normal Estimation with Hierarchical RGB-D Fusion

04/06/2019
by   Jin Zeng, et al.
0

The growing availability of commodity RGB-D cameras has boosted the applications in the field of scene understanding. However, as a fundamental scene understanding task, surface normal estimation from RGB-D data lacks thorough investigation. In this paper, a hierarchical fusion network with adaptive feature re-weighting is proposed for surface normal estimation from a single RGB-D image. Specifically, the features from color image and depth are successively integrated at multiple scales to ensure global surface smoothness while preserving visually salient details. Meanwhile, the depth features are re-weighted with a confidence map estimated from depth before merging into the color branch to avoid artifacts caused by input depth corruption. Additionally, a hybrid multi-scale loss function is designed to learn accurate normal estimation given noisy ground-truth dataset. Extensive experimental results validate the effectiveness of the fusion strategy and the loss design, outperforming state-of-the-art normal estimation schemes.

READ FULL TEXT

page 1

page 3

page 5

page 7

page 8

research
07/09/2020

Cross-Modal Weighting Network for RGB-D Salient Object Detection

Depth maps contain geometric clues for assisting Salient Object Detectio...
research
01/26/2021

Global-Local Propagation Network for RGB-D Semantic Segmentation

Depth information matters in RGB-D semantic segmentation task for provid...
research
09/14/2019

Deep Robotic Prediction with hierarchical RGB-D Fusion

Robotic is a fundamental operation in robotic control task goals. We con...
research
11/24/2020

Multi-Scale Progressive Fusion Learning for Depth Map Super-Resolution

Limited by the cost and technology, the resolution of depth map collecte...
research
11/03/2017

Multi-Glimpse LSTM with Color-Depth Feature Fusion for Human Detection

With the development of depth cameras such as Kinect and Intel Realsense...
research
09/19/2023

RoadFormer: Duplex Transformer for RGB-Normal Semantic Road Scene Parsing

The recent advancements in deep convolutional neural networks have shown...
research
07/30/2020

NormalGAN: Learning Detailed 3D Human from a Single RGB-D Image

We propose NormalGAN, a fast adversarial learning-based method to recons...

Please sign up or login with your details

Forgot password? Click here to reset