Non-parametric Depth Distribution Modelling based Depth Inference for Multi-view Stereo

05/08/2022
by   Jiayu Yang, et al.
0

Recent cost volume pyramid based deep neural networks have unlocked the potential of efficiently leveraging high-resolution images for depth inference from multi-view stereo. In general, those approaches assume that the depth of each pixel follows a unimodal distribution. Boundary pixels usually follow a multi-modal distribution as they represent different depths; Therefore, the assumption results in an erroneous depth prediction at the coarser level of the cost volume pyramid and can not be corrected in the refinement levels leading to wrong depth predictions. In contrast, we propose constructing the cost volume by non-parametric depth distribution modeling to handle pixels with unimodal and multi-modal distributions. Our approach outputs multiple depth hypotheses at the coarser level to avoid errors in the early stage. As we perform local search around these multiple hypotheses in subsequent levels, our approach does not maintain the rigid depth spatial ordering and, therefore, we introduce a sparse cost aggregation network to derive information within each volume. We evaluate our approach extensively on two benchmark datasets: DTU and Tanks Temples. Our experimental results show that our model outperforms existing methods by a large margin and achieves superior performance on boundary regions. Code is available at https://github.com/NVlabs/NP-CVP-MVSNet

READ FULL TEXT

page 1

page 4

page 6

page 7

page 8

research
03/26/2021

DDR-Net: Learning Multi-Stage Multi-View Stereo With Dynamic Depth Range

To obtain high-resolution depth maps, some previous learning-based multi...
research
12/18/2019

Cost Volume Pyramid Based Depth Inference for Multi-View Stereo

We propose a cost volume based neural network for depth inference from m...
research
07/25/2022

Cost Volume Pyramid Network with Multi-strategies Range Searching for Multi-view Stereo

Multi-view stereo is an important research task in computer vision while...
research
11/25/2020

Attention Aware Cost Volume Pyramid Based Multi-view Stereo Network for 3D Reconstruction

We present an efficient multi-view stereo (MVS) network for 3D reconstru...
research
03/29/2020

Fast-MVSNet: Sparse-to-Dense Multi-View Stereo With Learned Propagation and Gauss-Newton Refinement

Almost all previous deep learning-based multi-view stereo (MVS) approach...
research
04/06/2022

DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D Detectors

Camera-based 3D object detectors are welcome due to their wider deployme...
research
04/28/2019

Weighted Dark Channel Dehazing

In dark channel based methods, local constant assumption is widely used ...

Please sign up or login with your details

Forgot password? Click here to reset