A Confidence-based Iterative Solver of Depths and Surface Normals for Deep Multi-view Stereo

01/19/2022
by   Wang Zhao, et al.
8

In this paper, we introduce a deep multi-view stereo (MVS) system that jointly predicts depths, surface normals and per-view confidence maps. The key to our approach is a novel solver that iteratively solves for per-view depth map and normal map by optimizing an energy potential based on the locally planar assumption. Specifically, the algorithm updates depth map by propagating from neighboring pixels with slanted planes, and updates normal map with local probabilistic plane fitting. Both two steps are monitored by a customized confidence map. This solver is not only effective as a post-processing tool for plane-based depth refinement and completion, but also differentiable such that it can be efficiently integrated into deep learning pipelines. Our multi-view stereo system employs multiple optimization steps of the solver over the initial prediction of depths and surface normals. The whole system can be trained end-to-end, decoupling the challenging problem of matching pixels within poorly textured regions from the cost-volume based neural network. Experimental results on ScanNet and RGB-D Scenes V2 demonstrate state-of-the-art performance of the proposed deep MVS system on multi-view depth estimation, with our proposed solver consistently improving the depth quality over both conventional and deep learning based MVS pipelines. Code is available at https://github.com/thuzhaowang/idn-solver.

READ FULL TEXT

page 1

page 4

page 5

page 6

page 7

page 8

page 16

page 17

research
03/02/2020

A-TVSNet: Aggregated Two-View Stereo Network for Multi-View Stereo Depth Estimation

We propose a learning-based network for depth map estimation from multi-...
research
03/29/2020

Fast-MVSNet: Sparse-to-Dense Multi-View Stereo With Learned Propagation and Gauss-Newton Refinement

Almost all previous deep learning-based multi-view stereo (MVS) approach...
research
12/01/2021

FaSS-MVS – Fast Multi-View Stereo with Surface-Aware Semi-Global Matching from UAV-borne Monocular Imagery

With FaSS-MVS, we present an approach for fast multi-view stereo with su...
research
03/02/2022

iMVS: Improving MVS Networks by Learning Depth Discontinuities

Existing learning-based multi-view stereo (MVS) techniques are effective...
research
04/07/2018

MVSNet: Depth Inference for Unstructured Multi-view Stereo

We present an end-to-end deep learning architecture for depth map infere...
research
09/21/2019

Efficient Surface-Aware Semi-Global Matching with Multi-View Plane-Sweep Sampling

Online augmentation of an oblique aerial image sequence with structural ...
research
09/25/2018

Confidence Inference for Focused Learning in Stereo Matching

In this paper, we present confidence inference approachin an unsupervise...

Please sign up or login with your details

Forgot password? Click here to reset