MVSNet: Depth Inference for Unstructured Multi-view Stereo

04/07/2018
by Yao Yao, et al.

We present an end-to-end deep learning architecture for depth map inference from multi-view images. In the network, we first extract deep visual image features, and then build a 3D cost volume on the reference camera frustum via differentiable homography warping. Next, we apply 3D convolutions to regularize the cost volume and regress an initial depth map, which is then refined with the reference image to generate the final output. Our framework flexibly adapts to arbitrary N-view inputs by using a variance-based cost metric that maps multiple features into one cost feature. The proposed MVSNet is demonstrated on the large-scale indoor DTU dataset. With simple post-processing, our method not only significantly outperforms previous state-of-the-art methods, but is also several times faster in runtime. We also evaluate MVSNet on the complex outdoor Tanks and Temples dataset, where our method ranks first without any fine-tuning, showing the strong generalization ability of MVSNet.
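To make the variance-based cost metric mentioned in the abstract more concrete, the following is a minimal sketch, not the authors' implementation. It assumes each view's warped feature volume has been flattened into a plain list of floats of equal length, and it computes the element-wise population variance across the N views. The function name `variance_cost` and the flat-list representation are illustrative assumptions.

```python
from typing import List, Sequence


def variance_cost(features: Sequence[Sequence[float]]) -> List[float]:
    """Element-wise variance across N per-view feature vectors.

    Assumption: each entry of `features` is one view's feature volume,
    already warped to the reference frustum and flattened to a list.
    """
    n = len(features)
    if n < 2:
        raise ValueError("need at least two views to measure feature variance")
    length = len(features[0])
    if any(len(f) != length for f in features):
        raise ValueError("all views must have the same number of feature values")

    # Mean feature value at each position, taken over the N views.
    mean = [sum(f[j] for f in features) / n for j in range(length)]

    # Average squared deviation from the mean: one cost value per position,
    # regardless of how many views were supplied.
    return [
        sum((f[j] - mean[j]) ** 2 for f in features) / n
        for j in range(length)
    ]


if __name__ == "__main__":
    two_views = [[1.0, 2.0], [1.0, 4.0]]
    print(variance_cost(two_views))
```

Because the variance is a symmetric function of its inputs, the result does not depend on the order of the views, and the output has the same size for any N. Positions where all views agree yield zero cost, which is the behaviour one would expect from a photo-consistency measure.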


research
05/28/2022

RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo

In this paper, we present a learning-based approach for multi-view stere...
research
08/30/2019

MVS^2: Deep Unsupervised Multi-view Stereo with Multi-View Symmetry

The success of existing deep-learning based multi-view stereo (MVS) appr...
research
01/19/2022

A Confidence-based Iterative Solver of Depths and Surface Normals for Deep Multi-view Stereo

In this paper, we introduce a deep multi-view stereo (MVS) system that j...
research
04/27/2021

MVS2D: Efficient Multi-view Stereo via Attention-Driven 2D Convolutions

Deep learning has made significant impacts on multi-view stereo systems....
research
06/13/2018

BA-Net: Dense Bundle Adjustment Network

This paper introduces a neural network to solve the structure-from-motio...
research
07/03/2020

ODE-CNN: Omnidirectional Depth Extension Networks

Omnidirectional 360 camera proliferates rapidly for autonomous robots si...
research
01/31/2019

Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images

Recovering the 3D representation of an object from single-view or multi-...
