BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Networks

11/22/2019
by   Yao Yao, et al.
21

While deep learning has recently achieved great success on multi-view stereo (MVS), limited training data makes the trained model hard to be generalized to unseen scenarios. Compared with other computer vision tasks, it is rather difficult to collect a large-scale MVS dataset as it requires expensive active scanners and labor-intensive process to obtain ground truth 3D structures. In this paper, we introduce BlendedMVS, a novel large-scale dataset, to provide sufficient training ground truth for learning-based MVS. To create the dataset, we apply a 3D reconstruction pipeline to recover high-quality textured meshes from images of well-selected scenes. Then, we render these mesh models to color images and depth maps. The rendered color images are further blended with the input images to generate photo-realistic blended images as the training input. Our dataset contains over 17k high-resolution images covering a variety of scenes, including cities, architectures, sculptures and small objects. Extensive experiments demonstrate that BlendedMVS endows the trained model with significantly better generalization ability compared with other MVS datasets. The entire dataset with pretrained models will be made publicly available at https://github.com/YoYo000/BlendedMVS.

READ FULL TEXT

page 2

page 5

page 7

page 12

page 13

page 14

page 15

page 16

research
02/27/2019

Recurrent MVSNet for High-resolution Multi-view Stereo Depth Inference

Deep learning has recently demonstrated its excellent performance for mu...
research
03/28/2020

Real-MFF Dataset: A Large Realistic Multi-focus Image Dataset with Ground Truth

Multi-focus image fusion, a technique to generate an all-in-focus image ...
research
04/02/2018

MegaDepth: Learning Single-View Depth Prediction from Internet Photos

Single-view depth prediction is a fundamental problem in computer vision...
research
10/28/2021

Towards Large-Scale Rendering of Simulated Crops for Synthetic Ground Truth Generation on Modular Supercomputers

Computer Vision problems deal with the semantic extraction of informatio...
research
04/18/2016

Using Self-Contradiction to Learn Confidence Measures in Stereo Vision

Learned confidence measures gain increasing importance for outlier remov...
research
11/29/2018

Progressive Recurrent Learning for Visual Recognition

Computer vision is difficult, partly because the mathematical function c...
research
08/28/2020

Pixel-Face: A Large-Scale, High-Resolution Benchmark for 3D Face Reconstruction

3D face reconstruction is a fundamental task that can facilitate numerou...

Please sign up or login with your details

Forgot password? Click here to reset