A Novel Recurrent Encoder-Decoder Structure for Large-Scale Multi-view Stereo Reconstruction from An Open Aerial Dataset

03/02/2020
by   Jin Liu, et al.
0

A great deal of research has demonstrated recently that multi-view stereo (MVS) matching can be solved with deep learning methods. However, these efforts were focused on close-range objects and only a very few of the deep learning-based methods were specifically designed for large-scale 3D urban reconstruction due to the lack of multi-view aerial image benchmarks. In this paper, we present a synthetic aerial dataset, called the WHU dataset, we created for MVS tasks, which, to our knowledge, is the first large-scale multi-view aerial dataset. It was generated from a highly accurate 3D digital surface model produced from thousands of real aerial images with precise camera parameters. We also introduce in this paper a novel network, called RED-Net, for wide-range depth inference, which we developed from a recurrent encoder-decoder structure to regularize cost maps across depths and a 2D fully convolutional network as framework. RED-Net's low memory requirements and high performance make it suitable for large-scale and highly accurate 3D Earth surface reconstruction. Our experiments confirmed that not only did our method exceed the current state-of-the-art MVS methods by more than 50 error (MAE) with less memory and computational cost, but its efficiency as well. It outperformed one of the best commercial software programs based on conventional methods, improving their efficiency 16 times over. Moreover, we proved that our RED-Net model pre-trained on the synthetic WHU dataset can be efficiently transferred to very different multi-view aerial image datasets without any fine-tuning. Dataset are available at http://gpcv.whu.edu.cn/data.

READ FULL TEXT

page 3

page 4

page 5

page 6

page 7

page 8

research
09/23/2021

Rational Polynomial Camera Model Warping for Deep Learning Based Satellite Multi-View Stereo Matching

Satellite multi-view stereo (MVS) imagery is particularly suited for lar...
research
08/17/2019

OmniMVS: End-to-End Learning for Omnidirectional Stereo Matching

In this paper, we propose a novel end-to-end deep neural network model f...
research
07/18/2022

Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction

In this paper, a complete pipeline for image-based 3D reconstruction of ...
research
12/06/2019

Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation

In this paper, we propose an effective and efficient pyramid multi-view ...
research
06/05/2023

Computational 3D topographic microscopy from terabytes of data per sample

We present a large-scale computational 3D topographic microscope that en...
research
04/02/2018

DeepMVS: Learning Multi-view Stereopsis

We present DeepMVS, a deep convolutional neural network (ConvNet) for mu...
research
07/26/2019

MVB: A Large-Scale Dataset for Baggage Re-Identification and Merged Siamese Networks

In this paper, we present a novel dataset named MVB (Multi View Baggage)...

Please sign up or login with your details

Forgot password? Click here to reset