AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network

08/09/2021
by   Zizhuang Wei, et al.
5

In this paper, we present a novel recurrent multi-view stereo network based on long short-term memory (LSTM) with adaptive aggregation, namely AA-RMVSNet. We firstly introduce an intra-view aggregation module to adaptively extract image features by using context-aware convolution and multi-scale aggregation, which efficiently improves the performance on challenging regions, such as thin objects and large low-textured surfaces. To overcome the difficulty of varying occlusion in complex scenes, we propose an inter-view cost volume aggregation module for adaptive pixel-wise view aggregation, which is able to preserve better-matched pairs among all views. The two proposed adaptive aggregation modules are lightweight, effective and complementary regarding improving the accuracy and completeness of 3D reconstruction. Instead of conventional 3D CNNs, we utilize a hybrid network with recurrent structure for cost volume regularization, which allows high-resolution reconstruction and finer hypothetical plane sweep. The proposed network is trained end-to-end and achieves excellent performance on various datasets. It ranks 1^st among all submissions on Tanks and Temples benchmark and achieves competitive results on DTU dataset, which exhibits strong generalizability and robustness. Implementation of our method is available at https://github.com/QT-Zhu/AA-RMVSNet.

READ FULL TEXT

page 1

page 5

page 6

page 11

page 13

page 14

page 15

page 16

research
12/06/2019

Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation

In this paper, we propose an effective and efficient pyramid multi-view ...
research
07/21/2020

Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking

In this paper, we propose an efficient and effective dense hybrid recurr...
research
12/09/2021

IterMVS: Iterative Probability Estimation for Efficient Multi-View Stereo

We present IterMVS, a new data-driven method for high-resolution multi-v...
research
11/29/2021

TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers

In this paper, we present TransMVSNet, based on our exploration of featu...
research
05/25/2021

SRH-Net: Stacked Recurrent Hourglass Network for Stereo Matching

The cost aggregation strategy shows a crucial role in learning-based ste...
research
07/24/2019

Recurrent Aggregation Learning for Multi-View Echocardiographic Sequences Segmentation

Multi-view echocardiographic sequences segmentation is crucial for clini...
research
06/10/2022

Out of Sight, Out of Mind: A Source-View-Wise Feature Aggregation for Multi-View Image-Based Rendering

To estimate the volume density and color of a 3D point in the multi-view...

Please sign up or login with your details

Forgot password? Click here to reset