Multi-view Depth Estimation using Epipolar Spatio-Temporal Networks

11/26/2020
by Xiaoxiao Long, et al.

We present a novel method for multi-view depth estimation from a single video, a critical task in applications such as perception, reconstruction, and robot navigation. Although previous learning-based methods have demonstrated compelling results, most of them estimate the depth map of each video frame independently and ignore the strong geometric and temporal coherence among frames. Moreover, current state-of-the-art (SOTA) models mostly adopt a fully 3D convolutional network for cost regularization and therefore incur a high computational cost, which limits their deployment in real-world applications. Our method achieves temporally coherent depth estimation by using a novel Epipolar Spatio-Temporal (EST) transformer to explicitly associate geometric and temporal correlation with multiple estimated depth maps. Furthermore, to reduce the computational cost, inspired by recent Mixture-of-Experts models, we design a compact hybrid network consisting of a 2D context-aware network and a 3D matching network, which learn 2D context information and 3D disparity cues separately. Extensive experiments demonstrate that our method achieves higher depth estimation accuracy and a significant speedup compared with SOTA methods.
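To give a rough sense of why fully 3D cost regularization is expensive, the sketch below counts multiply-accumulate operations (MACs) for a single 3x3 2D convolution over a feature map and a single 3x3x3 3D convolution over a cost volume. It is a back-of-the-envelope illustration only: the function names and all tensor sizes (`H`, `W`, `D`, `C`) are hypothetical values chosen for the example, not figures from the paper, and the count ignores bias terms, strides, and any architecture-specific details.

```python
def conv2d_macs(h, w, c_in, c_out, k=3):
    """MACs for one stride-1, same-padded 2D convolution layer."""
    return h * w * c_in * c_out * k * k


def conv3d_macs(d, h, w, c_in, c_out, k=3):
    """MACs for one stride-1, same-padded 3D convolution layer.

    The extra axis d stands for the number of depth hypotheses
    in a cost volume.
    """
    return d * h * w * c_in * c_out * k ** 3


# Hypothetical sizes, chosen only for illustration (not from the paper).
H, W = 64, 80   # spatial resolution of the feature map
D = 64          # number of depth hypotheses in the cost volume
C = 32          # input and output channels

macs_2d = conv2d_macs(H, W, C, C)
macs_3d = conv3d_macs(D, H, W, C, C)

# With equal channels and kernel size, the ratio reduces to D * k.
ratio = macs_3d / macs_2d

print(f"2D layer: {macs_2d:,} MACs")
print(f"3D layer: {macs_3d:,} MACs")
print(f"3D / 2D ratio: {ratio:.0f}x")
```

Under these assumed sizes, each 3D layer costs `D * k` times as much as a 2D layer of the same width, which is consistent with the motivation for moving part of the work into a 2D context network.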


