Temporally Consistent Depth Prediction with Flow-Guided Memory Units

09/16/2019
by   Chanho Eom, et al.
2

Predicting depth from a monocular video sequence is an important task for autonomous driving. Although it has advanced considerably in the past few years, recent methods based on convolutional neural networks (CNNs) discard temporal coherence in the video sequence and estimate depth independently for each frame, which often leads to undesired inconsistent results over time. To address this problem, we propose to memorize temporal consistency in the video sequence, and leverage it for the task of depth prediction. To this end, we introduce a two-stream CNN with a flow-guided memory module, where each stream encodes visual and temporal features, respectively. The memory module, implemented using convolutional gated recurrent units (ConvGRUs), inputs visual and temporal features sequentially together with optical flow tailored to our task. It memorizes trajectories of individual features selectively and propagates spatial information over time, enforcing a long-term temporal consistency to prediction results. We evaluate our method on the KITTI benchmark dataset in terms of depth prediction accuracy, temporal consistency and runtime, and achieve a new state of the art. We also provide an extensive experimental analysis, clearly demonstrating the effectiveness of our approach to memorizing temporal consistency for depth prediction.

READ FULL TEXT

page 1

page 2

page 3

page 8

page 9

page 11

research
12/01/2017

Learning to Segment Moving Objects

We study the problem of segmenting moving objects in unconstrained video...
research
05/04/2019

Learning Spatio-Temporal Features with Two-Stream Deep 3D CNNs for Lipreading

We focus on the word-level visual lipreading, which requires recognizing...
research
07/26/2023

MAMo: Leveraging Memory and Attention for Monocular Video Depth Estimation

We propose MAMo, a novel memory and attention frame-work for monocular v...
research
07/31/2022

Less is More: Consistent Video Depth Estimation with Masked Frames Modeling

Temporal consistency is the key challenge of video depth estimation. Pre...
research
08/10/2019

Exploiting temporal consistency for real-time video depth estimation

Accuracy of depth estimation from static images has been significantly i...
research
07/17/2023

Neural Video Depth Stabilizer

Video depth estimation aims to infer temporally consistent depth. Some m...
research
04/01/2021

Neural Video Portrait Relighting in Real-time via Consistency Modeling

Video portraits relighting is critical in user-facing human photography,...

Please sign up or login with your details

Forgot password? Click here to reset