Neural Video Depth Stabilizer

07/17/2023
by Yiran Wang, et al.

Video depth estimation aims to infer temporally consistent depth. Some methods achieve temporal consistency by finetuning a single-image depth model during test time using geometry and re-projection constraints, which is inefficient and not robust. An alternative approach is to learn how to enforce temporal consistency from data, but this requires well-designed models and sufficient video depth data. To address these challenges, we propose a plug-and-play framework called Neural Video Depth Stabilizer (NVDS) that stabilizes inconsistent depth estimations and can be applied to different single-image depth models without extra effort. We also introduce a large-scale dataset, Video Depth in the Wild (VDW), which consists of 14,203 videos with over two million frames, making it the largest natural-scene video depth dataset to our knowledge. We evaluate our method on the VDW dataset as well as two public benchmarks and demonstrate significant improvements in consistency, accuracy, and efficiency compared to previous approaches. Our work serves as a solid baseline and provides a data foundation for learning-based video depth models. We will release our dataset and code for future research.


research
04/15/2023

Temporally Consistent Online Depth Estimation Using Point-Based Fusion

Depth estimation is an important step in many computer vision problems s...
research
07/31/2022

Less is More: Consistent Video Depth Estimation with Masked Frames Modeling

Temporal consistency is the key challenge of video depth estimation. Pre...
research
06/25/2018

Learning Single-Image Depth from Videos using Quality Assessment Networks

Although significant progress has been made in recent years, depth estim...
research
04/30/2020

Consistent Video Depth Estimation

We present an algorithm for reconstructing dense, geometrically consiste...
research
04/22/2022

Learning Dynamic View Synthesis With Few RGBD Cameras

There have been significant advancements in dynamic novel view synthesis...
research
02/11/2015

Large-Scale Deep Learning on the YFCC100M Dataset

We present a work-in-progress snapshot of learning with a 15 billion par...
research
09/16/2019

Temporally Consistent Depth Prediction with Flow-Guided Memory Units

Predicting depth from a monocular video sequence is an important task fo...
