Consistent Depth of Moving Objects in Video

08/02/2021
by Zhoutong Zhang, et al.

We present a method to estimate depth of a dynamic scene, containing arbitrary moving objects, from an ordinary video captured with a moving camera. We seek a geometrically and temporally consistent solution to this underconstrained problem: the depth predictions of corresponding points across frames should induce plausible, smooth motion in 3D. We formulate this objective in a new test-time training framework where a depth-prediction CNN is trained in tandem with an auxiliary scene-flow prediction MLP over the entire input video. By recursively unrolling the scene-flow prediction MLP over varying time steps, we compute both short-range scene flow to impose local smooth motion priors directly in 3D, and long-range scene flow to impose multi-view consistency constraints with wide baselines. We demonstrate accurate and temporally coherent results on a variety of challenging videos containing diverse moving objects (pets, people, cars), as well as camera motion. Our depth maps give rise to a number of depth-and-motion aware video editing effects such as object and lighting insertion.
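
The recursive unrolling mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `unroll_scene_flow` and `toy_flow` are hypothetical, and the constant-velocity `toy_flow` stands in for the learned scene-flow MLP. The sketch assumes a one-step predictor that maps a 3D point and a frame index to the displacement toward the next frame. Applying it once gives a short-range flow, and applying it repeatedly gives a long-range correspondence.

```python
from typing import Callable, Tuple

Point = Tuple[float, float, float]
# Assumed interface: (3D point, frame index) -> displacement to the next frame.
FlowFn = Callable[[Point, int], Point]


def unroll_scene_flow(flow_fn: FlowFn, point: Point, t: int, steps: int) -> Point:
    """Apply a one-step scene-flow predictor `steps` times, starting at frame t.

    Returns the predicted 3D position of `point` at frame t + steps.
    """
    x, y, z = point
    for k in range(steps):
        # Query the predictor at the current position and current frame.
        dx, dy, dz = flow_fn((x, y, z), t + k)
        x, y, z = x + dx, y + dy, z + dz
    return (x, y, z)


def toy_flow(point: Point, t: int) -> Point:
    # Hypothetical stand-in for the MLP: 0.5 units per frame along x.
    return (0.5, 0.0, 0.0)


p0 = (0.0, 0.0, 2.0)
short_range = unroll_scene_flow(toy_flow, p0, t=0, steps=1)  # one frame ahead
long_range = unroll_scene_flow(toy_flow, p0, t=0, steps=4)   # four frames ahead
```

In a training setup along the lines the abstract describes, the one-step result would presumably feed a smoothness prior on 3D motion, while the multi-step result would be compared against the point recovered from the predicted depth at the distant frame. Both uses are assumptions about how such a loss could be wired, not details taken from the text above.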
