Temporal View Synthesis of Dynamic Scenes through 3D Object Motion Estimation with Multi-Plane Images

08/19/2022
by Nagabhushan Somraj, et al.

The challenge of graphically rendering high-frame-rate videos on low-compute devices can be addressed by periodically predicting future frames, which improves the user experience in virtual reality applications. This is studied through the problem of temporal view synthesis (TVS), where the goal is to predict the next frames of a video given the previous frames and the head poses of the previous and next frames. In this work, we consider the TVS of dynamic scenes in which both the user and the objects are moving. We design a framework that decouples the motion into user motion and object motion, so that the available user motion can be used effectively while predicting the next frames. We predict the motion of objects by isolating and estimating the 3D object motion in the past frames and then extrapolating it. We employ multi-plane images (MPI) as a 3D representation of the scenes and model the object motion as the 3D displacement between corresponding points in the MPI representation. To handle the sparsity of MPIs while estimating the motion, we incorporate partial convolutions and masked correlation layers to estimate corresponding points. The predicted object motion is then integrated with the given user (camera) motion to generate the next frame. Using a disocclusion infilling module, we synthesize the regions uncovered by the camera and object motion. We develop a new synthetic dataset for TVS of dynamic scenes consisting of 800 videos at full HD resolution. We show through experiments on our dataset and the MPI Sintel dataset that our model outperforms all competing methods in the literature.
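Two of the ideas in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `composite_pixel` and `extrapolate_point` are hypothetical, the standard back-to-front "over" operator is used to render one pixel from its MPI planes, and constant-velocity (linear) extrapolation is assumed for the 3D object motion, since the abstract does not say how the extrapolation is done.

```python
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]


def composite_pixel(colors: Sequence[Vec3], alphas: Sequence[float]) -> Vec3:
    """Render one pixel from its MPI planes with the 'over' operator.

    colors[0] / alphas[0] belong to the farthest plane; later entries
    are progressively closer to the camera (back-to-front order).
    """
    out: Vec3 = (0.0, 0.0, 0.0)
    for color, alpha in zip(colors, alphas):
        # The nearer plane covers what has been accumulated so far.
        out = tuple(alpha * c + (1.0 - alpha) * o for c, o in zip(color, out))
    return out


def extrapolate_point(p_prev: Vec3, p_curr: Vec3, steps: float = 1.0) -> Vec3:
    """Assumed constant-velocity model: p_next = p_curr + steps * (p_curr - p_prev)."""
    return tuple(c + steps * (c - p) for p, c in zip(p_prev, p_curr))


if __name__ == "__main__":
    # Opaque red far plane behind a half-transparent green near plane.
    print(composite_pixel([(1.0, 0.0, 0.0), (0.0, 1.0, 0.0)], [1.0, 0.5]))
    # A point that moved one unit along x between the two past frames.
    print(extrapolate_point((0.0, 0.0, 1.0), (1.0, 0.0, 1.0)))
```

In this sketch the 3D displacement is the difference between a point's positions in two past frames. The extrapolated position would then be reprojected with the known camera pose of the next frame, a step that is omitted here.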


research
06/16/2021

Unsupervised Video Prediction from a Single Frame by Estimating 3D Dynamic Scene Structure

Our goal in this work is to generate realistic videos given just one ini...
research
10/17/2021

Revealing Disocclusions in Temporal View Synthesis through Infilling Vector Prediction

We consider the problem of temporal view synthesis, where the goal is to...
research
07/24/2023

ExWarp: Extrapolation and Warping-based Temporal Supersampling for High-frequency Displays

High-frequency displays are gaining immense popularity because of their ...
research
03/10/2023

Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction

The Multiplane Image (MPI), containing a set of fronto-parallel RGBA lay...
research
05/15/2018

Topological Eulerian Synthesis of Slow Motion Periodic Videos

We consider the problem of taking a video that is comprised of multiple ...
research
03/29/2023

DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and Tracking

Recent multi-camera 3D object detectors usually leverage temporal inform...
research
07/02/2020

Understanding Road Layout from Videos as a Whole

In this paper, we address the problem of inferring the layout of complex...
