Stochastic Video Prediction with Structure and Motion

03/20/2022
by   Adil Kaan Akan, et al.
4

While stochastic video prediction models enable future prediction under uncertainty, they mostly fail to model the complex dynamics of real-world scenes. For example, they cannot provide reliable predictions for scenes with a moving camera and independently moving foreground objects in driving scenarios. The existing methods fail to fully capture the dynamics of the structured world by only focusing on changes in pixels. In this paper, we assume that there is an underlying process creating observations in a video and propose to factorize it into static and dynamic components. We model the static part based on the scene structure and the ego-motion of the vehicle, and the dynamic part based on the remaining motion of the dynamic objects. By learning separate distributions of changes in foreground and background, we can decompose the scene into static and dynamic parts and separately model the change in each. Our experiments demonstrate that disentangling structure and motion helps stochastic video prediction, leading to better future predictions in complex driving scenarios on two real-world driving datasets, KITTI and Cityscapes.

READ FULL TEXT

page 8

page 19

page 20

page 21

page 22

page 23

page 25

page 27

research
08/05/2021

SLAMP: Stochastic Latent Appearance and Motion Prediction

Motion is an important cue for video prediction and often utilized by se...
research
03/18/2022

Discovering Objects that Can Move

This paper studies the problem of object discovery – separating objects ...
research
10/16/2022

Stochastic Occupancy Grid Map Prediction in Dynamic Scenes

This paper presents two variations of a novel stochastic prediction algo...
research
03/02/2023

Semantic Attention Flow Fields for Dynamic Scene Decomposition

We present SAFF: a dynamic neural volume reconstruction of a casual mono...
research
06/25/2018

Unsupervised Learning of Sensorimotor Affordances by Stochastic Future Prediction

Recently, much progress has been made building systems that can capture ...
research
11/20/2018

Learning to Detect Instantaneous Changes with Retrospective Convolution and Static Sample Synthesis

Change detection has been a challenging visual task due to the dynamic n...
research
09/14/2023

OmnimatteRF: Robust Omnimatte with 3D Background Modeling

Video matting has broad applications, from adding interesting effects to...

Please sign up or login with your details

Forgot password? Click here to reset