Fourier-based Video Prediction through Relational Object Motion

10/12/2021
by Malte Mosbach, et al.

The ability to predict future outcomes conditioned on observed video frames is crucial for intelligent decision-making in autonomous systems. Recently, deep recurrent architectures have been applied to the task of video prediction. However, these architectures often produce blurry predictions and require tedious training on large datasets. Here, we explore a different approach by (1) using frequency-domain methods for video prediction and (2) explicitly inferring object-motion relationships in the observed scene. The resulting predictions are consistent with the observed dynamics in a scene and do not suffer from blur.
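
To make the frequency-domain idea concrete, here is a minimal sketch. It is not the authors' model: it relies only on the Fourier shift theorem, assumes a single global translation with wrap-around at the image borders, and ignores the relational object reasoning described above. The function name `predict_next_frame`, the `eps` parameter, and the toy frames are hypothetical and chosen for illustration.

```python
import numpy as np


def predict_next_frame(prev_frame, curr_frame, eps=1e-8):
    """Extrapolate one frame ahead by repeating the observed phase change.

    Illustrative assumption: the scene undergoes one global, circular
    translation between frames. This is not the method from the paper.
    """
    prev_spec = np.fft.fft2(prev_frame)
    curr_spec = np.fft.fft2(curr_frame)

    # Normalised cross-power spectrum: for a pure translation this is a
    # unit-magnitude phase factor per frequency (Fourier shift theorem).
    cross = curr_spec * np.conj(prev_spec)
    phase_step = cross / np.maximum(np.abs(cross), eps)

    # Applying the same phase change once more shifts the current frame
    # by the same displacement again.
    next_spec = curr_spec * phase_step
    return np.real(np.fft.ifft2(next_spec))


# Toy scene: a 3x3 bright square moving by (1, 2) pixels per frame.
frame0 = np.zeros((16, 16))
frame0[4:7, 3:6] = 1.0
frame1 = np.roll(frame0, shift=(1, 2), axis=(0, 1))
expected = np.roll(frame1, shift=(1, 2), axis=(0, 1))

predicted = predict_next_frame(frame0, frame1)
max_error = np.max(np.abs(predicted - expected))
```

Because the magnitude spectrum is carried over unchanged and only the phase is advanced, the extrapolated square stays sharp in this toy case, which gives some intuition for why frequency-domain prediction can avoid blur. Scenes with several independently moving objects need more than one global phase shift, which is where explicit object-motion reasoning comes in.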

research
04/20/2021

Learning Semantic-Aware Dynamics for Video Prediction

We propose an architecture and training scheme to predict video frames b...
research
05/10/2021

Local Frequency Domain Transformer Networks for Video Prediction

Video prediction is commonly referred to as forecasting future frames of...
research
06/06/2019

3D-RelNet: Joint Object and Relational Network for 3D Prediction

We propose an approach to predict the 3D shape and pose for the objects ...
research
04/21/2022

Learning Future Object Prediction with a Spatiotemporal Detection Transformer

We explore future object prediction – a challenging problem where all ob...
research
03/17/2022

Video Prediction at Multiple Scales with Hierarchical Recurrent Networks

Autonomous systems not only need to understand their current environment...
research
07/03/2020

Video Prediction via Example Guidance

In video prediction tasks, one major challenge is to capture the multi-m...
research
08/22/2019

Compositional Video Prediction

We present an approach for pixel-level future prediction given an input ...
