Towards Accurate Generative Models of Video: A New Metric & Challenges

12/03/2018
by   Thomas Unterthiner, et al.
0

Recent advances in deep generative models have lead to remarkable progress in synthesizing high quality images. Following their successful application in image processing and representation learning, an important next step is to consider videos. Learning generative models of video is a much harder task, requiring a model to capture the temporal dynamics of a scene, in addition to the visual presentation of objects. Although recent attempts at formulating generative models of video have had some success, current progress is hampered by (1) the lack of qualitative metrics that consider visual quality, temporal coherence, and diversity of samples, and (2) the wide gap between purely synthetic video datasets and challenging real-world datasets in terms of complexity. To this extent we propose Fréchet Video Distance (FVD), a new metric for generative models of video based on FID, and StarCraft 2 Videos (SCV), a collection of progressively harder datasets that challenge the capabilities of the current iteration of generative models for video. We conduct a large-scale human study, which confirms that FVD correlates well with qualitative human judgment of generated videos, and provide initial benchmark results on SCV.

READ FULL TEXT

page 1

page 4

page 6

page 11

page 13

page 14

page 15

page 16

research
11/21/2021

Video Content Swapping Using GAN

Video generation is an interesting problem in computer vision. It is qui...
research
10/17/2018

A Case for Object Compositionality in Deep Generative Models of Images

Deep generative models seek to recover the process with which the observ...
research
01/06/2018

A Note on the Inception Score

Deep generative models are powerful tools that have produced impressive ...
research
05/20/2022

Diversity vs. Recognizability: Human-like generalization in one-shot generative models

Robust generalization to new concepts has long remained a distinctive fe...
research
03/02/2023

Counterfactual Edits for Generative Evaluation

Evaluation of generative models has been an underrepresented field despi...
research
08/27/2016

Learning Temporal Transformations From Time-Lapse Videos

Based on life-long observations of physical, chemical, and biologic phen...
research
09/07/2015

An end-to-end generative framework for video segmentation and recognition

We describe an end-to-end generative approach for the segmentation and r...

Please sign up or login with your details

Forgot password? Click here to reset