Efficient Video Generation on Complex Datasets

07/15/2019
by   Aidan Clark, et al.
0

Generative models of natural images have progressed towards high fidelity samples by the strong leveraging of scale. We attempt to carry this success to the field of video modeling by showing that large Generative Adversarial Networks trained on the complex Kinetics-600 dataset are able to produce video samples of substantially higher complexity than previous work. Our proposed network, Dual Video Discriminator GAN (DVD-GAN), scales to longer and higher resolution videos by leveraging a computationally efficient decomposition of its discriminator. We evaluate on the related tasks of video synthesis and video prediction, and achieve new state of the art Frechet Inception Distance on prediction for Kinetics-600, as well as state of the art Inception Score for synthesis on the UCF-101 dataset, alongside establishing a number of strong baselines on Kinetics-600.

READ FULL TEXT

page 1

page 5

page 14

page 15

page 16

page 17

page 18

page 19

research
09/28/2018

Large Scale GAN Training for High Fidelity Natural Image Synthesis

Despite recent progress in generative image modeling, successfully gener...
research
04/10/2021

MobileStyleGAN: A Lightweight Convolutional Neural Network for High-Fidelity Image Synthesis

In recent years, the use of Generative Adversarial Networks (GANs) has b...
research
03/09/2020

Transformation-based Adversarial Video Prediction on Large-Scale Data

Recent breakthroughs in adversarial generative modeling have led to mode...
research
09/26/2021

Logo Generation Using Regional Features: A Faster R-CNN Approach to Generative Adversarial Networks

In this paper we introduce Local Logo Generative Adversarial Network (LL...
research
03/06/2019

High-Fidelity Image Generation With Fewer Labels

Deep generative models are becoming a cornerstone of modern machine lear...
research
04/20/2021

VideoGPT: Video Generation using VQ-VAE and Transformers

We present VideoGPT: a conceptually simple architecture for scaling like...
research
11/22/2018

TGANv2: Efficient Training of Large Models for Video Generation with Multiple Subsampling Layers

In this paper, we propose a novel method to efficiently train a Generati...

Please sign up or login with your details

Forgot password? Click here to reset