ArrowGAN : Learning to Generate Videos by Learning Arrow of Time

01/11/2021
by   Kibeom Hong, et al.
40

Training GANs on videos is even more sophisticated than on images because videos have a distinguished dimension: time. While recent methods designed a dedicated architecture considering time, generated videos are still far from indistinguishable from real videos. In this paper, we introduce ArrowGAN framework, where the discriminators learns to classify arrow of time as an auxiliary task and the generators tries to synthesize forward-running videos. We argue that the auxiliary task should be carefully chosen regarding the target domain. In addition, we explore categorical ArrowGAN with recent techniques in conditional image generation upon ArrowGAN framework, achieving the state-of-the-art performance on categorical video generation. Our extensive experiments validate the effectiveness of arrow of time as a self-supervisory task, and demonstrate that all our components of categorical ArrowGAN lead to the improvement regarding video inception score and Frechet video distance on three datasets: Weizmann, UCFsports, and UCF-101.

READ FULL TEXT

page 2

page 3

page 6

page 8

page 9

page 10

research
04/01/2021

Collaborative Learning to Generate Audio-Video Jointly

There have been a number of techniques that have demonstrated the genera...
research
10/04/2018

Towards High Resolution Video Generation with Progressive Growing of Sliced Wasserstein GANs

The extension of image generation to video generation turns out to be a ...
research
05/06/2023

Multi-object Video Generation from Single Frame Layouts

In this paper, we study video synthesis with emphasis on simplifying the...
research
08/21/2019

Preserving Semantic and Temporal Consistency for Unpaired Video-to-Video Translation

In this paper, we investigate the problem of unpaired video-to-video tra...
research
02/08/2018

Learning to score the figure skating sports videos

This paper targets at learning to score the figure skating sports videos...
research
05/23/2020

Self-Training for Domain Adaptive Scene Text Detection

Though deep learning based scene text detection has achieved great progr...
research
08/30/2021

StackGAN: Facial Image Generation Optimizations

Current state-of-the-art photorealistic generators are computationally e...

Please sign up or login with your details

Forgot password? Click here to reset