Deep Video Generation, Prediction and Completion of Human Action Sequences

11/23/2017
by   Haoye Cai, et al.
0

Current deep learning results on video generation are limited while there are only a few first results on video prediction and no relevant significant results on video completion. This is due to the severe ill-posedness inherent in these three problems. In this paper, we focus on human action videos, and propose a general, two-stage deep framework to generate human action videos with no constraints or arbitrary number of constraints, which uniformly address the three problems: video generation given no input frames, video prediction given the first few frames, and video completion given the first and last frames. To make the problem tractable, in the first stage we train a deep generative model that generates a human pose sequence from random noise. In the second stage, a skeleton-to-image network is trained, which is used to generate a human action video given the complete human pose sequence generated in the first stage. By introducing the two-stage strategy, we sidestep the original ill-posed problems while producing for the first time high-quality video generation/prediction/completion results of much longer duration. We present quantitative and qualitative evaluation to show that our two-stage approach outperforms state-of-the-art methods in video generation, prediction and video completion. Our video result demonstration can be viewed at https://iamacewhite.github.io/supp/index.html

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/30/2018

Pose Guided Human Video Generation

Due to the emergence of Generative Adversarial Networks, video synthesis...
research
07/26/2018

Learning to Forecast and Refine Residual Motion for Image-to-Video Generation

We consider the problem of image-to-video translation, where an input im...
research
08/23/2018

Deep Portrait Image Completion and Extrapolation

General image completion and extrapolation methods often fail on portrai...
research
11/23/2022

Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation

Generating a video given the first several static frames is challenging ...
research
08/28/2023

MagicAvatar: Multimodal Avatar Generation and Animation

This report presents MagicAvatar, a framework for multimodal video gener...
research
07/25/2021

Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos

Given a video demonstration, can we imitate the action contained in this...
research
08/30/2019

Generating Persuasive Visual Storylines for Promotional Videos

Video contents have become a critical tool for promoting products in E-c...

Please sign up or login with your details

Forgot password? Click here to reset