LARNet: Latent Action Representation for Human Action Synthesis

10/21/2021
by   Naman Biyani, et al.
0

We present LARNet, a novel end-to-end approach for generating human action videos. A joint generative modeling of appearance and dynamics to synthesize a video is very challenging and therefore recent works in video synthesis have proposed to decompose these two factors. However, these methods require a driving video to model the video dynamics. In this work, we propose a generative approach instead, which explicitly learns action dynamics in latent space avoiding the need of a driving video during inference. The generated action dynamics is integrated with the appearance using a recurrent hierarchical structure which induces motion at different scales to focus on both coarse as well as fine level action details. In addition, we propose a novel mix-adversarial loss function which aims at improving the temporal coherency of synthesized videos. We evaluate the proposed approach on four real-world human action datasets demonstrating the effectiveness of the proposed approach in generating human actions. The code and models will be made publicly available.

READ FULL TEXT
research
06/27/2020

Compositional Video Synthesis with Action Graphs

Videos of actions are complex spatio-temporal signals, containing rich c...
research
01/28/2021

Playable Video Generation

This paper introduces the unsupervised learning problem of playable vide...
research
03/08/2021

Behavior-Driven Synthesis of Human Dynamics

Generating and representing human behavior are of major importance for v...
research
05/20/2017

Responsive Action-based Video Synthesis

We propose technology to enable a new medium of expression, where video ...
research
10/15/2021

Pose-guided Generative Adversarial Net for Novel View Action Synthesis

We focus on the problem of novel-view human action synthesis. Given an a...
research
08/26/2023

Empowering Dynamics-aware Text-to-Video Diffusion with Large Language Models

Text-to-video (T2V) synthesis has gained increasing attention in the com...
research
08/08/2016

Discriminatively Trained Latent Ordinal Model for Video Classification

We study the problem of video classification for facial analysis and hum...

Please sign up or login with your details

Forgot password? Click here to reset