Audio-driven Neural Gesture Reenactment with Video Motion Graphs

07/23/2022
by   Yang Zhou, et al.
0

Human speech is often accompanied by body gestures including arm and hand gestures. We present a method that reenacts a high-quality video with gestures matching a target speech audio. The key idea of our method is to split and re-assemble clips from a reference video through a novel video motion graph encoding valid transitions between clips. To seamlessly connect different clips in the reenactment, we propose a pose-aware video blending network which synthesizes video frames around the stitched frames between two clips. Moreover, we developed an audio-based gesture searching algorithm to find the optimal order of the reenacted frames. Our system generates reenactments that are consistent with both the audio rhythms and the speech content. We evaluate our synthesized video quality quantitatively, qualitatively, and with user studies, demonstrating that our method produces videos of much higher quality and consistency with the target audio compared to previous work and baselines.

READ FULL TEXT

page 2

page 3

page 8

page 9

page 10

page 11

page 12

page 15

research
12/05/2022

Audio-Driven Co-Speech Gesture Video Generation

Co-speech gesture is crucial for human-machine interaction and digital e...
research
06/13/2020

Dynamic gesture retrieval: searching videos by human pose sequence

The number of static human poses is limited, it is hard to retrieve the ...
research
06/10/2019

Learning Individual Styles of Conversational Gesture

Human speech is often accompanied by hand and arm gestures. Given audio ...
research
05/05/2022

Deep Neural Network approaches for Analysing Videos of Music Performances

This paper presents a framework to automate the labelling process for ge...
research
12/19/2017

Audio to Body Dynamics

We present a method that gets as input an audio of violin or piano playi...
research
07/04/2019

LumièreNet: Lecture Video Synthesis from Audio

We present LumièreNet, a simple, modular, and completely deep-learning b...
research
01/17/2023

Audio2Gestures: Generating Diverse Gestures from Audio

People may perform diverse gestures affected by various mental and physi...

Please sign up or login with your details

Forgot password? Click here to reset