Neural Emotion Director: Speech-preserving semantic control of facial expressions in "in-the-wild" videos

In this paper, we introduce a novel deep learning method for photo-realistic manipulation of the emotional state of actors in "in-the-wild" videos. The proposed method is based on a parametric 3D face representation of the actor in the input scene, which offers a reliable disentanglement of the facial identity from the head pose and facial expressions. It then uses a novel deep domain translation framework that alters the facial expressions in a consistent and plausible manner, taking their dynamics into account. Finally, the altered facial expressions are used to photo-realistically manipulate the facial region in the input scene by means of a specially designed neural face renderer. To the best of our knowledge, our method is the first that can control the actor's facial expressions even when the sole input is the semantic labels of the manipulated emotions, while at the same time preserving the speech-related lip movements. We conduct extensive qualitative and quantitative evaluations and comparisons, which demonstrate the effectiveness of our approach and the promising results that it obtains. Our method opens up many new possibilities for useful applications of neural rendering technologies, ranging from movie post-production and video games to photo-realistic affective avatars.
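
As a rough illustration of the disentanglement idea described in the abstract, the following minimal Python sketch keeps per-frame identity and head pose fixed and replaces only the expression parameters, translating the whole sequence at once so that a translator could take dynamics into account. All names here (FaceParams, manipulate_emotion, toy_translator) and the offset values are hypothetical and are not taken from the paper; the toy translator is a stand-in for the learned domain translation network, and the 3D face fitting and neural rendering stages are omitted.

```python
from dataclasses import dataclass, replace
from typing import Callable, List, Sequence, Tuple

Vector = Tuple[float, ...]


@dataclass(frozen=True)
class FaceParams:
    """Hypothetical per-frame parametric face code (not the paper's format)."""
    identity: Vector
    pose: Vector
    expression: Vector


def manipulate_emotion(
    frames: Sequence[FaceParams],
    translate: Callable[[List[Vector], str], List[Vector]],
    emotion: str,
) -> List[FaceParams]:
    """Swap in translated expressions; identity and pose stay untouched."""
    # Pass the full expression sequence so the translator sees its dynamics.
    expressions = [f.expression for f in frames]
    translated = translate(expressions, emotion)
    if len(translated) != len(expressions):
        raise ValueError("translator must return one expression per frame")
    return [replace(f, expression=tuple(e)) for f, e in zip(frames, translated)]


def toy_translator(expressions: List[Vector], emotion: str) -> List[Vector]:
    """Toy stand-in for a learned translator: shifts every coefficient."""
    offsets = {"happy": 0.5, "sad": -0.5}  # assumed values, illustration only
    shift = offsets[emotion]
    return [tuple(x + shift for x in e) for e in expressions]


if __name__ == "__main__":
    video = [
        FaceParams(identity=(1.0,), pose=(0.0, 0.125), expression=(0.25, 0.5)),
        FaceParams(identity=(1.0,), pose=(0.0, 0.25), expression=(0.0, 0.25)),
    ]
    for frame in manipulate_emotion(video, toy_translator, "happy"):
        print(frame)
```

In this sketch the semantic label ("happy") is the only control input, mirroring the label-driven setting mentioned in the abstract; how the actual method preserves speech-related lip movements is not modeled here.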

research
11/15/2021

Deep Semantic Manipulation of Facial Videos

Editing and manipulating facial features in videos is an interesting and...
research
09/03/2022

Neural Sign Reenactor: Deep Photorealistic Sign Language Retargeting

In this paper, we introduce a neural rendering pipeline for transferring...
research
05/29/2018

Deep Video Portraits

We present a novel approach that enables photo-realistic re-animation of...
research
08/09/2018

Controllable Image-to-Video Translation: A Case Study on Facial Expression Generation

The recent advances in deep learning have made it possible to generate p...
research
05/22/2020

Head2Head: Video-based Neural Head Synthesis

In this paper, we propose a novel machine learning architecture for faci...
research
06/17/2020

Head2Head++: Deep Facial Attributes Re-Targeting

Facial video re-targeting is a challenging problem aiming to modify the ...
research
09/16/2021

Invertable Frowns: Video-to-Video Facial Emotion Translation

We present Wav2Lip-Emotion, a video-to-video translation architecture th...
