Neural Style-Preserving Visual Dubbing

09/05/2019
by   Hyeongwoo Kim, et al.
18

Dubbing is a technique for translating video content from one language to another. However, state-of-the-art visual dubbing techniques directly copy facial expressions from source to target actors without considering identity-specific idiosyncrasies such as a unique type of smile. We present a style-preserving visual dubbing approach from single video inputs, which maintains the signature style of target actors when modifying facial expressions, including mouth motions, to match foreign languages. At the heart of our approach is the concept of motion style, in particular for facial expressions, i.e., the person-specific expression change that is yet another essential factor beyond visual accuracy in face editing applications. Our method is based on a recurrent generative adversarial network that captures the spatiotemporal co-activation of facial expressions, and enables generating and modifying the facial expressions of the target actor while preserving their style. We train our model with unsynchronized source and target videos in an unsupervised manner using cycle-consistency and mouth expression losses, and synthesize photorealistic video frames using a layered neural face renderer. Our approach generates temporally coherent results, and handles dynamic backgrounds. Our results show that our dubbing approach maintains the idiosyncratic style of the target actor better than previous approaches, even for widely differing source and target actors.

READ FULL TEXT

page 1

page 3

page 6

page 7

page 8

page 9

page 10

page 11

research
12/30/2022

Imitator: Personalized Speech-driven 3D Facial Animation

Speech-driven 3D facial animation has been widely explored, with applica...
research
07/29/2020

Face2Face: Real-time Face Capture and Reenactment of RGB Videos

We present Face2Face, a novel approach for real-time facial reenactment ...
research
12/18/2017

IMU2Face: Real-time Gesture-driven Facial Reenactment

We present IMU2Face, a gesture-driven facial reenactment system. To this...
research
02/08/2016

Automatic Face Reenactment

We propose an image-based, facial reenactment system that replaces the f...
research
12/09/2017

CycleGAN Face-off

Face-off is an interesting case of style transfer where the facial expre...
research
11/21/2020

Iterative Text-based Editing of Talking-heads Using Neural Retargeting

We present a text-based tool for editing talking-head video that enables...
research
05/22/2020

Head2Head: Video-based Neural Head Synthesis

In this paper, we propose a novel machine learning architecture for faci...

Please sign up or login with your details

Forgot password? Click here to reset