PVP: Personalized Video Prior for Editable Dynamic Portraits using StyleGAN

by Kai-En Lin, et al.

Portrait synthesis creates realistic digital avatars that let users interact with others in a compelling way. Recent advances in StyleGAN and its extensions have shown promising results in photorealistic synthesis and accurate reconstruction of human faces. However, previous methods often focus on frontal faces, and most cannot handle large head rotations because of the training data distribution of StyleGAN. In this work, our goal is to take a monocular video of a face as input and create an editable dynamic portrait that can handle extreme head poses. The user can create novel viewpoints, edit the appearance, and animate the face. Our method uses pivotal tuning inversion (PTI) to learn a personalized video prior from a monocular video sequence. We then feed pose and expression coefficients to MLPs and manipulate the latent vectors to synthesize different viewpoints and expressions of the subject. We also propose novel loss functions to further disentangle pose and expression in the latent space. Our algorithm substantially outperforms previous approaches on monocular video datasets and runs in real time at 54 FPS on an RTX 3080.
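The latent manipulation step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the additive-offset formulation, the layer sizes, the coefficient dimensions, and all names (`init_mlp`, `mlp`, `edit_latent`) are assumptions made for illustration, and the StyleGAN generator itself is omitted.

```python
import numpy as np

# Assumed shapes: a W+ latent of 18 style vectors of width 512.
NUM_LAYERS, LATENT_DIM = 18, 512
# Hypothetical sizes for the pose and expression coefficient vectors.
POSE_DIM, EXPR_DIM = 3, 64


def init_mlp(rng, in_dim, hidden, out_dim):
    """Create weights for a two-layer MLP (untrained, for illustration only)."""
    return {
        "w1": rng.normal(0.0, 0.01, (in_dim, hidden)),
        "b1": np.zeros(hidden),
        "w2": rng.normal(0.0, 0.01, (hidden, out_dim)),
        "b2": np.zeros(out_dim),
    }


def mlp(params, x):
    """Forward pass: linear, ReLU, linear."""
    h = np.maximum(x @ params["w1"] + params["b1"], 0.0)
    return h @ params["w2"] + params["b2"]


def edit_latent(w_pivot, pose, expr, pose_mlp, expr_mlp):
    """Shift a pivot latent by offsets predicted from pose and expression.

    Adding the two offsets to the pivot is an assumption of this sketch;
    the abstract only states that MLPs manipulate the latent vectors.
    """
    delta = mlp(pose_mlp, pose) + mlp(expr_mlp, expr)
    return w_pivot + delta.reshape(NUM_LAYERS, LATENT_DIM)


rng = np.random.default_rng(0)
pose_mlp = init_mlp(rng, POSE_DIM, 128, NUM_LAYERS * LATENT_DIM)
expr_mlp = init_mlp(rng, EXPR_DIM, 128, NUM_LAYERS * LATENT_DIM)

w_pivot = rng.normal(size=(NUM_LAYERS, LATENT_DIM))  # stand-in for an inverted frame
pose = np.array([0.3, -0.1, 0.0])                    # e.g. yaw, pitch, roll (assumed)
expr = rng.normal(size=EXPR_DIM)                     # stand-in expression coefficients

w_edit = edit_latent(w_pivot, pose, expr, pose_mlp, expr_mlp)
```

In a full pipeline, `w_edit` would be passed to the personalized (PTI-tuned) generator to render the frame; keeping pose and expression in separate MLPs is one simple way to let each be changed independently.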



