PVP: Personalized Video Prior for Editable Dynamic Portraits using StyleGAN

06/29/2023
by   Kai-En Lin, et al.
0

Portrait synthesis creates realistic digital avatars which enable users to interact with others in a compelling way. Recent advances in StyleGAN and its extensions have shown promising results in synthesizing photorealistic and accurate reconstruction of human faces. However, previous methods often focus on frontal face synthesis and most methods are not able to handle large head rotations due to the training data distribution of StyleGAN. In this work, our goal is to take as input a monocular video of a face, and create an editable dynamic portrait able to handle extreme head poses. The user can create novel viewpoints, edit the appearance, and animate the face. Our method utilizes pivotal tuning inversion (PTI) to learn a personalized video prior from a monocular video sequence. Then we can input pose and expression coefficients to MLPs and manipulate the latent vectors to synthesize different viewpoints and expressions of the subject. We also propose novel loss functions to further disentangle pose and expression in the latent space. Our algorithm shows much better performance over previous approaches on monocular video datasets, and it is also capable of running in real-time at 54 FPS on an RTX 3080.

READ FULL TEXT

page 1

page 4

page 6

page 9

page 10

page 11

page 14

page 15

research
11/28/2022

High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors

High-fidelity facial avatar reconstruction from a monocular video is a s...
research
01/20/2022

Learning-by-Novel-View-Synthesis for Full-Face Appearance-based 3D Gaze Estimation

Despite recent advances in appearance-based gaze estimation techniques, ...
research
05/02/2023

LatentAvatar: Learning Latent Expression Code for Expressive Neural Head Avatar

Existing approaches to animatable NeRF-based head avatars are either bui...
research
06/04/2019

Text-based Editing of Talking-head Video

Editing talking-head video to change the speech content or to remove fil...
research
10/26/2016

Mask-off: Synthesizing Face Images in the Presence of Head-mounted Displays

A head-mounted display (HMD) could be an important component of augmente...
research
04/08/2021

Riggable 3D Face Reconstruction via In-Network Optimization

This paper presents a method for riggable 3D face reconstruction from mo...
research
12/20/2022

InstantAvatar: Learning Avatars from Monocular Video in 60 Seconds

In this paper, we take a significant step towards real-world applicabili...

Please sign up or login with your details

Forgot password? Click here to reset