LiP-Flow: Learning Inference-time Priors for Codec Avatars via Normalizing Flows in Latent Space

03/15/2022
by Emre Aksan, et al.

Neural face avatars trained on multi-view data captured in camera domes can produce photo-realistic 3D reconstructions. At inference time, however, they must be driven by limited inputs, such as partial views recorded by headset-mounted cameras or a front-facing camera, and sparse facial landmarks. To mitigate this asymmetry, we introduce a prior model that is conditioned on the runtime inputs, and we tie this prior space to the 3D face model via a normalizing flow in the latent space. Our proposed model, LiP-Flow, consists of two encoders that learn representations from the rich training-time observations and the impoverished inference-time observations, respectively. A normalizing flow bridges the two representation spaces and transforms latent samples from one domain to the other, allowing us to define a latent likelihood objective. We train our model end-to-end to maximize both the similarity of the two representation spaces and the reconstruction quality, making the 3D face model aware of the limited driving signals. We conduct extensive evaluations in which the latent codes are optimized to reconstruct 3D avatars from partial or sparse observations. We show that our approach leads to an expressive and effective prior that better captures facial dynamics and subtle expressions.
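To make the idea of a latent likelihood objective concrete, the following is a minimal sketch of how a change-of-variables likelihood could be computed when a flow maps a latent sample from one space into another. It is not the paper's architecture: the elementwise affine flow, the diagonal Gaussian prior, and all function and parameter names (`affine_flow_forward`, `latent_log_likelihood`, `log_scale`, `shift`, and so on) are assumptions chosen for illustration. In a real model the prior parameters would presumably come from the encoder of the inference-time inputs, and the flow would be a learned, more expressive network.

```python
import math


def affine_flow_forward(z, log_scale, shift):
    """Hypothetical elementwise affine flow: u_i = z_i * exp(log_scale_i) + shift_i.

    Returns the transformed sample u and log|det(du/dz)|, which for an
    elementwise affine map is simply the sum of the log-scales.
    """
    u = [zi * math.exp(s) + t for zi, s, t in zip(z, log_scale, shift)]
    log_det = sum(log_scale)
    return u, log_det


def diag_gaussian_log_prob(x, mean, log_std):
    """Log-density of x under a diagonal Gaussian with the given mean and log-std."""
    return sum(
        -0.5 * math.log(2.0 * math.pi) - ls - 0.5 * ((xi - m) / math.exp(ls)) ** 2
        for xi, m, ls in zip(x, mean, log_std)
    )


def latent_log_likelihood(z_rich, prior_mean, prior_log_std, log_scale, shift):
    """Change-of-variables log-likelihood of a latent code from the rich encoder.

    Assumption: z_rich is a latent sample from the training-time encoder, and
    (prior_mean, prior_log_std) parameterize a Gaussian prior predicted from
    the inference-time inputs. The flow maps z_rich into the prior space, so
    log p(z_rich) = log N(u; prior_mean, prior_std) + log|det(du/dz)|.
    """
    u, log_det = affine_flow_forward(z_rich, log_scale, shift)
    return diag_gaussian_log_prob(u, prior_mean, prior_log_std) + log_det


# Usage: an identity flow (zero log-scale and shift) reduces to the plain
# Gaussian log-density of the latent code under the prior.
ll_identity = latent_log_likelihood(
    z_rich=[0.0, 0.0],
    prior_mean=[0.0, 0.0],
    prior_log_std=[0.0, 0.0],
    log_scale=[0.0, 0.0],
    shift=[0.0, 0.0],
)
```

Maximizing a quantity of this form with respect to the flow and both encoders would pull the two latent spaces together. In practice it would be combined with a reconstruction term, as the abstract describes, and implemented in an autodiff framework rather than plain Python.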
