Exploiting video sequences for unsupervised disentangling in generative adversarial networks

10/16/2019
by   Facundo Tuesca, et al.
0

In this work we present an adversarial training algorithm that exploits correlations in video to learn –without supervision– an image generator model with a disentangled latent space. The proposed methodology requires only a few modifications to the standard algorithm of Generative Adversarial Networks (GAN) and involves training with sets of frames taken from short videos. We train our model over two datasets of face-centered videos which present different people speaking or moving the head: VidTIMIT and YouTube Faces datasets. We found that our proposal allows us to split the generator latent space into two subspaces. One of them controls content attributes, those that do not change along short video sequences. For the considered datasets, this is the identity of the generated face. The other subspace controls motion attributes, those attributes that are observed to change along short videos. We observed that these motion attributes are face expressions, head orientation, lips and eyes movement. The presented experiments provide quantitative and qualitative evidence supporting that the proposed methodology induces a disentangling of this two kinds of attributes in the latent space.

READ FULL TEXT

page 2

page 4

page 5

page 6

page 7

page 8

research
06/01/2023

We never go out of Style: Motion Disentanglement by Subspace Decomposition of Latent Space

Real-world objects perform complex motions that involve multiple indepen...
research
02/10/2019

A Generative 3D Facial Model by Adversarial Training

We consider data-driven generative models for the 3D face, and focus in ...
research
12/09/2017

CycleGAN Face-off

Face-off is an interesting case of style transfer where the facial expre...
research
01/18/2022

Autoencoding Video Latents for Adversarial Video Generation

Given the three dimensional complexity of a video signal, training a rob...
research
07/15/2021

StyleVideoGAN: A Temporal Generative Model using a Pretrained StyleGAN

Generative adversarial models (GANs) continue to produce advances in ter...
research
03/30/2022

High-resolution Face Swapping via Latent Semantics Disentanglement

We present a novel high-resolution face swapping method using the inhere...
research
08/11/2023

Head Rotation in Denoising Diffusion Models

Denoising Diffusion Models (DDM) are emerging as the cutting-edge techno...

Please sign up or login with your details

Forgot password? Click here to reset