Unsupervised Robust Disentangling of Latent Characteristics for Image Synthesis

10/22/2019
by   Patrick Esser, et al.

Deep generative models come with the promise of learning an explainable representation for visual objects that allows image sampling, synthesis, and selective modification. The main challenge is to properly model the independent latent characteristics of an object, especially its appearance and pose. We present a novel approach that learns disentangled representations of these characteristics and explains them individually. Training requires only pairs of images depicting the same object appearance, but no pose annotations. We propose an additional classifier that estimates the minimal amount of regularization required to enforce disentanglement. Thus, both representations together can completely explain an image while being independent of each other. Previous methods based on adversarial approaches fail to enforce this independence, while methods based on variational approaches lead to uninformative representations. In experiments on diverse object categories, the approach successfully recombines pose and appearance to reconstruct and retarget novel synthesized images. We achieve significant improvements over state-of-the-art methods that use the same level of supervision, and reach performance comparable to that of pose-supervised approaches. Unlike those approaches, however, we can handle the vast body of articulated object classes for which no pose models or annotations are available.


