Disentangling 3D Attributes from a Single 2D Image: Human Pose, Shape and Garment

08/05/2022
by   Xue Hu, et al.
0

For visual manipulation tasks, we aim to represent image content with semantically meaningful features. However, learning implicit representations from images often lacks interpretability, especially when attributes are intertwined. We focus on the challenging task of extracting disentangled 3D attributes only from 2D image data. Specifically, we focus on human appearance and learn implicit pose, shape and garment representations of dressed humans from RGB images. Our method learns an embedding with disentangled latent representations of these three image properties and enables meaningful re-assembling of features and property control through a 2D-to-3D encoder-decoder structure. The 3D model is inferred solely from the feature map in the learned embedding space. To the best of our knowledge, our method is the first to achieve cross-domain disentanglement for this highly under-constrained problem. We qualitatively and quantitatively demonstrate our framework's ability to transfer pose, shape, and garments in 3D reconstruction on virtual data and show how an implicit shape loss can benefit the model's ability to recover fine-grained reconstruction details.

READ FULL TEXT
research
03/29/2022

3D Shape Reconstruction from 2D Images with Disentangled Attribute Flow

Reconstructing 3D shape from a single 2D image is a challenging task, wh...
research
01/28/2019

PuppetGAN: Transferring Disentangled Properties from Synthetic to Real Images

In this work we propose a model that enables controlled manipulation of ...
research
11/30/2021

LatentHuman: Shape-and-Pose Disentangled Latent Representation for Human Bodies

3D representation and reconstruction of human bodies have been studied f...
research
04/22/2021

Cross-Domain and Disentangled Face Manipulation with 3D Guidance

Face image manipulation via three-dimensional guidance has been widely a...
research
04/19/2021

LaLaLoc: Latent Layout Localisation in Dynamic, Unvisited Environments

We present LaLaLoc to localise in environments without the need for prio...
research
12/20/2016

From Images to 3D Shape Attributes

Our goal in this paper is to investigate properties of 3D shape that can...
research
12/06/2018

Cross-Domain 3D Equivariant Image Embeddings

Spherical convolutional networks have been introduced recently as tools ...

Please sign up or login with your details

Forgot password? Click here to reset