Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

04/11/2019
by   Georgios Pavlakos, et al.

To facilitate the analysis of human actions, interactions, and emotions, we compute a 3D model of human body pose, hand pose, and facial expression from a single monocular image. To achieve this, we use thousands of 3D scans to train a new, unified 3D model of the human body, SMPL-X, that extends SMPL with fully articulated hands and an expressive face. Learning to regress the parameters of SMPL-X directly from images is challenging without paired images and 3D ground truth. Consequently, we follow the approach of SMPLify, which estimates 2D features and then optimizes model parameters to fit those features. We improve on SMPLify in several significant ways: (1) we detect 2D features corresponding to the face, hands, and feet and fit the full SMPL-X model to these; (2) we train a new neural network pose prior using a large MoCap dataset; (3) we define a new interpenetration penalty that is both fast and accurate; (4) we automatically detect gender and select the appropriate body model (male, female, or neutral); (5) our PyTorch implementation achieves a speedup of more than 8x over Chumpy. We use the new method, SMPLify-X, to fit SMPL-X to both controlled images and images in the wild. We evaluate 3D accuracy on a new curated dataset comprising 100 images with pseudo ground-truth. This is a step towards automatic expressive human capture from monocular RGB data. The models, code, and data are available for research purposes at https://smpl-x.is.tue.mpg.de.
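The fitting strategy described in the abstract, estimating 2D features and then optimizing model parameters so that the projected model matches them, can be illustrated with a deliberately tiny sketch. It is not the SMPLify-X implementation. The body model is replaced by a hypothetical rigid set of five 3D joints, the only free parameters are a global translation, the camera is an assumed pinhole with unit focal length, and plain gradient descent stands in for the optimizer. The per-keypoint confidence weights are likewise an assumption of this sketch, and all names (`project`, `reprojection_loss`, `fit_translation`, the joint coordinates) are invented for illustration.

```python
# Toy "fit a 3D model to 2D keypoints" example (pure Python, no dependencies).
# Assumption: a rigid joint set stands in for the body model; only a global
# translation (tx, ty, tz) is optimized. This is NOT the SMPLify-X objective.

def project(point, focal=1.0):
    """Pinhole projection of a 3D point (x, y, z) to normalized image coords."""
    x, y, z = point
    return (focal * x / z, focal * y / z)


def reprojection_loss(joints, keypoints, confidences, t, focal=1.0):
    """Confidence-weighted squared 2D distance between projections and keypoints."""
    total = 0.0
    for (jx, jy, jz), (u_obs, v_obs), w in zip(joints, keypoints, confidences):
        u, v = project((jx + t[0], jy + t[1], jz + t[2]), focal)
        total += w * ((u - u_obs) ** 2 + (v - v_obs) ** 2)
    return total


def fit_translation(joints, keypoints, confidences, init,
                    lr=0.5, steps=5000, focal=1.0):
    """Minimize the reprojection loss over the translation by gradient descent."""
    tx, ty, tz = init
    for _ in range(steps):
        gx = gy = gz = 0.0
        for (jx, jy, jz), (u_obs, v_obs), w in zip(joints, keypoints, confidences):
            X, Y, Z = jx + tx, jy + ty, jz + tz
            ru = focal * X / Z - u_obs  # horizontal residual
            rv = focal * Y / Z - v_obs  # vertical residual
            # Analytic gradient of w * (ru^2 + rv^2) w.r.t. (tx, ty, tz).
            gx += 2.0 * w * ru * focal / Z
            gy += 2.0 * w * rv * focal / Z
            gz -= 2.0 * w * focal * (ru * X + rv * Y) / (Z * Z)
        tx, ty, tz = tx - lr * gx, ty - lr * gy, tz - lr * gz
    return (tx, ty, tz)


# Hypothetical joint template (meters) and made-up detection confidences.
joints = [(0.0, 0.0, 0.0), (0.2, 0.5, 0.0), (-0.2, 0.5, 0.1),
          (0.15, -0.5, 0.05), (-0.15, -0.5, -0.05)]
confidences = [1.0, 0.9, 0.8, 0.9, 0.7]

# Synthesize noise-free "detected" keypoints from a known translation.
true_translation = (0.1, -0.2, 3.0)
keypoints = [project((jx + true_translation[0],
                      jy + true_translation[1],
                      jz + true_translation[2])) for jx, jy, jz in joints]

estimate = fit_translation(joints, keypoints, confidences, init=(0.0, 0.0, 2.5))
final_loss = reprojection_loss(joints, keypoints, confidences, estimate)
print("estimated translation:", estimate)
print("final reprojection loss:", final_loss)
```

In the method the abstract describes, the optimized variables belong to the full SMPL-X model and the objective additionally involves the learned pose prior and the interpenetration penalty. Only a reprojection-style data term is sketched here.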


research
03/05/2021

Real-time RGBD-based Extended Body Pose Estimation

We present a system for real-time RGBD-based estimation of 3D human pose...
research
05/16/2019

Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision

The estimation of 3D face shape from a single image must be robust to va...
research
08/20/2020

Monocular Expressive Body Regression through Body-Driven Attention

To understand how people look, interact, or perform tasks, we need to qu...
research
12/11/2020

Monocular Real-time Full Body Capture with Inter-part Correlations

We present the first method for real-time full body capture that estimat...
research
05/11/2021

Collaborative Regression of Expressive Bodies using Moderation

Recovering expressive humans from images is essential for understanding ...
research
01/29/2021

Neural 3D Clothes Retargeting from a Single Image

In this paper, we present a method of clothes retargeting; generating th...
research
01/31/2017

Deep Multitask Architecture for Integrated 2D and 3D Human Sensing

We propose a deep multitask architecture for fully automatic 2d and 3d h...
