Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation

08/20/2019
by   Sun Yu, et al.
4

We describe an end-to-end method for recovering 3D human body mesh from single images and monocular videos. Different from the existing methods try to obtain all the complex 3D pose, shape, and camera parameters from one coupling feature, we propose a skeleton-disentangling based framework, which divides this task into multi-level spatial and temporal granularity in a decoupling manner. In spatial, we propose an effective and pluggable "disentangling the skeleton from the details" (DSD) module. It reduces the complexity and decouples the skeleton, which lays a good foundation for temporal modeling. In temporal, the self-attention based temporal convolution network is proposed to efficiently exploit the short and long-term temporal cues. Furthermore, an unsupervised adversarial training strategy, temporal shuffles and order recovery, is designed to promote the learning of motion dynamics. The proposed method outperforms the state-of-the-art 3D human mesh recovery methods by 15.4 MPJPE and 23.8 achieved on the 3D pose in the wild (3DPW) dataset without any fine-tuning. Especially, ablation studies demonstrate that skeleton-disentangled representation is crucial for better temporal modeling and generalization.

READ FULL TEXT

page 3

page 6

page 7

research
08/17/2020

Spatial Temporal Transformer Network for Skeleton-based Action Recognition

Skeleton-based Human Activity Recognition has achieved a great interest ...
research
12/29/2018

Skeleton Transformer Networks: 3D Human Pose and Skinned Mesh from Single RGB Image

In this paper, we present Skeleton Transformer Networks (SkeletonNet), a...
research
03/16/2021

PC-HMR: Pose Calibration for 3D Human Mesh Recovery from 2D Images/Videos

The end-to-end Human Mesh Recovery (HMR) approach has been successfully ...
research
12/19/2021

MoCaNet: Motion Retargeting in-the-wild via Canonicalization Networks

We present a novel framework that brings the 3D motion retargeting task ...
research
03/10/2023

GATOR: Graph-Aware Transformer with Motion-Disentangled Regression for Human Mesh Recovery from a 2D Pose

3D human mesh recovery from a 2D pose plays an important role in various...
research
12/06/2018

Unsupervised Feature Learning of Human Actions as Trajectories in Pose Embedding Manifold

An unsupervised human action modeling framework can provide useful pose-...
research
10/27/2020

Synthetic Training for Monocular Human Mesh Recovery

Recovering 3D human mesh from monocular images is a popular topic in com...

Please sign up or login with your details

Forgot password? Click here to reset