DeepAI
Log In Sign Up

Deep Monocular 3D Human Pose Estimation via Cascaded Dimension-Lifting

04/08/2021
by   Changgong Zhang, et al.
0

The 3D pose estimation from a single image is a challenging problem due to depth ambiguity. One type of the previous methods lifts 2D joints, obtained by resorting to external 2D pose detectors, to the 3D space. However, this type of approaches discards the contextual information of images which are strong cues for 3D pose estimation. Meanwhile, some other methods predict the joints directly from monocular images but adopt a 2.5D output representation P^2.5D = (u,v,z^r) where both u and v are in the image space but z^r in root-relative 3D space. Thus, the ground-truth information (e.g., the depth of root joint from the camera) is normally utilized to transform the 2.5D output to the 3D space, which limits the applicability in practice. In this work, we propose a novel end-to-end framework that not only exploits the contextual information but also produces the output directly in the 3D space via cascaded dimension-lifting. Specifically, we decompose the task of lifting pose from 2D image space to 3D spatial space into several sequential sub-tasks, 1) kinematic skeletons & individual joints estimation in 2D space, 2) root-relative depth estimation, and 3) lifting to the 3D space, each of which employs direct supervisions and contextual image features to guide the learning process. Extensive experiments show that the proposed framework achieves state-of-the-art performance on two widely used 3D human pose datasets (Human3.6M, MuPoTS-3D).

READ FULL TEXT

page 1

page 3

07/17/2020

HDNet: Human Depth Estimation for Multi-Person Camera-Space Localization

Current works on multi-person 3D pose estimation mainly focus on the est...
10/06/2017

Human Pose Regression by Combining Indirect Part Detection and Contextual Information

In this paper, we propose an end-to-end trainable regression approach fo...
07/16/2022

Mutual Adaptive Reasoning for Monocular 3D Multi-Person Pose Estimation

Inter-person occlusion and depth ambiguity make estimating the 3D poses ...
04/11/2019

Absolute Human Pose Estimation with Depth Prediction Network

The common approach to 3D human pose estimation is predicting the body j...
03/04/2021

Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration

Recent years have witnessed significant progress in 3D hand mesh recover...
04/08/2017

Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach

In this paper, we study the task of 3D human pose estimation in the wild...
08/06/2022

IVT: An End-to-End Instance-guided Video Transformer for 3D Pose Estimation

Video 3D human pose estimation aims to localize the 3D coordinates of hu...