Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation

07/07/2023
by Zhongyu Jiang, et al.

Learning-based methods have come to dominate 3D human pose estimation (HPE), performing significantly better than traditional optimization-based methods on most benchmarks. Nonetheless, 3D HPE in the wild remains the biggest challenge for learning-based models, whether they use 2D-3D lifting, image-to-3D, or diffusion-based methods, because the trained networks implicitly learn camera intrinsic parameters and domain-specific 3D human pose distributions, and estimate poses by statistical average. Optimization-based methods, by contrast, estimate results case by case and can therefore predict more diverse and sophisticated human poses in the wild. Combining the advantages of optimization-based and learning-based methods, we propose the Zero-shot Diffusion-based Optimization (ZeDO) pipeline for 3D HPE to address cross-domain and in-the-wild 3D HPE. Our multi-hypothesis ZeDO achieves state-of-the-art (SOTA) performance on Human3.6M, with a minMPJPE of 51.4 mm, without training on any 2D-3D or image-3D pairs. Moreover, our single-hypothesis ZeDO achieves SOTA performance on the 3DPW dataset, with a PA-MPJPE of 42.6 mm in cross-dataset evaluation, which even outperforms learning-based methods trained on 3DPW.
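
For readers unfamiliar with the metrics quoted above, the following is a minimal sketch of how MPJPE and minMPJPE are commonly computed; it is not taken from the paper's code. The function names `mpjpe` and `min_mpjpe` and the toy two-joint poses are assumptions made for illustration. PA-MPJPE additionally applies a rigid (Procrustes) alignment of the prediction to the ground truth before measuring the error, which is not shown here.

```python
import math


def mpjpe(pred, gt):
    """Mean per-joint position error: average Euclidean distance between
    corresponding predicted and ground-truth joints (same units as input)."""
    if len(pred) != len(gt) or not gt:
        raise ValueError("pred and gt must be non-empty and have the same number of joints")
    return sum(math.dist(p, g) for p, g in zip(pred, gt)) / len(gt)


def min_mpjpe(hypotheses, gt):
    """Multi-hypothesis variant: the error of the best hypothesis."""
    return min(mpjpe(h, gt) for h in hypotheses)


# Toy example with two joints, coordinates in millimetres (hypothetical data).
gt = [(0.0, 0.0, 0.0), (0.0, 100.0, 0.0)]
hyp_a = [(3.0, 4.0, 0.0), (0.0, 100.0, 0.0)]    # first joint is off by 5 mm
hyp_b = [(0.0, 0.0, 10.0), (0.0, 100.0, 10.0)]  # both joints are off by 10 mm

best_error = min_mpjpe([hyp_a, hyp_b], gt)
```

Because minMPJPE keeps only the best of several hypotheses, it can never be larger than the MPJPE of any single hypothesis in the set.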

research
01/08/2023

CameraPose: Weakly-Supervised Monocular 3D Human Pose Estimation by Leveraging In-the-wild 2D Annotations

To improve the generalization of 3D human pose estimators, many existing...
research
05/29/2023

3D Model-based Zero-Shot Pose Estimation Pipeline

Most existing learning-based pose estimation methods are typically devel...
research
11/23/2020

NeuralAnnot: Neural Annotator for in-the-wild Expressive 3D Human Pose and Mesh Training Sets

Recovering expressive 3D human pose and mesh from in-the-wild images is ...
research
10/20/2022

Multi-hypothesis 3D human pose estimation metrics favor miscalibrated distributions

Due to depth ambiguities and occlusions, lifting 2D poses to 3D is a hig...
research
11/29/2022

DiffPose: Multi-hypothesis Human Pose Estimation using Diffusion models

Traditionally, monocular 3D human pose estimation employs a machine lear...
research
03/24/2021

AcinoSet: A 3D Pose Estimation Dataset and Baseline Models for Cheetahs in the Wild

Animals are capable of extreme agility, yet understanding their complex ...
research
04/30/2012

Parametric annealing: a stochastic search method for human pose tracking

Model based methods to marker-free motion capture have a very high compu...
