
Inference Stage Optimization for Cross-scenario 3D Human Pose Estimation

by Jianfeng Zhang, et al.

Existing 3D human pose estimation models suffer a performance drop when applied to new scenarios with unseen poses, owing to their limited generalizability. In this work, we propose a novel framework, Inference Stage Optimization (ISO), for improving the generalizability of 3D pose models when source and target data come from different pose distributions. Our main insight is that the target data, even though unlabeled, carry valuable priors about their underlying distribution. To exploit this information, ISO performs geometry-aware self-supervised learning (SSL) on each single target instance and updates the 3D pose model before making a prediction. In this way, the model can mine distributional knowledge about the target scenario and quickly adapt to it, with enhanced generalization performance. In addition, to handle sequential target data, we propose an online mode that implements ISO by streaming the SSL, which substantially enhances its effectiveness. We systematically analyze why and how ISO works on diverse benchmarks under the cross-scenario setup. Remarkably, it yields a new state of the art of 83.6% 3D PCK on MPI-INF-3DHP, improving upon the previous best result by 9.7%.
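The adapt-then-predict loop described in the abstract can be sketched roughly as follows. This is only an illustration under strong assumptions, not the authors' implementation: the lifter is a toy linear map `W`, the geometry-aware SSL objective is replaced by a simple orthographic reprojection loss, and every name (`project`, `ssl_loss_and_grad`, `iso_predict`, `J`, `steps`, `lr`) is hypothetical. The `online` flag mimics the idea of carrying SSL updates across a stream of target instances instead of resetting to the source model each time.

```python
import numpy as np

J = 4  # hypothetical number of joints in the toy skeleton


def project(pose3d):
    """Orthographic projection (an assumption): keep x, y and drop depth."""
    return pose3d.reshape(J, 3)[:, :2].reshape(-1)


def ssl_loss_and_grad(W, x2d):
    """Reprojection loss ||project(W @ x2d) - x2d||^2 and its gradient w.r.t. W."""
    residual = project(W @ x2d) - x2d
    loss = float(residual @ residual)
    # Route the residual back to the x, y slots of the 3D output; depth gets no gradient.
    g3d = np.zeros((J, 3))
    g3d[:, :2] = 2.0 * residual.reshape(J, 2)
    grad = np.outer(g3d.reshape(-1), x2d)
    return loss, grad


def iso_predict(W_src, targets, steps=5, lr=0.01, online=False):
    """Adapt on each unlabeled target instance with SSL, then predict its 3D pose.

    online=False: restart from the source weights for every instance.
    online=True:  keep the adapted weights for the next instance in the stream.
    """
    W = W_src.copy()
    preds = []
    for x2d in targets:
        if not online:
            W = W_src.copy()
        for _ in range(steps):
            _, grad = ssl_loss_and_grad(W, x2d)
            W -= lr * grad  # plain gradient descent on the SSL loss
        preds.append(W @ x2d)
    return preds, W


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    W_src = 0.1 * rng.normal(size=(3 * J, 2 * J))  # stand-in for a source-trained lifter
    stream = [rng.normal(size=2 * J) for _ in range(3)]  # unlabeled target 2D poses
    preds, W_adapted = iso_predict(W_src, stream, online=True)
    print(len(preds), preds[0].shape)
```

In this sketch the source weights are never modified in place, so the per-instance mode and the online mode can be compared starting from the same model.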




Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image Synthesis

Camera captured human pose is an outcome of several sources of variation...

Adapted Human Pose: Monocular 3D Human Pose Estimation with Zero Real 3D Pose Data

The ultimate goal for an inference model is to be robust and functional ...

PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision

Existing self-supervised 3D human pose estimation schemes have largely r...

A Unified Framework for Domain Adaptive Pose Estimation

While pose estimation is an important computer vision task, it requires ...

Non-Local Latent Relation Distillation for Self-Adaptive 3D Human Pose Estimation

Available 3D human pose estimation approaches leverage different forms o...

Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation

When applying a pre-trained 2D-to-3D human pose lifting model to a targe...

Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses

Many prediction tasks contain uncertainty. In some cases, uncertainty is...