Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation

02/15/2023
by   Han Li, et al.
0

There has been a recent surge of interest in introducing transformers to 3D human pose estimation (HPE) due to their powerful capabilities in modeling long-term dependencies. However, existing transformer-based methods treat body joints as equally important inputs and ignore the prior knowledge of human skeleton topology in the self-attention mechanism. To tackle this issue, in this paper, we propose a Pose-Oriented Transformer (POT) with uncertainty guided refinement for 3D HPE. Specifically, we first develop novel pose-oriented self-attention mechanism and distance-related position embedding for POT to explicitly exploit the human skeleton topology. The pose-oriented self-attention mechanism explicitly models the topological interactions between body joints, whereas the distance-related position embedding encodes the distance of joints to the root joint to distinguish groups of joints with different difficulties in regression. Furthermore, we present an Uncertainty-Guided Refinement Network (UGRN) to refine pose predictions from POT, especially for the difficult joints, by considering the estimated uncertainty of each joint with uncertainty-guided sampling strategy and self-attention mechanism. Extensive experiments demonstrate that our method significantly outperforms the state-of-the-art methods with reduced model parameters on 3D HPE benchmarks such as Human3.6M and MPI-INF-3DHP

READ FULL TEXT

page 3

page 7

research
06/16/2023

EVOPOSE: A Recursive Transformer For 3D Human Pose Estimation With Kinematic Structure Priors

Transformer is popular in recent 3D human pose estimation, which utilize...
research
03/29/2021

TFPose: Direct Human Pose Estimation with Transformers

We propose a human pose estimation framework that solves the task in the...
research
08/24/2022

K-Order Graph-oriented Transformer with GraAttention for 3D Pose and Shape Estimation

We propose a novel attention-based 2D-to-3D pose estimation network for ...
research
12/24/2021

Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation

3D human pose and shape recovery from a monocular RGB image is a challen...
research
06/12/2023

Mitigating Transformer Overconfidence via Lipschitz Regularization

Though Transformers have achieved promising results in many computer vis...
research
04/17/2021

PARE: Part Attention Regressor for 3D Human Body Estimation

Despite significant progress, we show that state of the art 3D human pos...

Please sign up or login with your details

Forgot password? Click here to reset