Double-chain Constraints for 3D Human Pose Estimation in Images and Videos

08/10/2023
by Hongbo Kang, et al.

Reconstructing 3D poses from 2D poses, which lack depth information, is particularly challenging because human motion is complex and diverse. The key is to model the spatial constraints between joints effectively so as to exploit their inherent dependencies. We therefore propose a novel model, the Double-chain Graph Convolutional Transformer (DC-GCT), which constrains the pose through a double-chain design consisting of a local-to-global chain and a global-to-local chain, yielding a richer representation better suited to the current human pose. Specifically, we combine the advantages of GCNs and Transformers and design a Local Constraint Module (LCM) based on GCN, a Global Constraint Module (GCM) based on the self-attention mechanism, and a Feature Interaction Module (FIM). The proposed method captures the multi-level dependencies between human body joints, improving the modeling capability of the model. Moreover, we propose a method to incorporate temporal information into the single-frame model by guiding the video sequence embedding with the joint embedding of the target frame, at a negligible increase in computational cost. Experimental results demonstrate that DC-GCT achieves state-of-the-art performance on two challenging datasets (Human3.6M and MPI-INF-3DHP). Notably, our model achieves state-of-the-art performance on all action categories of the Human3.6M dataset using 2D poses detected by CPN. Our code is available at: https://github.com/KHB1698/DC-GCT.
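To make the local and global constraints described in the abstract concrete, here is a minimal sketch in pure Python. It is not the authors' implementation: the function names, the identity projections in the attention step, the three-joint toy skeleton, and the reading of each "chain" as a simple composition of the two steps are all assumptions made for illustration. The local step is a standard GCN-style neighbourhood aggregation over the skeleton graph, and the global step is plain single-head self-attention across all joints. The Feature Interaction Module and the temporal guidance are not modelled.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def normalize_adjacency(adj):
    """Add self-loops, then apply the symmetric normalization D^-1/2 (A + I) D^-1/2."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]


def local_step(features, adj):
    """GCN-style aggregation: each joint mixes its features with its skeleton neighbours.
    Assumption: no learned weight matrix, to keep the sketch short."""
    return matmul(normalize_adjacency(adj), features)


def softmax(row):
    """Numerically stable softmax over one row of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def global_step(features):
    """Single-head self-attention over all joints.
    Assumption: queries, keys and values are the raw features (identity projections)."""
    d = len(features[0])
    scores = [
        [sum(q * k for q, k in zip(qi, kj)) / math.sqrt(d) for kj in features]
        for qi in features
    ]
    weights = [softmax(row) for row in scores]
    return matmul(weights, features), weights


def local_to_global(features, adj):
    """Hypothetical local-to-global chain: local aggregation, then global attention."""
    return global_step(local_step(features, adj))[0]


def global_to_local(features, adj):
    """Hypothetical global-to-local chain: global attention, then local aggregation."""
    return local_step(global_step(features)[0], adj)


# Toy skeleton: a chain of three joints (0 - 1 - 2), two features per joint.
toy_adj = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
toy_features = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.5]]
chain_a = local_to_global(toy_features, toy_adj)
chain_b = global_to_local(toy_features, toy_adj)
```

Both chains return one feature vector per joint, so in a fuller model their outputs could be combined or exchanged; how DC-GCT does this is specified in the paper and the linked repository, not here.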


research
01/18/2023

HSTFormer: Hierarchical Spatial-Temporal Transformers for 3D Human Pose Estimation

Transformer-based approaches have been successfully proposed for 3D huma...
research
03/15/2023

PoseRAC: Pose Saliency Transformer for Repetitive Action Counting

This paper presents a significant contribution to the field of repetitiv...
research
04/27/2023

Interweaved Graph and Attention Network for 3D Human Pose Estimation

Despite substantial progress in 3D human pose estimation from a single-v...
research
02/20/2023

HTNet: Human Topology Aware Network for 3D Human Pose Estimation

3D human pose estimation errors would propagate along the human body top...
research
04/17/2023

ViPLO: Vision Transformer based Pose-Conditioned Self-Loop Graph for Human-Object Interaction Detection

Human-Object Interaction (HOI) detection, which localizes and infers rel...
research
06/13/2022

GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation

Modern multi-layer perceptron (MLP) models have shown competitive result...
research
10/09/2021

Space-Time-Separable Graph Convolutional Network for Pose Forecasting

Human pose forecasting is a complex structured-data sequence-modelling t...
