Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos

09/15/2021
by   Junhao Zhang, et al.
5

Graph Convolution Network (GCN) has been successfully used for 3D human pose estimation in videos. However, it is often built on the fixed human-joint affinity, according to human skeleton. This may reduce adaptation capacity of GCN to tackle complex spatio-temporal pose variations in videos. To alleviate this problem, we propose a novel Dynamical Graph Network (DG-Net), which can dynamically identify human-joint affinity, and estimate 3D pose by adaptively learning spatial/temporal joint relations from videos. Different from traditional graph convolution, we introduce Dynamical Spatial/Temporal Graph convolution (DSG/DTG) to discover spatial/temporal human-joint affinity for each video exemplar, depending on spatial distance/temporal movement similarity between human joints in this video. Hence, they can effectively understand which joints are spatially closer and/or have consistent motion, for reducing depth ambiguity and/or motion uncertainty when lifting 2D pose to 3D pose. We conduct extensive experiments on three popular benchmarks, e.g., Human3.6M, HumanEva-I, and MPI-INF-3DHP, where DG-Net outperforms a number of recent SOTA approaches with fewer input frames and model size.

READ FULL TEXT

page 1

page 9

page 10

research
03/11/2020

GAST-Net: Graph Attention Spatio-temporal Convolutional Networks for 3D Human Pose Estimation in Video

3D pose estimation in video can benefit greatly from both temporal and s...
research
03/02/2022

MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video

Recent transformer-based solutions have been introduced to estimate 3D h...
research
07/16/2021

Conditional Directed Graph Convolution for 3D Human Pose Estimation

Graph convolutional networks have significantly improved 3D human pose e...
research
01/18/2023

HSTFormer: Hierarchical Spatial-Temporal Transformers for 3D Human Pose Estimation

Transformer-based approaches have been successfully proposed for 3D huma...
research
10/09/2021

Space-Time-Separable Graph Convolutional Network for Pose Forecasting

Human pose forecasting is a complex structured-data sequence-modelling t...
research
05/23/2019

Pose estimator and tracker using temporal flow maps for limbs

For human pose estimation in videos, it is significant how to use tempor...
research
08/01/2022

Pose Uncertainty Aware Movement Synchrony Estimation via Spatial-Temporal Graph Transformer

Movement synchrony reflects the coordination of body movements between i...

Please sign up or login with your details

Forgot password? Click here to reset