Pose Uncertainty Aware Movement Synchrony Estimation via Spatial-Temporal Graph Transformer

08/01/2022
by   Jicheng Li, et al.
0

Movement synchrony reflects the coordination of body movements between interacting dyads. The estimation of movement synchrony has been automated by powerful deep learning models such as transformer networks. However, instead of designing a specialized network for movement synchrony estimation, previous transformer-based works broadly adopted architectures from other tasks such as human activity recognition. Therefore, this paper proposed a skeleton-based graph transformer for movement synchrony estimation. The proposed model applied ST-GCN, a spatial-temporal graph convolutional neural network for skeleton feature extraction, followed by a spatial transformer for spatial feature generation. The spatial transformer is guided by a uniquely designed joint position embedding shared between the same joints of interacting individuals. Besides, we incorporated a temporal similarity matrix in temporal attention computation considering the periodic intrinsic of body movements. In addition, the confidence score associated with each joint reflects the uncertainty of a pose, while previous works on movement synchrony estimation have not sufficiently emphasized this point. Since transformer networks demand a significant amount of data to train, we constructed a dataset for movement synchrony estimation using Human3.6M, a benchmark dataset for human activity recognition, and pretrained our model on it using contrastive learning. We further applied knowledge distillation to alleviate information loss introduced by pose detector failure in a privacy-preserving way. We compared our method with representative approaches on PT13, a dataset collected from autism therapy interventions. Our method achieved an overall accuracy of 88.98 its counterparts by a wide margin while maintaining data privacy.

READ FULL TEXT

page 6

page 9

research
08/17/2020

Spatial Temporal Transformer Network for Skeleton-based Action Recognition

Skeleton-based Human Activity Recognition has achieved a great interest ...
research
09/07/2021

GCsT: Graph Convolutional Skeleton Transformer for Action Recognition

Graph convolutional networks (GCNs) achieve promising performance for sk...
research
01/11/2020

Towards Generalizable Surgical Activity Recognition Using Spatial Temporal Graph Convolutional Networks

Modeling and recognition of surgical activities poses an interesting res...
research
08/01/2022

Dyadic Movement Synchrony Estimation Under Privacy-preserving Conditions

Movement synchrony refers to the dynamic temporal connection between the...
research
05/05/2022

Koopman pose predictions for temporally consistent human walking estimations

We tackle the problem of tracking the human lower body as an initial ste...
research
09/15/2021

Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos

Graph Convolution Network (GCN) has been successfully used for 3D human ...
research
07/15/2022

A Dual-Masked Auto-Encoder for Robust Motion Capture with Spatial-Temporal Skeletal Token Completion

Multi-person motion capture can be challenging due to ambiguities caused...

Please sign up or login with your details

Forgot password? Click here to reset