Trajectory-Aware Body Interaction Transformer for Multi-Person Pose Forecasting

03/09/2023
by   Xiaogang Peng, et al.
0

Multi-person pose forecasting remains a challenging problem, especially in modeling fine-grained human body interaction in complex crowd scenarios. Existing methods typically represent the whole pose sequence as a temporal series, yet overlook interactive influences among people based on skeletal body parts. In this paper, we propose a novel Trajectory-Aware Body Interaction Transformer (TBIFormer) for multi-person pose forecasting via effectively modeling body part interactions. Specifically, we construct a Temporal Body Partition Module that transforms all the pose sequences into a Multi-Person Body-Part sequence to retain spatial and temporal information based on body semantics. Then, we devise a Social Body Interaction Self-Attention (SBI-MSA) module, utilizing the transformed sequence to learn body part dynamics for inter- and intra-individual interactions. Furthermore, different from prior Euclidean distance-based spatial encodings, we present a novel and efficient Trajectory-Aware Relative Position Encoding for SBI-MSA to offer discriminative spatial information and additional interactive clues. On both short- and long-term horizons, we empirically evaluate our framework on CMU-Mocap, MuPoTS-3D as well as synthesized datasets (6   10 persons), and demonstrate that our method greatly outperforms the state-of-the-art methods. Code will be made publicly available upon acceptance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/19/2022

SoMoFormer: Social-Aware Motion Transformer for Multi-Person Motion Prediction

Multi-person motion prediction remains a challenging problem, especially...
research
07/25/2022

IGFormer: Interaction Graph Transformer for Skeleton-based Human Interaction Recognition

Human interaction recognition is very important in many applications. On...
research
06/13/2023

Pose-aware Attention Network for Flexible Motion Retargeting by Body Part

Motion retargeting is a fundamental problem in computer graphics and com...
research
07/01/2022

MotionMixer: MLP-based 3D Human Body Pose Forecasting

In this work, we present MotionMixer, an efficient 3D human body pose fo...
research
06/06/2023

PGformer: Proxy-Bridged Game Transformer for Multi-Person Extremely Interactive Motion Prediction

Multi-person motion prediction is a challenging task, especially for rea...
research
04/12/2023

Best Practices for 2-Body Pose Forecasting

The task of collaborative human pose forecasting stands for predicting t...
research
07/15/2022

A Dual-Masked Auto-Encoder for Robust Motion Capture with Spatial-Temporal Skeletal Token Completion

Multi-person motion capture can be challenging due to ambiguities caused...

Please sign up or login with your details

Forgot password? Click here to reset