A Unified Masked Autoencoder with Patchified Skeletons for Motion Synthesis

08/14/2023
by Esteve Valls Mascaro, et al.

The synthesis of human motion has traditionally been addressed through task-dependent models that focus on specific challenges, such as predicting future motions or filling in intermediate poses conditioned on known key-poses. In this paper, we present a novel task-independent model called UNIMASK-M, which can effectively address these challenges using a unified architecture. Our model obtains comparable or better performance than the state-of-the-art in each field. Inspired by Vision Transformers (ViTs), our UNIMASK-M model decomposes a human pose into body parts to leverage the spatio-temporal relationships existing in human motion. Moreover, we reformulate various pose-conditioned motion synthesis tasks as a reconstruction problem with different masking patterns given as input. By explicitly informing our model about the masked joints, our UNIMASK-M becomes more robust to occlusions. Experimental results show that our model successfully forecasts human motion on the Human3.6M dataset. Moreover, it achieves state-of-the-art results in motion inbetweening on the LaFAN1 dataset, particularly in long transition periods. More information can be found on the project website https://sites.google.com/view/estevevallsmascaro/publications/unimask-m.
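The reformulation of several synthesis tasks as one reconstruction problem can be illustrated with a minimal sketch. The abstract does not specify the exact masking patterns, the number of body parts, or the placeholder used for hidden tokens, so everything below is an assumption: the function names (`forecasting_mask`, `inbetweening_mask`, `occlusion_mask`, `apply_mask`), the frame-by-part token grid, and the zero fill value are hypothetical and are not taken from the UNIMASK-M implementation.

```python
from typing import List, Sequence

# mask[t][p] is True when body part p at frame t is hidden from the model.
Mask = List[List[bool]]


def forecasting_mask(num_frames: int, num_parts: int, num_observed: int) -> Mask:
    """Motion forecasting: hide every frame after the observed prefix."""
    return [[t >= num_observed] * num_parts for t in range(num_frames)]


def inbetweening_mask(num_frames: int, num_parts: int,
                      num_context: int, num_target: int = 1) -> Mask:
    """Inbetweening: keep the leading context frames and the trailing
    target key-poses, hide the transition between them."""
    last_hidden = num_frames - num_target
    return [[num_context <= t < last_hidden] * num_parts
            for t in range(num_frames)]


def occlusion_mask(num_frames: int, num_parts: int,
                   occluded_parts: Sequence[int]) -> Mask:
    """Occlusion: hide the same subset of body parts in every frame."""
    hidden = set(occluded_parts)
    return [[p in hidden for p in range(num_parts)] for _ in range(num_frames)]


def apply_mask(tokens, mask: Mask, fill: float = 0.0):
    """Replace hidden tokens with a placeholder vector (assumed zero fill).
    tokens[t][p] is the feature vector of body part p at frame t."""
    return [[[fill] * len(vec) if hidden else list(vec)
             for vec, hidden in zip(frame, row)]
            for frame, row in zip(tokens, mask)]


# Hypothetical sequence: 6 frames, 5 body-part tokens, 2 features per token.
tokens = [[[1.0, 2.0] for _ in range(5)] for _ in range(6)]
mask = forecasting_mask(num_frames=6, num_parts=5, num_observed=2)
masked_input = apply_mask(tokens, mask)
```

In this sketch a single reconstruction model would receive both `masked_input` and `mask`, so the task is selected only by the masking pattern. Passing the mask explicitly mirrors the abstract's point about informing the model which joints are masked.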
