Language-guided Human Motion Synthesis with Atomic Actions

08/18/2023
by   Yuanhao Zhai, et al.
0

Language-guided human motion synthesis has been a challenging task due to the inherent complexity and diversity of human behaviors. Previous methods face limitations in generalization to novel actions, often resulting in unrealistic or incoherent motion sequences. In this paper, we propose ATOM (ATomic mOtion Modeling) to mitigate this problem, by decomposing actions into atomic actions, and employing a curriculum learning strategy to learn atomic action composition. First, we disentangle complex human motions into a set of atomic actions during learning, and then assemble novel actions using the learned atomic actions, which offers better adaptability to new actions. Moreover, we introduce a curriculum learning training strategy that leverages masked motion modeling with a gradual increase in the mask ratio, and thus facilitates atomic action assembly. This approach mitigates the overfitting problem commonly encountered in previous methods while enforcing the model to learn better motion representations. We demonstrate the effectiveness of ATOM through extensive experiments, including text-to-motion and action-to-motion synthesis tasks. We further illustrate its superiority in synthesizing plausible and coherent text-guided human motion sequences.

READ FULL TEXT
research
09/09/2022

TEACH: Temporal Action Composition for 3D Humans

Given a series of natural language descriptions, our task is to generate...
research
06/07/2021

Unsupervised Action Segmentation for Instructional Videos

In this paper we address the problem of automatically discovering atomic...
research
05/25/2022

Towards Diverse and Natural Scene-aware 3D Human Motion Synthesis

The ability to synthesize long-term human motion sequences in real-world...
research
09/30/2019

Synthesizing Action Sequences for Modifying Model Decisions

When a model makes a consequential decision, e.g., denying someone a loa...
research
10/22/2022

Diffusion Motion: Generate Text-Guided 3D Human Motion by Diffusion Model

We propose a simple and novel method for generating 3D human motion from...
research
06/10/2023

Process Algebra with Imperfect Actions

We discuss the deal of imperfectness of atomic actions in reality with t...
research
04/20/2023

SINC: Spatial Composition of 3D Human Motions for Simultaneous Action Generation

Our goal is to synthesize 3D human motions given textual inputs describi...

Please sign up or login with your details

Forgot password? Click here to reset