SINC: Spatial Composition of 3D Human Motions for Simultaneous Action Generation

04/20/2023
by   Nikos Athanasiou, et al.
0

Our goal is to synthesize 3D human motions given textual inputs describing simultaneous actions, for example 'waving hand' while 'walking' at the same time. We refer to generating such simultaneous movements as performing 'spatial compositions'. In contrast to temporal compositions that seek to transition from one action to another, spatial compositing requires understanding which body parts are involved in which action, to be able to move them simultaneously. Motivated by the observation that the correspondence between actions and body parts is encoded in powerful language models, we extract this knowledge by prompting GPT-3 with text such as "what are the body parts involved in the action <action name>?", while also providing the parts list and few-shot examples. Given this action-part mapping, we combine body parts from two motions together and establish the first automated method to spatially compose two actions. However, training data with compositional actions is always limited by the combinatorics. Hence, we further create synthetic data with this approach, and use it to train a new state-of-the-art text-to-motion generation model, called SINC ("SImultaneous actioN Compositions for 3D human motions"). In our experiments, we find training on additional synthetic GPT-guided compositional motions improves text-to-motion generation.

READ FULL TEXT

page 1

page 3

page 7

page 8

page 11

page 12

page 13

research
09/09/2022

TEACH: Temporal Action Composition for 3D Humans

Given a series of natural language descriptions, our task is to generate...
research
08/03/2023

Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling

Text-to-motion generation has gained increasing attention, but most exis...
research
06/15/2016

A Hierarchical Pose-Based Approach to Complex Action Understanding Using Dictionaries of Actionlets and Motion Poselets

In this paper, we introduce a new hierarchical model for human action re...
research
03/26/2021

Synthesis of Compositional Animations from Textual Descriptions

"How can we animate 3D-characters from a movie script or move robots by ...
research
12/08/2022

Generating Holistic 3D Human Motion from Speech

This work addresses the problem of generating 3D holistic body motions f...
research
08/18/2023

Language-guided Human Motion Synthesis with Atomic Actions

Language-guided human motion synthesis has been a challenging task due t...
research
10/21/2014

Compositional Structure Learning for Action Understanding

The focus of the action understanding literature has predominately been ...

Please sign up or login with your details

Forgot password? Click here to reset