Taming Diffusion Models for Music-driven Conducting Motion Generation

06/15/2023
by   Zhuoran Zhao, et al.
0

Generating the motion of orchestral conductors from a given piece of symphony music is a challenging task since it requires a model to learn semantic music features and capture the underlying distribution of real conducting motion. Prior works have applied Generative Adversarial Networks (GAN) to this task, but the promising diffusion model, which recently showed its advantages in terms of both training stability and output quality, has not been exploited in this context. This paper presents Diffusion-Conductor, a novel DDIM-based approach for music-driven conducting motion generation, which integrates the diffusion model to a two-stage learning framework. We further propose a random masking strategy to improve the feature robustness, and use a pair of geometric loss functions to impose additional regularizations and increase motion diversity. We also design several novel metrics, including Frechet Gesture Distance (FGD) and Beat Consistency Score (BC) for a more comprehensive evaluation of the generated motion. Experimental results demonstrate the advantages of our model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/05/2023

DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation

When hearing music, it is natural for people to dance to its rhythm. Aut...
research
12/03/2021

Music-to-Dance Generation with Optimal Transport

Dance choreography for a piece of music is a challenging task, having to...
research
03/14/2023

DiffuseRoll: Multi-track multi-category music generation based on diffusion model

Recent advancements in generative models have shown remarkable progress ...
research
09/29/2022

Human Motion Diffusion Model

Natural and expressive human motion generation is the holy grail of comp...
research
01/28/2022

Dual Learning Music Composition and Dance Choreography

Music and dance have always co-existed as pillars of human activities, c...
research
02/09/2023

ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models

In recent years, there has been an increased popularity in image and spe...
research
07/15/2022

ChoreoGraph: Music-conditioned Automatic Dance Choreography over a Style and Tempo Consistent Dynamic Graph

To generate dance that temporally and aesthetically matches the music is...

Please sign up or login with your details

Forgot password? Click here to reset