Symbolic Music Generation with Diffusion Models

03/30/2021
by Gautam Mittal, et al.

Score-based generative models and diffusion probabilistic models have been successful at generating high-quality samples in continuous domains such as images and audio. However, due to their Langevin-inspired sampling mechanisms, their application to discrete and sequential data has been limited. In this work, we present a technique for training diffusion models on sequential data by parameterizing the discrete domain in the continuous latent space of a pre-trained variational autoencoder. Our method is non-autoregressive: it learns to generate sequences of latent embeddings through the reverse process and offers parallel generation with a constant number of iterative refinement steps. We apply this technique to modeling symbolic music and show strong unconditional generation and post-hoc conditional infilling results compared to autoregressive language models operating over the same continuous embeddings.
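To make the abstract's idea concrete, the following is a minimal sketch of a diffusion process over a sequence of continuous latent embeddings. It is not the authors' implementation: the linear noise schedule, the standard DDPM ancestral sampling update, and all function names are assumptions chosen for illustration, and `dummy_eps_model` is a hypothetical stand-in for a trained noise-prediction network. In the setting described above, the rows of `z0` would be embeddings produced by a pre-trained VAE encoder, and the sampled latents would be passed through the VAE decoder to recover discrete music events.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances (an assumed schedule, not from the source)."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """alpha_bar_t: running product of (1 - beta_s) up to and including step t."""
    out, prod = [], 1.0
    for b in betas:
        prod *= 1.0 - b
        out.append(prod)
    return out


def q_sample(z0, t, alpha_bars, rng):
    """Forward process: noise a sequence of latents z0 to step t (0-indexed)."""
    a = alpha_bars[t]
    noise = [[rng.gauss(0.0, 1.0) for _ in row] for row in z0]
    zt = [
        [math.sqrt(a) * x + math.sqrt(1.0 - a) * e for x, e in zip(row, erow)]
        for row, erow in zip(z0, noise)
    ]
    return zt, noise


def dummy_eps_model(zt, t):
    """Hypothetical placeholder for a trained network that predicts the noise."""
    return [[0.0 for _ in row] for row in zt]


def p_sample_loop(eps_model, seq_len, dim, betas, rng):
    """Reverse process: every sequence position is refined in parallel per step."""
    alpha_bars = cumulative_alphas(betas)
    # Start from pure Gaussian noise for the whole sequence of latents.
    z = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(seq_len)]
    for t in reversed(range(len(betas))):
        eps = eps_model(z, t)
        alpha = 1.0 - betas[t]
        coef = betas[t] / math.sqrt(1.0 - alpha_bars[t])
        sigma = math.sqrt(betas[t]) if t > 0 else 0.0  # no noise on the last step
        z = [
            [(x - coef * e) / math.sqrt(alpha) + sigma * rng.gauss(0.0, 1.0)
             for x, e in zip(row, erow)]
            for row, erow in zip(z, eps)
        ]
    return z
```

The number of loop iterations in `p_sample_loop` depends only on the length of the noise schedule, not on `seq_len`, which is one way to read the claim of parallel generation with a constant number of refinement steps.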

research
12/19/2022

Latent Diffusion for Language Generation

Diffusion models have achieved great success in modeling continuous data...
research
05/16/2023

Discrete Diffusion Probabilistic Models for Symbolic Music Generation

Denoising Diffusion Probabilistic Models (DDPMs) have made great strides...
research
08/19/2022

Vector Quantized Diffusion Model with CodeUnet for Text-to-Sign Pose Sequences Generation

Sign Language Production (SLP) aims to translate spoken languages into s...
research
06/12/2023

Latent Dynamical Implicit Diffusion Processes

Latent dynamical models are commonly used to learn the distribution of a...
research
10/05/2021

Autoregressive Diffusion Models

We introduce Autoregressive Diffusion Models (ARDMs), a model class enco...
research
08/23/2023

Boosting Diffusion Models with an Adaptive Momentum Sampler

Diffusion probabilistic models (DPMs) have been shown to generate high-q...
research
08/27/2023

Multi-plane denoising diffusion-based dimensionality expansion for 2D-to-3D reconstruction of microstructures with harmonized sampling

Acquiring reliable microstructure datasets is a pivotal step toward the ...
