PocketVAE: A Two-step Model for Groove Generation and Control

07/11/2021
by   Kyungyun Lee, et al.
0

Creating a good drum track to imitate a skilled performer in digital audio workstations (DAWs) can be a time-consuming process, especially for those unfamiliar with drums. In this work, we introduce PocketVAE, a groove generation system that applies grooves to users' rudimentary MIDI tracks, i.e, templates. Grooves can be either transferred from a reference track, generated randomly or with conditions, such as genres. Our system, consisting of different modules for each groove component, takes a two-step approach that is analogous to a music creation process. First, the note module updates the user template through addition and deletion of notes; Second, the velocity and microtiming modules add details to this generated note score. In order to model the drum notes, we apply a discrete latent representation method via Vector Quantized Variational Autoencoder (VQ-VAE), as drum notes have a discrete property, unlike velocity and microtiming values. We show that our two-step approach and the usage of a discrete encoding space improves the learning of the original data distribution. Additionally, we discuss the benefit of incorporating control elements - genre, velocity and microtiming patterns - into the model.

READ FULL TEXT

page 2

page 6

research
07/11/2020

Transformer-XL Based Music Generation with Multiple Sequences of Time-valued Notes

Current state-of-the-art AI based classical music creation algorithms su...
research
10/13/2020

A variational autoencoder for music generation controlled by tonal tension

Many of the music generation systems based on neural networks are fully ...
research
08/18/2020

PopMAG: Pop Music Accompaniment Generation

In pop music, accompaniments are usually played by multiple instruments ...
research
06/28/2018

GenerationMania: Learning to Semantically Choreograph

Beatmania is a rhythm action game where players play the role of a DJ th...
research
09/08/2019

MIDI-Sandwich2: RNN-based Hierarchical Multi-modal Fusion Generation VAE networks for multi-track symbolic music generation

Currently, almost all the multi-track music generation models use the Co...
research
10/12/2022

JukeDrummer: Conditional Beat-aware Audio-domain Drum Accompaniment Generation via Transformer VQ-VA

This paper proposes a model that generates a drum track in the audio dom...
research
08/11/2021

Variable-Length Music Score Infilling via XLNet and Musically Specialized Positional Encoding

This paper proposes a new self-attention based model for music score inf...

Please sign up or login with your details

Forgot password? Click here to reset