A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation

05/26/2023
by Xipin Wei, et al.

Multi-instrument music generation has recently attracted considerable attention. Unlike single-instrument generation, it must model inter-track harmony in addition to intra-track coherence. This is usually achieved by composing note segments from different instruments into a single sequence. The composition can be done at different scales, such as note, bar, or track. Most existing work focuses on one particular scale, which limits its ability to model music with diverse temporal and track dependencies. This paper proposes a multi-scale attentive Transformer model to improve the quality of multi-instrument generation. We first employ multiple Transformer decoders to learn multi-instrument representations at different scales and then design an attentive mechanism to fuse the multi-scale information. Experiments on the SOD and LMD datasets show that our model improves both quantitative and qualitative performance over models based on single-scale information. The source code and some generated samples are available at https://github.com/HaRry-qaq/MSAT.
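To make the idea of composition scales and attentive fusion more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a hypothetical toy note format `(onset_in_beats, pitch, track_name)`, a fixed 4-beat bar, and plain dot-product attention as the fusion step; the abstract does not specify the actual tokenization or fusion mechanism, so all names here are illustrative.

```python
import math

# Hypothetical toy notes: (onset_in_beats, pitch, track_name).
NOTES = [
    (0.0, 60, "piano"), (2.0, 64, "piano"), (4.0, 67, "piano"),
    (0.0, 36, "bass"), (4.0, 43, "bass"),
    (1.0, 72, "flute"), (5.0, 76, "flute"),
]
BEATS_PER_BAR = 4  # assumed time signature of 4/4
TRACK_ORDER = ["piano", "bass", "flute"]  # assumed fixed track ordering


def note_scale(notes):
    """Interleave tracks note by note, ordered by onset time."""
    return sorted(notes, key=lambda n: (n[0], TRACK_ORDER.index(n[2])))


def bar_scale(notes):
    """Group by bar; inside each bar, emit one track's notes at a time."""
    return sorted(
        notes,
        key=lambda n: (int(n[0] // BEATS_PER_BAR), TRACK_ORDER.index(n[2]), n[0]),
    )


def track_scale(notes):
    """Emit each track in full before moving on to the next one."""
    return sorted(notes, key=lambda n: (TRACK_ORDER.index(n[2]), n[0]))


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attentive_fuse(query, scale_vectors):
    """Scaled dot-product attention over one vector per scale.

    Returns (weights, fused), where fused is the weighted sum of the
    per-scale vectors. This stands in for the paper's fusion module,
    whose exact form is an assumption here.
    """
    d = len(query)
    scores = [
        sum(q * v for q, v in zip(query, vec)) / math.sqrt(d)
        for vec in scale_vectors
    ]
    weights = softmax(scores)
    fused = [
        sum(w * vec[i] for w, vec in zip(weights, scale_vectors))
        for i in range(d)
    ]
    return weights, fused


if __name__ == "__main__":
    for name, fn in [("note", note_scale), ("bar", bar_scale), ("track", track_scale)]:
        print(name, [pitch for _, pitch, _ in fn(NOTES)])
    print(attentive_fuse([1.0, 0.0], [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]]))
```

The three orderings contain the same notes but place them differently in the sequence, so a decoder trained on each sees different neighbours for a given note. In a real model, each decoder would produce a hidden state per position, and a learned query would replace the hand-picked one used above.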
