The Power of Reuse: A Multi-Scale Transformer Model for Structural Dynamic Segmentation in Symbolic Music Generation

05/17/2022
by Guowei Wu, et al.

Symbolic music generation relies on the contextual representation capabilities of the generative model, and Transformer-based models are currently the most prevalent approach. Learning long-term context also depends on the dynamic segmentation of musical structures such as intro, verse, and chorus, which the research community has so far overlooked. In this paper, we propose a multi-scale Transformer that uses a coarse decoder and fine decoders to model context at the global level and the section level, respectively. Concretely, we design a Fragment Scope Localization layer to segment the music into sections, which are then used to pre-train the fine decoders. We then design a Music Style Normalization layer that transfers style information from the original sections to the generated sections, keeping the musical style consistent. The generated sections are combined in the aggregation layer and fine-tuned by the coarse decoder. We evaluate our model on two open MIDI datasets, and experiments show that it outperforms the best contemporary symbolic music generation models. Visual evaluation further shows that our model is superior in melody reuse, resulting in more realistic music.
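
The abstract does not specify how the sections are cut or how style information is transferred, so the following is only a rough sketch of the two ideas under stated assumptions. It assumes that section boundaries are already available as start indices (in the paper they would come from the Fragment Scope Localization layer) and that "style" can be approximated by the mean and standard deviation of a feature sequence, in the spirit of adaptive instance normalization. The function names `split_into_sections` and `transfer_style` are hypothetical and are not taken from the paper.

```python
from statistics import fmean, pstdev


def split_into_sections(tokens, boundaries):
    """Cut a token sequence at the given start indices.

    Assumption: `boundaries` stands in for the output of a learned
    segmentation layer; here it is simply supplied by the caller.
    """
    inner = sorted({b for b in boundaries if 0 < b < len(tokens)})
    cuts = [0] + inner + [len(tokens)]
    return [tokens[a:b] for a, b in zip(cuts, cuts[1:])]


def transfer_style(generated, original, eps=1e-8):
    """Rescale `generated` so its mean and spread match `original`.

    Assumption: this statistics-matching step is an illustrative
    substitute, not the paper's Music Style Normalization layer.
    """
    g_mean, g_std = fmean(generated), pstdev(generated)
    o_mean, o_std = fmean(original), pstdev(original)
    # Normalize the generated features, then re-scale with the
    # statistics of the original section.
    return [(x - g_mean) / (g_std + eps) * o_std + o_mean for x in generated]


# Usage: cut a toy sequence into three sections, then align the
# statistics of a "generated" section with an "original" one.
sections = split_into_sections(list(range(10)), [3, 7])
styled = transfer_style([1.0, 2.0, 3.0], [10.0, 20.0, 30.0])
```

In a real model the statistics would be computed per feature channel over hidden states rather than over a flat list of numbers; the flat version is used here only to keep the example self-contained.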


research
05/10/2021

MuseMorphose: Full-Song and Fine-Grained Music Style Transfer with One Transformer VAE

Transformers and variational autoencoders (VAE) have been extensively em...
research
09/20/2018

Symbolic Music Genre Transfer with CycleGAN

Deep generative models such as Variational Autoencoders (VAEs) and Gener...
research
04/18/2023

From Words to Music: A Study of Subword Tokenization Techniques in Symbolic Music Generation

Subword tokenization has been widely successful in text-based natural la...
research
07/30/2021

DadaGP: A Dataset of Tokenized GuitarPro Songs for Sequence Models

Originating in the Renaissance and burgeoning in the digital era, tablat...
research
02/13/2022

Learning long-term music representations via hierarchical contextual constraints

Learning symbolic music representations, especially disentangled represe...
research
12/10/2019

Encoding Musical Style with Transformer Autoencoders

We consider the problem of learning high-level controls over the global ...
research
10/06/2022

Melody Infilling with User-Provided Structural Context

This paper proposes a novel Transformer-based model for music score infi...
