FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control

01/26/2022
by   Dimitri von Rütte, et al.
23

Generating music with deep neural networks has been an area of active research in recent years. While the quality of generated samples has been steadily increasing, most methods are only able to exert minimal control over the generated sequence, if any. We propose the self-supervised description-to-sequence task, which allows for fine-grained controllable generation on a global level. We do so by extracting high-level features about the target sequence and learning the conditional distribution of sequences given the corresponding high-level description in a sequence-to-sequence modelling setup. We train FIGARO (FIne-grained music Generation via Attention-based, RObust control) by applying description-to-sequence modelling to symbolic music. By combining learned high level features with domain knowledge, which acts as a strong inductive bias, the model achieves state-of-the-art results in controllable symbolic music generation and generalizes well beyond the training distribution.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/05/2023

Exploring Softly Masked Language Modelling for Controllable Symbolic Music Generation

This document presents some early explorations of applying Softly Masked...
research
06/14/2023

Anticipatory Music Transformer

We introduce anticipation: a method for constructing a controllable gene...
research
07/29/2020

Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature Modelling

High-level musical qualities (such as emotion) are often abstract, subje...
research
07/21/2021

Melody Structure Transfer Network: Generating Music with Separable Self-Attention

Symbolic music generation has attracted increasing attention, while most...
research
04/27/2021

Generating Lead Sheets with Affect: A Novel Conditional seq2seq Framework

The field of automatic music composition has seen great progress in the ...
research
12/16/2017

Automatic Music Highlight Extraction using Convolutional Recurrent Attention Networks

Music highlights are valuable contents for music services. Most methods ...
research
07/05/2023

LOAF-M2L: Joint Learning of Wording and Formatting for Singable Melody-to-Lyric Generation

Despite previous efforts in melody-to-lyric generation research, there i...

Please sign up or login with your details

Forgot password? Click here to reset