The MIDI Degradation Toolkit: Symbolic Music Augmentation and Correction

09/30/2020
by Andrew McLeod, et al.

In this paper, we introduce the MIDI Degradation Toolkit (MDTK), a set of functions that take a musical excerpt (a set of notes, each with a pitch, onset time, and duration) as input and return a "degraded" version of that excerpt with one or more errors introduced. Using the toolkit, we create the Altered and Corrupted MIDI Excerpts dataset version 1.0 (ACME v1.0), and propose four tasks of increasing difficulty: detecting, classifying, locating, and correcting the degradations. We hypothesize that models trained for these tasks can be useful, for example, for improving automatic music transcription performance when applied as a post-processing step. To that end, MDTK includes a script that measures the distribution of different types of errors in a transcription and creates a degraded dataset with similar properties. MDTK's degradations can also be applied dynamically to a dataset during training (with or without the above script), generating novel degraded excerpts each epoch. MDTK could also be used to test how robust systems that take MIDI (or similar) data as input (e.g. systems designed for voice separation, metrical alignment, or chord detection) are to such transcription errors or otherwise noisy data. The toolkit and dataset are both publicly available online, and we encourage contributions and feedback from the community.
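To make the idea of a degradation concrete, the following is a minimal, self-contained sketch of two such functions and of applying them dynamically, once per epoch. It is not MDTK's actual API: the `Note` class, the function names `pitch_shift`, `remove_note`, and `degrade`, the millisecond time unit, and the `max_shift` parameter are all assumptions made for illustration. Each function returns a new excerpt and leaves its input unchanged, and `degrade` also returns the name of the degradation it applied, which is the kind of label a classification task would need.

```python
import random
from dataclasses import dataclass, replace
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class Note:
    """Hypothetical note representation (not MDTK's own data format)."""
    pitch: int     # MIDI pitch number, 0-127
    onset: int     # onset time, assumed here to be in milliseconds
    duration: int  # duration, assumed here to be in milliseconds


Excerpt = List[Note]


def pitch_shift(excerpt: Excerpt, rng: random.Random, max_shift: int = 12) -> Excerpt:
    """Return a copy of the excerpt with one random note moved to a new pitch."""
    if not excerpt:
        raise ValueError("cannot degrade an empty excerpt")
    index = rng.randrange(len(excerpt))
    note = excerpt[index]
    # Candidate pitches: within max_shift semitones, inside the MIDI range,
    # and different from the original pitch so the note really changes.
    low = max(0, note.pitch - max_shift)
    high = min(127, note.pitch + max_shift)
    candidates = [p for p in range(low, high + 1) if p != note.pitch]
    degraded = list(excerpt)
    degraded[index] = replace(note, pitch=rng.choice(candidates))
    return degraded


def remove_note(excerpt: Excerpt, rng: random.Random) -> Excerpt:
    """Return a copy of the excerpt with one random note deleted."""
    if len(excerpt) < 2:
        raise ValueError("need at least two notes to remove one")
    index = rng.randrange(len(excerpt))
    return excerpt[:index] + excerpt[index + 1:]


# Registry mapping a degradation label to the function that applies it.
DEGRADATIONS: Dict[str, Callable[[Excerpt, random.Random], Excerpt]] = {
    "pitch_shift": pitch_shift,
    "remove_note": remove_note,
}


def degrade(excerpt: Excerpt, rng: random.Random) -> Tuple[Excerpt, str]:
    """Apply one randomly chosen degradation; return the result and its label."""
    label = rng.choice(sorted(DEGRADATIONS))
    return DEGRADATIONS[label](excerpt, rng), label


if __name__ == "__main__":
    clean = [Note(60, 0, 500), Note(64, 500, 500), Note(67, 1000, 500)]
    rng = random.Random(0)
    # Dynamic use: a fresh degraded version of the same excerpt each epoch.
    for epoch in range(3):
        degraded, label = degrade(clean, rng)
        print(epoch, label, degraded)
```

Passing an explicit `random.Random` instance keeps the sketch reproducible when seeded, while still yielding a different degraded excerpt on each call within a training run.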
