ADTOF: A large dataset of non-synthetic music for automatic drum transcription

11/23/2021
by Mickaël Zehren, et al.

The state-of-the-art methods for drum transcription in the presence of melodic instruments (DTM) are machine learning models trained in a supervised manner, which means that they rely on labeled datasets. The problem is that the publicly available datasets are limited either in size or in realism, and are thus suboptimal for training purposes. Indeed, the best results are currently obtained via a rather convoluted multi-step training process that involves both real and synthetic datasets. To address this issue, starting from the observation that communities of rhythm game players provide a large amount of annotated data, we curated a new dataset of crowdsourced drum transcriptions. This dataset contains real-world music, is manually annotated, and is about two orders of magnitude larger than any other non-synthetic dataset, making it a prime candidate for training purposes. However, due to crowdsourcing, the initial annotations contain mistakes. We discuss how the quality of the dataset can be improved by automatically correcting different types of mistakes. When used to train a popular DTM model, the dataset yields performance that matches the state of the art for DTM, thus demonstrating the quality of the annotations.
