Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity

09/18/2019
by   Ethan Manilow, et al.
0

Music source separation performance has greatly improved in recent years with the advent of approaches based on deep learning. Such methods typically require large amounts of labelled training data, which in the case of music consist of mixtures and corresponding instrument stems. However, stems are unavailable for most commercial music, and only limited datasets have so far been released to the public. It can thus be difficult to draw conclusions when comparing various source separation methods, as the difference in performance may stem as much from better data augmentation techniques or training tricks to alleviate the limited availability of training data, as from intrinsically better model architectures and objective functions. In this paper, we present the synthesized Lakh dataset (Slakh) as a new tool for music source separation research. Slakh consists of high-quality renderings of instrumental mixtures and corresponding stems generated from the Lakh MIDI dataset (LMD) using professional-grade sample-based virtual instruments. A first version, Slakh2100, focuses on 2100 songs, resulting in 145 hours of mixtures. While not fully comparable because it is purely instrumental, this dataset contains an order of magnitude more data than MUSDB18, the de facto standard dataset in the field. We show that Slakh can be used to effectively augment existing datasets for musical instrument separation, while opening the door to a wide array of data-intensive music signal analysis tasks.

READ FULL TEXT
research
04/19/2022

Music Source Separation with Generative Flow

Full supervision models for source separation are trained on mixture-sou...
research
10/23/2020

A Study of Transfer Learning in Music Source Separation

Supervised deep learning methods for performing audio source separation ...
research
09/07/2022

Improving Choral Music Separation through Expressive Synthesized Data from Sampled Instruments

Choral music separation refers to the task of extracting tracks of voice...
research
09/30/2022

Music Source Separation with Band-split RNN

The performance of music source separation (MSS) models has been greatly...
research
07/24/2023

Self-refining of Pseudo Labels for Music Source Separation with Noisy Labeled Data

Music source separation (MSS) faces challenges due to the limited availa...
research
09/05/2022

Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model

Similar to colorization in computer vision, instrument separation is to ...
research
04/23/2018

An Overview of Lead and Accompaniment Separation in Music

Popular music is often composed of an accompaniment and a lead component...

Please sign up or login with your details

Forgot password? Click here to reset