MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation

11/14/2022
by   Chang-Bin Jeon, et al.

Separating multiple singing voices into individual voices is a rarely studied area in music source separation research, and the absence of a benchmark dataset has hindered its progress. In this paper, we present an evaluation dataset and provide baseline studies for multiple singing voices separation. First, we introduce MedleyVox, an evaluation dataset for multiple singing voices separation. We specify the problem definition in this dataset by categorizing the problem into i) duet, ii) unison, iii) main vs. rest, and iv) N-singing separation, and the dataset corresponds to these categories. Second, we present a strategy for constructing multiple singing mixtures from various single-singing datasets, which can be used to obtain training data. Third, we propose the improved super-resolution network (iSRNet). Jointly trained with the Conv-TasNet and the multi-singing mixture construction strategy, the proposed iSRNet achieved performance comparable to that of ideal time-frequency masks on the duet and unison subsets of MedleyVox. Audio samples, the dataset, and code are available on our GitHub page (https://github.com/jeonchangbin49/MedleyVox).
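
The abstract does not spell out the mixture construction strategy, so the following is only a minimal sketch of the general idea of building a multi-singing training mixture from single-singing recordings. It assumes mono signals already loaded as NumPy arrays at a common sample rate; the function name `make_mixture` and the parameters `gains_db` and `peak` are hypothetical and are not taken from the paper or its repository.

```python
import numpy as np


def make_mixture(voices, gains_db, peak=0.9):
    """Sum single-voice signals into one mixture (illustrative only).

    voices   : list of 1-D float arrays (mono, same sample rate)
    gains_db : per-voice gain in decibels (hypothetical parameter)
    peak     : target maximum absolute amplitude of the mixture
    Returns (mixture, targets); the targets sum to the mixture.
    """
    # Trim every voice to the shortest one so they can be stacked.
    length = min(len(v) for v in voices)
    stacked = np.stack(
        [np.asarray(v[:length], dtype=np.float64) for v in voices]
    )

    # Convert per-voice gains from dB to linear amplitude and apply them.
    gains = 10.0 ** (np.asarray(gains_db, dtype=np.float64) / 20.0)
    targets = stacked * gains[:, None]

    # The mixture is the sample-wise sum of the scaled voices.
    mixture = targets.sum(axis=0)

    # Rescale mixture and targets by the same factor to avoid clipping,
    # which keeps the targets consistent with the mixture.
    max_abs = np.max(np.abs(mixture))
    if max_abs > 0:
        scale = peak / max_abs
        targets = targets * scale
        mixture = mixture * scale
    return mixture, targets


# Toy stand-ins for two single-singing recordings (sine tones, not real voices).
sr = 16000
t = np.arange(sr) / sr
voice_a = np.sin(2 * np.pi * 220.0 * t)
voice_b = np.sin(2 * np.pi * 330.0 * t[: sr // 2])

mixture, targets = make_mixture([voice_a, voice_b], gains_db=[0.0, -3.0])
```

Here `mixture` would serve as the input to a separation model and `targets` as the per-voice references. How the paper actually selects, aligns, or weights the source recordings is not stated in the abstract and is not implied by this sketch.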

research
04/20/2021

A cappella: Audio-visual Singing Voice Separation

Music source separation can be interpreted as the estimation of the cons...
research
08/19/2019

Audio query-based music source separation

In recent years, music source separation has been one of the most intens...
research
09/12/2021

Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation

Deep neural network based methods have been successfully applied to musi...
research
05/12/2023

Benchmarks and leaderboards for sound demixing tasks

Music demixing is the task of separating different tracks from the given...
research
12/09/2022

Hyperbolic Audio Source Separation

We introduce a framework for audio source separation using embeddings on...
research
02/01/2018

Approximate Message Passing for Underdetermined Audio Source Separation

Approximate message passing (AMP) algorithms have shown great promise in...
research
11/29/2022

jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus

We construct a corpus of Japanese a cappella vocal ensembles (jaCappella...
