DeepAI AI Chat
Log In Sign Up

Towards robust music source separation on loud commercial music

08/30/2022
by   Chang-Bin Jeon, et al.
Seoul National University
0

Nowadays, commercial music has extreme loudness and heavily compressed dynamic range compared to the past. Yet, in music source separation, these characteristics have not been thoroughly considered, resulting in the domain mismatch between the laboratory and the real world. In this paper, we confirmed that this domain mismatch negatively affect the performance of the music source separation networks. To this end, we first created the out-of-domain evaluation datasets, musdb-L and XL, by mimicking the music mastering process. Then, we quantitatively verify that the performance of the state-of-the-art algorithms significantly deteriorated in our datasets. Lastly, we proposed LimitAug data augmentation method to reduce the domain mismatch, which utilizes an online limiter during the training data sampling process. We confirmed that it not only alleviates the performance degradation on our out-of-domain datasets, but also results in higher performance on in-domain data.

READ FULL TEXT

page 1

page 2

page 3

page 4

10/23/2020

A Study of Transfer Learning in Music Source Separation

Supervised deep learning methods for performing audio source separation ...
02/19/2021

CatNet: music source separation system with mix-audio augmentation

Music source separation (MSS) is the task of separating a music piece in...
08/31/2021

Music Demixing Challenge 2021

Music source separation has been intensively studied in the last decade ...
11/05/2021

Hybrid Spectrogram and Waveform Source Separation

Source separation models either work on the spectrogram or waveform doma...
08/06/2020

Mixing-Specific Data Augmentation Techniques for Improved Blind Violin/Piano Source Separation

Blind music source separation has been a popular and active subject of r...
07/28/2021

Neural Remixer: Learning to Remix Music with Interactive Control

The task of manipulating the level and/or effects of individual instrume...
04/12/2022

Low Latency Time Domain Multichannel Speech and Music Source Separation

The Goal is to obtain a simple multichannel source separation with very ...