Improving singing voice separation with the Wave-U-Net using Minimum Hyperspherical Energy

10/22/2019
by   Joaquin Perez-Lapillo, et al.
0

In recent years, deep learning has surpassed traditional approaches to the problem of singing voice separation. The Wave-U-Net is a recent deep network architecture that operates directly on the time domain. The standard Wave-U-Net is trained with data augmentation and early stopping to prevent overfitting. Minimum hyperspherical energy (MHE) regularization has recently proven to increase generalization in image classification problems by encouraging a diversified filter configuration. In this work, we apply MHE regularization to the 1D filters of the Wave-U-Net. We evaluated this approach for separating the vocal part from mixed music audio recordings on the MUSDB18 dataset. We found that adding MHE regularization to the loss function consistently improves singing voice separation, as measured in the Signal to Distortion Ratio on test recordings, leading to the current best time-domain system for singing voice extraction.

READ FULL TEXT
research
03/04/2019

Improving singing voice separation using Deep U-Net and Wave-U-Net with data augmentation

State-of-the-art singing voice separation is based on deep learning maki...
research
11/27/2018

Improved Speech Enhancement with the Wave-U-Net

We study the use of the Wave-U-Net architecture for speech enhancement, ...
research
07/06/2020

Revisiting Representation Learning for Singing Voice Separation with Sinkhorn Distances

In this work we present a method for unsupervised learning of audio repr...
research
03/28/2022

Improved singing voice separation with chromagram-based pitch-aware remixing

Singing voice separation aims to separate music into vocals and accompan...
research
06/06/2019

Singing voice separation: a study on training data

In the recent years, singing voice separation systems showed increased p...
research
08/14/2023

PitchNet: A Fully Convolutional Neural Network for Pitch Estimation

In the domain of music and sound processing, pitch extraction plays a pi...
research
09/01/2018

Pillar Universities in Russia: The Rise of "the Second Wave"

The problem of identifying the leading universities in a country is rath...

Please sign up or login with your details

Forgot password? Click here to reset