Multi-task U-Net for Music Source Separation

03/23/2020
by   Venkatesh S. Kadandale, et al.
0

A fairly straightforward approach for music source separation is to train independent models, wherein each model is dedicated for estimating only a specific source. Training a single model to estimate multiple sources generally does not perform as well as the independent dedicated models. However, Conditioned U-Net (C-U-Net) uses a control mechanism to train a single model for multi-source separation and attempts to achieve a performance comparable to that of the dedicated models. We propose a multi-task U-Net (M-U-Net) trained using a weighted multi-task loss as an alternative to the C-U-Net. We investigate two weighting strategies for our multi-task loss: 1) Dynamic Weighted Average (DWA), and 2) Energy Based Weighting (EBW). DWA determines the weights by tracking the rate of change of loss of each task during training. EBW aims to neutralize the effect of the training bias arising from the difference in energy levels of each of the sources in a mixture. Our methods provide two-fold advantages compared to the C-U-Net: 1) Fewer effective training iterations with no conditioning, and 2) Fewer trainable network parameters (no control parameters). Our methods achieve performance comparable to that of C-U-Net and the dedicated U-Nets at a much lower training cost.

READ FULL TEXT

page 1

page 4

page 5

research
03/23/2020

Multi-channel U-Net for Music Source Separation

A fairly straightforward approach for music source separation is to trai...
research
02/17/2020

Meta-learning Extractors for Music Source Separation

We propose a hierarchical meta-learning-inspired model for music source ...
research
07/02/2019

Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for Multiple Source Separations

Data-driven models for audio source separation such as U-Net or Wave-U-N...
research
08/14/2019

Interleaved Multitask Learning for Audio Source Separation with Independent Databases

Deep Neural Network-based source separation methods usually train indepe...
research
11/24/2021

LightSAFT: Lightweight Latent Source Aware Frequency Transform for Source Separation

Conditioned source separations have attracted significant attention beca...
research
11/29/2019

J-Net: Randomly weighted U-Net for audio source separation

Several results in the computer vision literature have shown the potenti...
research
09/03/2019

Demucs: Deep Extractor for Music Sources with extra unlabeled data remixed

We study the problem of source separation for music using deep learning ...

Please sign up or login with your details

Forgot password? Click here to reset