End-to-End Sound Source Separation Conditioned On Instrument Labels

11/05/2018
by Olga Slizovskaia, et al.

Can a deep learning model perform end-to-end sound source separation (SSS) with a variable number of sources? This paper presents an extension of the Wave-U-Net model that allows end-to-end monaural source separation with a non-fixed number of sources. Furthermore, we propose multiplicative conditioning with instrument labels at the bottleneck of the Wave-U-Net and show its effect on the separation results. The same approach can be extended to other types of conditioning, such as audio-visual SSS and score-informed SSS.
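As a rough illustration of what multiplicative conditioning at a bottleneck can look like, the NumPy sketch below scales each bottleneck channel by a gate computed from a binary instrument-label vector. The abstract does not give the exact formulation, so the linear projection, the parameter names (`weight`, `bias`), the function name `condition_bottleneck`, and all tensor shapes are assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def condition_bottleneck(features, labels, weight, bias):
    """Scale bottleneck features by a gate derived from instrument labels.

    features: (channels, time) bottleneck activations (assumed layout)
    labels:   (num_instruments,) binary vector, 1.0 = instrument present
    weight:   (channels, num_instruments) hypothetical learned projection
    bias:     (channels,) hypothetical learned bias
    """
    gate = weight @ labels + bias        # one scale factor per channel
    return features * gate[:, None]      # broadcast the gate over time


# Toy example with random values standing in for learned parameters.
rng = np.random.default_rng(0)
channels, time_steps, num_instruments = 8, 16, 4
features = rng.standard_normal((channels, time_steps))
weight = rng.standard_normal((channels, num_instruments))
bias = np.zeros(channels)

labels = np.array([1.0, 0.0, 1.0, 0.0])  # instruments 0 and 2 are present
conditioned = condition_bottleneck(features, labels, weight, bias)
```

In this sketch the label vector has a fixed length, one entry per candidate instrument, so the set of active sources can vary between inputs without changing the network shape. A different conditioning signal (for example, visual or score features) would replace `labels` with another vector projected in the same way.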
