Multichannel Singing Voice Separation by Deep Neural Network Informed DOA Constrained CNMF

This work addresses the problem of multichannel source separation combining two powerful approaches, multichannel spectral factorization with recent monophonic deep-learning (DL) based spectrum inference. Individual source spectra at different channels are estimated with a Masker-Denoiser Twin Network (MaD TwinNet), able to model long-term temporal patterns of a musical piece. The monophonic source spectrograms are used within a spatial covariance mixing model based on Complex Non-Negative Matrix Factorization (CNMF) that predicts the spatial characteristics of each source. The proposed framework is evaluated on the task of singing voice separation with a large multichannel dataset. Experimental results show that our joint DL+CNMF method outperforms both the individual monophonic DL-based separation and the multichannel CNMF baseline methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/24/2022

Unsupervised Audio Source Separation Using Differentiable Parametric Source Models

Supervised deep learning approaches to underdetermined audio source sepa...
research
02/01/2018

MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation

Monaural singing voice separation task focuses on the prediction of the ...
research
03/18/2019

A Vocoder Based Method For Singing Voice Extraction

This paper presents a novel method for extracting the vocal track from a...
research
06/27/2012

Variational Inference in Non-negative Factorial Hidden Markov Models for Efficient Audio Source Separation

The past decade has seen substantial work on the use of non-negative mat...
research
10/31/2017

SVSGAN: Singing Voice Separation via Generative Adversarial Network

Separating two sources from an audio mixture is an important task with m...
research
08/11/2020

Exploring Aligned Lyrics-Informed Singing Voice Separation

In this paper, we propose a method of utilizing aligned lyrics as additi...
research
01/12/2018

Separation of Instrument Sounds using Non-negative Matrix Factorization with Spectral Envelope Constraints

Spectral envelope is one of the most important features that characteriz...

Please sign up or login with your details

Forgot password? Click here to reset