On the Use of Deep Mask Estimation Module for Neural Source Separation Systems

06/15/2022
by   Kai Li, et al.
0

Most of the recent neural source separation systems rely on a masking-based pipeline where a set of multiplicative masks are estimated from and applied to a signal representation of the input mixture. The estimation of such masks, in almost all network architectures, is done by a single layer followed by an optional nonlinear activation function. However, recent literatures have investigated the use of a deep mask estimation module and observed performance improvement compared to a shallow mask estimation module. In this paper, we analyze the role of such deeper mask estimation module by connecting it to a recently proposed unsupervised source separation method, and empirically show that the deep mask estimation module is an efficient approximation of the so-called overseparation-grouping paradigm with the conventional shallow mask estimation layers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/02/2018

Phasebook and Friends: Leveraging Discrete Representations for Source Separation

Deep learning based speech enhancement and source separation systems hav...
research
03/24/2015

Probabilistic Binary-Mask Cocktail-Party Source Separation in a Convolutional Deep Neural Network

Separation of competing speech is a key challenge in signal processing a...
research
10/23/2020

GSEP: A robust vocal and accompaniment separation system using gated CBHG module and loudness normalization

In the field of audio signal processing research, source separation has ...
research
09/12/2021

Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation

Deep neural network based methods have been successfully applied to musi...
research
03/14/2023

Multi-Channel Masking with Learnable Filterbank for Sound Source Separation

This work proposes a learnable filterbank based on a multi-channel maski...
research
07/31/2023

Deep Learning Meets Adaptive Filtering: A Stein's Unbiased Risk Estimator Approach

This paper revisits two prominent adaptive filtering algorithms through ...
research
03/07/2021

HTMD-Net: A Hybrid Masking-Denoising Approach to Time-Domain Monaural Singing Voice Separation

The advent of deep learning has led to the prevalence of deep neural net...

Please sign up or login with your details

Forgot password? Click here to reset