HTMD-Net: A Hybrid Masking-Denoising Approach to Time-Domain Monaural Singing Voice Separation

03/07/2021
by Christos Garoufis, et al.

The advent of deep learning has made deep neural network architectures prevalent in monaural music source separation, with end-to-end approaches that operate directly on the waveform receiving increasing research attention. Among these approaches, transforming the input mixture to a learned latent space and multiplicatively applying a soft mask to the latent mixture achieves the best performance, but is prone to introducing artifacts into the source estimate. To alleviate this problem, we propose a hybrid time-domain approach, termed HTMD-Net, that combines a lightweight masking component with a denoising module based on skip connections, which refines the source estimated by the masking procedure. Evaluation on the task of monaural singing voice separation on the musdb18 dataset indicates that the proposed method achieves competitive performance compared to purely masking-based methods trained under the same conditions, especially regarding behavior during silent segments, while achieving higher computational efficiency.
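The two-stage idea in the abstract (mask a latent representation of the mixture, then refine the decoded estimate through a skip connection) can be illustrated with a toy sketch. This is not the HTMD-Net architecture: the abstract does not specify layers or parameters, so every name here (`encode`, `soft_mask`, `denoise`, `separate`) is hypothetical, the fixed `basis` stands in for a learned encoder/decoder, and the 3-tap smoothing stands in for a learned denoising module.

```python
import math


def sigmoid(x):
    """Squash a logit into (0, 1) so it can act as a soft mask value."""
    return 1.0 / (1.0 + math.exp(-x))


def encode(frame, basis):
    """Project a waveform frame onto each basis row (stand-in for a learned encoder)."""
    return [sum(b * x for b, x in zip(row, frame)) for row in basis]


def decode(latent, basis):
    """Map latent coefficients back to a waveform frame using the transposed basis."""
    n = len(basis[0])
    return [sum(latent[k] * basis[k][i] for k in range(len(basis))) for i in range(n)]


def soft_mask(latent, mask_logits):
    """Multiplicative masking: scale each latent coefficient by a value in (0, 1)."""
    return [sigmoid(m) * z for m, z in zip(mask_logits, latent)]


def denoise(frame, weight=0.1):
    """Residual refinement: output = input + correction (the skip connection).

    The correction is a toy 3-tap smoothing residual, assumed here purely
    for illustration in place of a trained denoising network.
    """
    n = len(frame)
    out = []
    for i in range(n):
        left = frame[max(i - 1, 0)]
        right = frame[min(i + 1, n - 1)]
        smoothed = (left + frame[i] + right) / 3.0
        out.append(frame[i] + weight * (smoothed - frame[i]))
    return out


def separate(frame, basis, mask_logits):
    """Masking stage followed by the denoising stage."""
    latent = encode(frame, basis)
    masked = soft_mask(latent, mask_logits)
    coarse = decode(masked, basis)   # estimate from masking alone
    return denoise(coarse)           # refined estimate


# Usage with an identity basis (an assumption that keeps the example easy to check).
identity = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
mixture = [0.5, -0.2, 0.8, 0.1]
estimate = separate(mixture, identity, mask_logits=[2.0, -2.0, 2.0, -2.0])
```

In a trained system, the mask logits and the correction term would both be produced by networks conditioned on the mixture; the sketch only shows how the two stages compose.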

research
03/05/2023

Hybrid Y-Net Architecture for Singing Voice Separation

This research paper presents a novel deep learning-based neural network ...
research
10/05/2018

End-to-end Networks for Supervised Single-channel Speech Separation

The performance of single channel source separation algorithms has impro...
research
10/22/2019

Two-Step Sound Source Separation: Training on Learned Latent Targets

In this paper, we propose a two-step training procedure for source separ...
research
08/19/2019

Audio query-based music source separation

In recent years, music source separation has been one of the most intens...
research
09/12/2019

TF-Attention-Net: An End To End Neural Network For Singing Voice Separation

In terms of source separation task, most of deep neural networks have tw...
research
06/15/2022

On the Use of Deep Mask Estimation Module for Neural Source Separation Systems

Most of the recent neural source separation systems rely on a masking-ba...
research
11/22/2022

Latent Iterative Refinement for Modular Source Separation

Traditional source separation approaches train deep neural network model...
