MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation

02/01/2018
by   Konstantinos Drossos, et al.
0

Monaural singing voice separation task focuses on the prediction of the singing voice from a single channel music mixture signal. Current state of the art (SOTA) results in monaural singing voice separation are obtained with deep learning based methods. In this work we present a novel deep learning based method that learns long-term temporal patterns and structures of a musical piece. We build upon the recently proposed Masker-Denoiser (MaD) architecture and we enhance it with the Twin Networks, a technique to regularize a recurrent generative network using a backward running copy of the network. We evaluate our method using the Demixing Secret Dataset and we obtain an increment to signal-to-distortion ratio (SDR) of 0.37 dB and to signal-to-interference ratio (SIR) of 0.23 dB, compared to previous SOTA results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/04/2017

Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask

Singing voice separation based on deep learning relies on the usage of t...
research
12/02/2019

Investigating Deep Neural Transformations for Spectrogram-based Musical Source Separation

Musical Source Separation (MSS) is a signal processing task that tries t...
research
02/12/2020

Content Based Singing Voice Extraction From a Musical Mixture

We present a deep learning based methodology for extracting the singing ...
research
03/02/2020

Multichannel Singing Voice Separation by Deep Neural Network Informed DOA Constrained CNMF

This work addresses the problem of multichannel source separation combin...
research
07/06/2020

Revisiting Representation Learning for Singing Voice Separation with Sinkhorn Distances

In this work we present a method for unsupervised learning of audio repr...
research
10/05/2020

D3Net: Densely connected multidilated DenseNet for music source separation

Music source separation involves a large input field to model a long-ter...
research
12/04/2018

Singing Voice Separation Using a Deep Convolutional Neural Network Trained by Ideal Binary Mask and Cross Entropy

Separating a singing voice from its music accompaniment remains an impor...

Please sign up or login with your details

Forgot password? Click here to reset