Time-Domain Audio Source Separation Based on Wave-U-Net Combined with Discrete Wavelet Transform

01/28/2020
by   Tomohiko Nakamura, et al.
0

We propose a time-domain audio source separation method using down-sampling (DS) and up-sampling (US) layers based on a discrete wavelet transform (DWT). The proposed method is based on one of the state-of-the-art deep neural networks, Wave-U-Net, which successively down-samples and up-samples feature maps. We find that this architecture resembles that of multiresolution analysis, and reveal that the DS layers of Wave-U-Net cause aliasing and may discard information useful for the separation. Although the effects of these problems may be reduced by training, to achieve a more reliable source separation method, we should design DS layers capable of overcoming the problems. With this belief, focusing on the fact that the DWT has an anti-aliasing filter and the perfect reconstruction property, we design the proposed layers. Experiments on music source separation show the efficacy of the proposed method and the importance of simultaneously considering the anti-aliasing filters and the perfect reconstruction property.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2018

Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation

Models for audio source separation usually operate on the magnitude spec...
research
05/10/2021

Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method

Audio source separation is often used as preprocessing of various applic...
research
11/05/2018

End-to-End Sound Source Separation Conditioned On Instrument Labels

Can we perform an end-to-end sound source separation (SSS) with a variab...
research
11/29/2019

J-Net: Randomly weighted U-Net for audio source separation

Several results in the computer vision literature have shown the potenti...
research
11/23/2021

Upsampling layers for music source separation

Upsampling artifacts are caused by problematic upsampling layers and due...
research
06/19/2023

Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides

In this paper, we propose algorithms for handling non-integer strides in...
research
01/15/2019

Spectrogram Feature Losses for Music Source Separation

In this paper we study deep learning-based music source separation, and ...

Please sign up or login with your details

Forgot password? Click here to reset