Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides

06/19/2023
by   Kanami Imamura, et al.
0

In this paper, we propose algorithms for handling non-integer strides in sampling-frequency-independent (SFI) convolutional and transposed convolutional layers. The SFI layers have been developed for handling various sampling frequencies (SFs) by a single neural network. They are replaceable with their non-SFI counterparts and can be introduced into various network architectures. However, they could not handle some specific configurations when combined with non-SFI layers. For example, an SFI extension of Conv-TasNet, a standard audio source separation model, cannot handle some pairs of trained and target SFs because the strides of the SFI layers become non-integers. This problem cannot be solved by simple rounding or signal resampling, resulting in the significant performance degradation. To overcome this problem, we propose algorithms for handling non-integer strides by using windowed sinc interpolation. The proposed algorithms realize the continuous-time representations of features using the interpolation and enable us to sample instants with the desired stride. Experimental results on music source separation showed that the proposed algorithms outperformed the rounding- and signal-resampling-based methods at SFs lower than the trained SF.

READ FULL TEXT
research
05/10/2021

Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method

Audio source separation is often used as preprocessing of various applic...
research
11/23/2021

Upsampling layers for music source separation

Upsampling artifacts are caused by problematic upsampling layers and due...
research
06/05/2022

Sampling Frequency Independent Dialogue Separation

In some DNNs for audio source separation, the relevant model parameters ...
research
01/28/2020

Time-Domain Audio Source Separation Based on Wave-U-Net Combined with Discrete Wavelet Transform

We propose a time-domain audio source separation method using down-sampl...
research
12/22/2014

Audio Source Separation with Discriminative Scattering Networks

In this report we describe an ongoing line of research for solving singl...
research
10/05/2020

D3Net: Densely connected multidilated DenseNet for music source separation

Music source separation involves a large input field to model a long-ter...
research
03/17/2020

Hyperplane Arrangements of Trained ConvNets Are Biased

We investigate the geometric properties of the functions learned by trai...

Please sign up or login with your details

Forgot password? Click here to reset