Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method

05/10/2021
by   Koichi Saito, et al.
0

Audio source separation is often used as preprocessing of various applications, and one of its ultimate goals is to construct a single versatile model capable of dealing with the varieties of audio signals. Since sampling frequency, one of the audio signal varieties, is usually application specific, the preceding audio source separation model should be able to deal with audio signals of all sampling frequencies specified in the target applications. However, conventional models based on deep neural networks (DNNs) are trained only at the sampling frequency specified by the training data, and there are no guarantees that they work with unseen sampling frequencies. In this paper, we propose a convolution layer capable of handling arbitrary sampling frequencies by a single DNN. Through music source separation experiments, we show that the introduction of the proposed layer enables a conventional audio source separation model to consistently work with even unseen sampling frequencies.

READ FULL TEXT

page 1

page 5

research
06/10/2021

Independent Deeply Learned Tensor Analysis for Determined Audio Source Separation

We address the determined audio source separation problem in the time-fr...
research
06/05/2022

Sampling Frequency Independent Dialogue Separation

In some DNNs for audio source separation, the relevant model parameters ...
research
06/19/2023

Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides

In this paper, we propose algorithms for handling non-integer strides in...
research
12/22/2014

Audio Source Separation Using a Deep Autoencoder

This paper proposes a novel framework for unsupervised audio source sepa...
research
10/14/2021

Student-t Networks for Melody Estimation

Melody estimation or melody extraction refers to the extraction of the p...
research
01/28/2020

Time-Domain Audio Source Separation Based on Wave-U-Net Combined with Discrete Wavelet Transform

We propose a time-domain audio source separation method using down-sampl...
research
09/05/2023

A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation

Cinematic audio source separation is a relatively new subtask of audio s...

Please sign up or login with your details

Forgot password? Click here to reset