Shift-Invariant Kernel Additive Modelling for Audio Source Separation

11/01/2017
by   Benito García, et al.
0

A major goal in blind source separation to identify and separate sources is to model their inherent characteristics. While most state-of-the-art approaches are supervised methods trained on large datasets, interest in non-data-driven approaches such as Kernel Additive Modelling (KAM) remains high due to their interpretability and adaptability. KAM performs the separation of a given source applying robust statistics on the time-frequency bins selected by a source-specific kernel function, commonly the K-NN function. This choice assumes that the source of interest repeats in both time and frequency. In practice, this assumption does not always hold. Therefore, we introduce a shift-invariant kernel function capable of identifying similar spectral content even under frequency shifts. This way, we can considerably increase the amount of suitable sound material available to the robust statistics. While this leads to an increase in separation performance, a basic formulation, however, is computationally expensive. Therefore, we additionally present acceleration techniques that lower the overall computational complexity.

READ FULL TEXT
research
04/06/2018

Does k Matter? k-NN Hubness Analysis for Kernel Additive Modelling Vocal Separation

Kernel Additive Modelling (KAM) is a framework for source separation aim...
research
11/06/2018

Bootstrapping single-channel source separation via unsupervised spatial clustering on stereo mixtures

Separating an audio scene into isolated sources is a fundamental problem...
research
03/10/2023

Distribution Preserving Source Separation With Time Frequency Predictive Models

We provide an example of a distribution preserving source separation met...
research
07/14/2020

Sudo rm -rf: Efficient Networks for Universal Audio Source Separation

In this paper, we present an efficient neural network for end-to-end gen...
research
02/01/2018

Approximate Message Passing for Underdetermined Audio Source Separation

Approximate message passing (AMP) algorithms have shown great promise in...
research
12/10/2018

A Computationally Efficient and Practically Feasible Two Microphones Blind Speech Separation Method

Traditionally, Blind Speech Separation techniques are computationally ex...
research
05/24/2018

FastFCA-AS: Joint Diagonalization Based Acceleration of Full-Rank Spatial Covariance Analysis for Separating Any Number of Sources

Here we propose FastFCA-AS, an accelerated algorithm for Full-rank spati...

Please sign up or login with your details

Forgot password? Click here to reset