Complex ISNMF: a Phase-Aware Model for Monaural Audio Source Separation

02/09/2018
by   Paul Magron, et al.
0

This paper introduces a phase-aware probabilistic model for audio source separation. Classical source models in the short-term Fourier transform domain use circularly-symmetric Gaussian or Poisson random variables. This is equivalent to assuming that the phase of each source is uniformly distributed, which is not suitable for exploiting the underlying structure of the phase. Drawing on preliminary works, we introduce here a Bayesian anisotropic Gaussian source model in which the phase is no longer uniform. Such a model permits us to favor a phase value that originates from a signal model through a Markov chain prior structure. The variance of the latent variables are structured with nonnegative matrix factorization (NMF). The resulting model is called complex Itakura-Saito NMF (ISNMF) since it generalizes the ISNMF model to the case of non-isotropic variables. It combines the advantages of ISNMF, which uses a distortion measure adapted to audio and yields a set of estimates which preserve the overall energy of the mixture, and of complex NMF, which enables one to account for some phase constraints. We derive a generalized expectation-maximization algorithm to estimate the model parameters. Experiments conducted on a musical source separation task in a semi-informed setting show that the proposed approach outperforms state-of-the-art phase-aware separation techniques.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/30/2018

Sparse Gaussian process Audio Source Separation Using Spectrum Priors in the Time-Domain

Gaussian process (GP) audio source separation is a time-domain approach ...
research
10/20/2020

Phase recovery with Bregman divergences for audio source separation

Time-frequency audio source separation is usually achieved by estimating...
research
06/21/2011

Online algorithms for Nonnegative Matrix Factorization with the Itakura-Saito divergence

Nonnegative matrix factorization (NMF) is now a common tool for audio so...
research
03/13/2019

Phase-aware Harmonic/Percussive Source Separation via Convex Optimization

Decomposition of an audio mixture into harmonic and percussive component...
research
11/08/2019

Online Spectrogram Inversion for Low-Latency Audio Source Separation

Audio source separation is usually achieved by estimating the short-time...
research
05/12/2011

Closed-form EM for Sparse Coding and its Application to Source Separation

We define and discuss the first sparse coding algorithm based on closed-...
research
06/28/2022

Algorithms for audio inpainting based on probabilistic nonnegative matrix factorization

Audio inpainting, i.e., the task of restoring missing or occluded audio ...

Please sign up or login with your details

Forgot password? Click here to reset