Phase recovery with Bregman divergences for audio source separation

10/20/2020
by Paul Magron, et al.

Time-frequency audio source separation is usually achieved by estimating the short-time Fourier transform (STFT) magnitude of each source, and then applying a phase recovery algorithm to retrieve time-domain signals. In particular, the multiple input spectrogram inversion (MISI) algorithm has shown good performance in several recent works. This algorithm minimizes a quadratic reconstruction error between magnitude spectrograms. However, this loss does not properly account for some perceptual properties of audio, and alternative discrepancy measures such as beta-divergences have been preferred in many settings. In this paper, we propose to reformulate phase recovery in audio source separation as a minimization problem involving Bregman divergences. To optimize the resulting objective, we derive a projected gradient descent algorithm. Experiments conducted on a speech enhancement task show that this approach outperforms MISI for several alternative losses, which highlights their relevance for audio source separation applications.
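To make the alternative losses mentioned above concrete, here is a minimal sketch of the beta-divergence, a family of Bregman divergences that contains the quadratic loss (beta = 2), the generalized Kullback-Leibler divergence (beta = 1), and the Itakura-Saito divergence (beta = 0). The function name `beta_divergence` and the toy arrays are assumptions made for illustration; this is not the authors' implementation, and it only evaluates the loss between two magnitude arrays rather than performing phase recovery.

```python
import numpy as np

def beta_divergence(x, y, beta):
    """Sum of element-wise beta-divergences d_beta(x | y).

    Hypothetical helper for illustration; x and y must be strictly positive.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if beta == 0:
        # Itakura-Saito divergence (limit case beta -> 0)
        ratio = x / y
        return float(np.sum(ratio - np.log(ratio) - 1.0))
    if beta == 1:
        # Generalized Kullback-Leibler divergence (limit case beta -> 1)
        return float(np.sum(x * np.log(x / y) - x + y))
    # General case; beta = 2 gives half the squared Euclidean distance
    num = x**beta + (beta - 1) * y**beta - beta * x * y**(beta - 1)
    return float(np.sum(num / (beta * (beta - 1))))

# Toy "magnitude spectrograms" (assumed values, not data from the paper)
target = np.array([1.0, 2.0, 3.0])
estimate = np.array([2.0, 2.0, 1.0])
for beta in (0, 1, 2):
    print(beta, beta_divergence(target, estimate, beta))
```

With beta = 2 the value reduces to half the squared error between the two arrays, the kind of quadratic loss the abstract attributes to MISI (up to a constant factor); smaller values of beta penalize errors on low-energy bins relatively more.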

research
11/08/2019

Online Spectrogram Inversion for Low-Latency Audio Source Separation

Audio source separation is usually achieved by estimating the short-time...
research
03/03/2023

Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints

Audio source separation is often achieved by estimating the magnitude sp...
research
07/30/2018

Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery

Harmonic/percussive source separation (HPSS) consists in separating the ...
research
02/09/2018

Complex ISNMF: a Phase-Aware Model for Monaural Audio Source Separation

This paper introduces a phase-aware probabilistic model for audio source...
research
07/25/2020

AutoClip: Adaptive Gradient Clipping for Source Separation Networks

Clipping the gradient is a known approach to improving gradient descent,...
research
09/30/2016

Phase Unmixing : Multichannel Source Separation with Magnitude Constraints

We consider the problem of estimating the phases of K mixed complex sign...
research
10/01/2020

Phase retrieval with Bregman divergences and application to audio signal recovery

Phase retrieval (PR) aims to recover a signal from the magnitudes of a s...