Surrogate Source Model Learning for Determined Source Separation

11/11/2020
by   Robin Scheibler, et al.
0

We propose to learn surrogate functions of universal speech priors for determined blind speech separation. Deep speech priors are highly desirable due to their high modelling power, but are not compatible with state-of-the-art independent vector analysis based on majorization-minimization (AuxIVA), since deriving the required surrogate function is not easy, nor always possible. Instead, we do away with exact majorization and directly approximate the surrogate. Taking advantage of iterative source steering (ISS) updates, we back propagate the permutation invariant separation loss through multiple iterations of AuxIVA. ISS lends itself well to this task due to its lower complexity and lack of matrix inversion. Experiments show large improvements in terms of scale invariant signal-to-distortion (SDR) ratio and word error rate compared to baseline methods. Training is done on two speakers mixtures and we experiment with two losses, SDR and coherence. We find that the learnt approximate surrogate generalizes well on mixtures of three and four speakers without any modification. We also demonstrate generalization to a different variation of the AuxIVA update equations. The SDR loss leads to fastest convergence in iterations, while coherence leads to the lowest word error rate (WER). We obtain as much as 36

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

research
11/30/2020

Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation

Time-domain training criteria have proven to be very effective for the s...
research
11/15/2021

Monaural source separation: From anechoic to reverberant environments

Impressive progress in neural network-based single-channel speech source...
research
04/18/2021

Many-Speakers Single Channel Speech Separation with Optimal Permutation Training

Single channel speech separation has experienced great progress in the l...
research
10/21/2022

Adversarial Permutation Invariant Training for Universal Sound Separation

Universal sound separation consists of separating mixes with arbitrary s...
research
05/31/2023

UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures

In reverberant conditions with multiple concurrent speakers, each microp...
research
08/23/2020

Independent Vector Analysis via Log-Quadratically Penalized Quadratic Minimization

We propose a new algorithm for blind source separation of convolutive mi...
research
08/23/2020

Independent Vector Analysis with Deep Neural Network Source Priors

This paper studies the density priors for independent vector analysis (I...

Please sign up or login with your details

Forgot password? Click here to reset