Unsupervised Source Separation via Self-Supervised Training

02/08/2022
by   Ertuğ Karamatlı, et al.
0

We introduce two novel unsupervised (blind) source separation methods, which involve self-supervised training from single-channel two-source speech mixtures without any access to the ground truth source signals. Our first method employs permutation invariant training (PIT) to separate artificially-generated mixtures of the original mixtures back into the original mixtures, which we named mixture permutation invariant training (MixPIT). We found this challenging objective to be a valid proxy task for learning to separate the underlying sources. We improve upon this first method by creating mixtures of source estimates and employing PIT to separate these new mixtures in a cyclic fashion. We named this second method cyclic mixture permutation invariant training (MixCycle), where cyclic refers to the fact that we use the same model to produce artificial mixtures and to learn from them continuously. We show that MixPIT outperforms a common baseline (MixIT) on our small dataset (SC09Mix), and they have comparable performance on a standard dataset (LibriMix). Strikingly, we also show that MixCycle surpasses the performance of supervised PIT by being data-efficient, thanks to its inherent data augmentation mechanism. To the best of our knowledge, no other purely unsupervised method is able to match or exceed the performance of supervised training.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2021

Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation

In this paper, we introduce a novel semi-supervised learning framework f...
research
09/01/2023

Remixing-based Unsupervised Source Separation from Scratch

We propose an unsupervised approach for training separation models from ...
research
06/23/2020

Unsupervised Sound Separation Using Mixtures of Mixtures

In recent years, rapid progress has been made on the problem of single-c...
research
11/18/2022

Self-Remixing: Unsupervised Speech Separation via Separation and Remixing

We present Self-Remixing, a novel self-supervised speech separation meth...
research
11/15/2022

Reverberation as Supervision for Speech Separation

This paper proposes reverberation as supervision (RAS), a novel unsuperv...
research
10/13/2021

One to Multiple Mapping Dual Learning: Learning Multiple Sources from One Mixed Signal

Single channel blind source separation (SCBSS) refers to separate multip...

Please sign up or login with your details

Forgot password? Click here to reset