Mixup-breakdown: a consistency training method for improving generalization of speech separation models

10/28/2019
by   Max W. Y. Lam, et al.

Deep-learning-based speech separation models generalize poorly: even state-of-the-art models can fail abruptly when evaluated under mismatched conditions. To address this problem, we propose an easy-to-implement yet effective consistency-based semi-supervised learning (SSL) approach, namely Mixup-Breakdown training (MBT). It learns a teacher model to "breakdown" unlabeled inputs, and the estimated separations are interpolated to produce more useful pseudo "mixup" input-output pairs, on which consistency regularization can be applied to learn a student model. In our experiments, we evaluate MBT under various conditions with ascending degrees of mismatch, including unseen interfering speech, noise, and music, and compare MBT's generalization capability against state-of-the-art supervised learning and SSL approaches. The results indicate that MBT significantly outperforms several strong baselines by up to 13.77%. Moreover, MBT adds only negligible computational overhead to standard training schemes.
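
The breakdown-then-mixup step described in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the function names, the Beta-distributed interpolation weight, the two-source setting, and the mean-squared-error consistency loss are all assumptions made for illustration, and the toy separator stands in for a real neural network.

```python
import random


def sample_lambda(alpha=1.0):
    """Draw an interpolation weight in [0, 1] (the Beta prior is an assumption)."""
    return random.betavariate(alpha, alpha)


def mixup_breakdown_pair(teacher, mixture, lam):
    """Build one pseudo input-output pair from an unlabeled mixture."""
    # "Breakdown": the teacher estimates two sources from the mixture.
    est_a, est_b = teacher(mixture)
    # "Mixup": rescale the estimates with weights lam and (1 - lam) ...
    target_a = [lam * a for a in est_a]
    target_b = [(1.0 - lam) * b for b in est_b]
    # ... and sum them sample by sample to form the pseudo input.
    pseudo_input = [a + b for a, b in zip(target_a, target_b)]
    return pseudo_input, (target_a, target_b)


def consistency_loss(student, pseudo_input, targets):
    """Mean squared error between student outputs and pseudo targets.

    MSE is a stand-in loss (assumption); source ordering is taken as fixed.
    """
    outputs = student(pseudo_input)
    errors = [
        (o - t) ** 2
        for out, tgt in zip(outputs, targets)
        for o, t in zip(out, tgt)
    ]
    return sum(errors) / len(errors)


def toy_separator(signal):
    """Hypothetical separator: splits every sample 25% / 75%."""
    return [0.25 * s for s in signal], [0.75 * s for s in signal]


unlabeled = [1.0, -2.0, 4.0]
lam = sample_lambda()
pseudo_input, targets = mixup_breakdown_pair(toy_separator, unlabeled, lam)
loss = consistency_loss(toy_separator, pseudo_input, targets)
print(f"lambda={lam:.3f}, consistency loss={loss:.6f}")
```

In this sketch the teacher's outputs are treated as fixed targets, so a training loop would update only the student on the resulting loss. How the teacher itself is obtained or updated is left out here.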


