MADI: Inter-domain Matching and Intra-domain Discrimination for Cross-domain Speech Recognition

02/22/2023
by   Jiaming Zhou, et al.
0

End-to-end automatic speech recognition (ASR) usually suffers from performance degradation when applied to a new domain due to domain shift. Unsupervised domain adaptation (UDA) aims to improve the performance on the unlabeled target domain by transferring knowledge from the source to the target domain. To improve transferability, existing UDA approaches mainly focus on matching the distributions of the source and target domains globally and/or locally, while ignoring the model discriminability. In this paper, we propose a novel UDA approach for ASR via inter-domain MAtching and intra-domain DIscrimination (MADI), which improves the model transferability by fine-grained inter-domain matching and discriminability by intra-domain contrastive discrimination simultaneously. Evaluations on the Libri-Adapt dataset demonstrate the effectiveness of our approach. MADI reduces the relative word error rate (WER) on cross-device and cross-environment ASR by 17.7 respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/15/2021

Cross-domain Speech Recognition with Unsupervised Character-level Distribution Matching

End-to-end automatic speech recognition (ASR) can achieve promising perf...
research
07/19/2017

Unsupervised Domain Adaptation for Robust Speech Recognition via Variational Autoencoder-Based Data Augmentation

Domain mismatch between training and testing can lead to significant deg...
research
04/03/2023

Unsupervised Cross-domain Pulmonary Nodule Detection without Source Data

Cross domain pulmonary nodule detection suffers from performance degrada...
research
08/30/2023

Semi-supervised Domain Adaptation with Inter and Intra-domain Mixing for Semantic Segmentation

Despite recent advances in semantic segmentation, an inevitable challeng...
research
09/18/2023

Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models

While Automatic Speech Recognition (ASR) systems are widely used in many...
research
04/17/2019

Hard Sample Mining for the Improved Retraining of Automatic Speech Recognition

It is an effective way that improves the performance of the existing Aut...
research
03/27/2022

Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition

Although deep learning-based end-to-end Automatic Speech Recognition (AS...

Please sign up or login with your details

Forgot password? Click here to reset