Enhancing Unsupervised Speech Recognition with Diffusion GANs

03/23/2023
by Xianchao Wu, et al.

We enhance the vanilla adversarial training method for unsupervised Automatic Speech Recognition (ASR) with a diffusion-GAN. Our model (1) injects instance noise of various intensities into both the generator's output and unlabeled reference text, the latter sampled from pretrained phoneme language models under a length constraint, (2) asks diffusion timestep-dependent discriminators to separate them, and (3) back-propagates the gradients to update the generator. Word/phoneme error rate comparisons with wav2vec-U under Librispeech (3.1 test-clean and 5.6 [...] enhancement strategies work effectively.
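Steps (1) and (2) can be illustrated with a minimal NumPy sketch. It assumes the standard Gaussian forward-diffusion form with a linear noise schedule; the abstract does not state the authors' schedule, tensor shapes, or discriminator architecture, so `make_alpha_bar`, `diffuse`, `toy_discriminator`, and all sizes below are hypothetical names and values chosen only for illustration. Step (3), back-propagation into the generator, is omitted.

```python
import numpy as np

def make_alpha_bar(num_steps, beta_start=1e-4, beta_end=2e-2):
    # Assumed linear beta schedule; returns the cumulative signal-retention factor per timestep.
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def diffuse(x, t, alpha_bar, rng):
    # Inject Gaussian instance noise whose intensity grows with the timestep t.
    a = alpha_bar[t].reshape(-1, 1, 1)
    noise = rng.standard_normal(x.shape)
    return np.sqrt(a) * x + np.sqrt(1.0 - a) * noise

def toy_discriminator(y, t, w, num_steps):
    # Hypothetical timestep-dependent discriminator: a logistic score over
    # mean-pooled features concatenated with the normalized timestep.
    feats = np.concatenate([y.mean(axis=1), (t / num_steps)[:, None]], axis=1)
    return 1.0 / (1.0 + np.exp(-(feats @ w)))

rng = np.random.default_rng(0)
num_steps = 10
alpha_bar = make_alpha_bar(num_steps)

batch, length, num_phonemes = 4, 6, 5  # illustrative sizes only

# Stand-in for the generator's output: per-frame phoneme distributions.
logits = rng.standard_normal((batch, length, num_phonemes))
fake = np.exp(logits) / np.exp(logits).sum(axis=-1, keepdims=True)

# Stand-in for reference text: one-hot phoneme sequences of a fixed length.
real_ids = rng.integers(0, num_phonemes, size=(batch, length))
real = np.eye(num_phonemes)[real_ids]

# One random timestep per sequence, shared by the real and generated sample.
t = rng.integers(0, num_steps, size=batch)
noisy_fake = diffuse(fake, t, alpha_bar, rng)
noisy_real = diffuse(real, t, alpha_bar, rng)

# The discriminator sees the noisy samples together with their timestep.
w = rng.standard_normal(num_phonemes + 1)
score_fake = toy_discriminator(noisy_fake, t, w, num_steps)
score_real = toy_discriminator(noisy_real, t, w, num_steps)
```

Applying the same timestep to both the generated and the reference sample means the discriminator compares the two at the same noise level; whether the paper pairs timesteps this way is an assumption of this sketch.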


