Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition

03/27/2018
by Ke Wang, et al.

We investigate the use of generative adversarial networks (GANs) in speech dereverberation for robust speech recognition. GANs have recently been studied for speech enhancement to remove additive noise, but no work has yet examined their ability in speech dereverberation, and the advantages of using GANs have not been fully established. In this paper, we provide an in-depth investigation of a GAN-based dereverberation front-end for automatic speech recognition (ASR). First, we study the effectiveness of different dereverberation networks (the generator in the GAN) and find that an LSTM yields a significant improvement over a feed-forward DNN and a CNN on our dataset. Second, adding residual connections to the deep LSTMs further boosts performance. Finally, we find that, for the GAN to succeed, it is important to update the generator and the discriminator using the same mini-batch data during training. Moreover, using the reverberant spectrogram as a condition for the discriminator, as suggested in previous studies, may degrade performance. In summary, our GAN-based dereverberation front-end achieves 14 to the baseline DNN dereverberation network when tested on a strong multi-condition training acoustic model.
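The finding about mini-batches can be illustrated with a minimal sketch of the training schedule. This is not the authors' implementation: the function name `train_same_minibatch`, the batch layout (reverberant features paired with clean features), the discriminator-then-generator ordering, and the two update callbacks are all assumptions made for illustration. The only point it shows is that both networks are updated on the same mini-batch before the next one is drawn.

```python
from typing import Callable, Iterable, List, Tuple

# Hypothetical batch layout: (reverberant features, clean features).
Batch = Tuple[list, list]
UpdateFn = Callable[[list, list], None]


def train_same_minibatch(
    batches: Iterable[Batch],
    update_discriminator: UpdateFn,
    update_generator: UpdateFn,
) -> List[Tuple[str, int]]:
    """Run one pass over the data, updating D and G on the same mini-batch.

    The update callbacks stand in for real optimizer steps; the order
    (discriminator first, then generator) is an assumption of this sketch.
    Returns a log of (network, batch index) pairs in the order applied.
    """
    log: List[Tuple[str, int]] = []
    for index, (reverberant, clean) in enumerate(batches):
        # Both updates consume the identical mini-batch.
        update_discriminator(reverberant, clean)
        log.append(("D", index))
        update_generator(reverberant, clean)
        log.append(("G", index))
    return log
```

With real models, the callbacks would wrap the discriminator and generator optimizer steps. The contrasting schedule, which the abstract reports as less successful, would draw a fresh mini-batch for the generator update.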


research
11/15/2017

Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition

We investigate the effectiveness of generative adversarial networks (GAN...
research
05/02/2018

Boosting Noise Robustness of Acoustic Model via Deep Adversarial Training

In realistic environments, speech is usually interfered by various noise...
research
03/10/2021

Fine-tuning of Pre-trained End-to-end Speech Recognition with Generative Adversarial Networks

Adversarial training of end-to-end (E2E) ASR systems using generative ad...
research
06/13/2020

Dynamic Attention Based Generative Adversarial Network with Phase Post-Processing for Speech Enhancement

The generative adversarial networks (GANs) have facilitated the developm...
research
04/08/2019

Completely Unsupervised Phoneme Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Models

Producing a large annotated speech corpus for training ASR systems remai...
research
08/26/2019

End-to-End Conditional GAN-based Architectures for Image Colourisation

In this work recent advances in conditional adversarial networks are inv...
research
04/07/2023

Correcting Model Misspecification via Generative Adversarial Networks

Machine learning models are often misspecified in the likelihood, which ...
