An improved DNN-based spectral feature mapping that removes noise and reverberation for robust automatic speech recognition

03/23/2018
by Juan Pablo Escudero, et al.

Reverberation and additive noise have detrimental effects on the performance of automatic speech recognition systems. In this paper we explore the ability of a DNN-based spectral feature mapping to remove the effects of reverberation and additive noise. Experiments with the CHiME-2 database show that this DNN can achieve an average reduction in WER of 4.5% compared with the baseline system at SNRs of -6 dB, -3 dB, 0 dB and 3 dB, and just 0.8% at SNRs of 6 dB and 9 dB. These results suggest that this DNN is more effective at removing additive noise than reverberation. To improve the DNN performance, we combine it with the weighted prediction error (WPE) method, which shows complementary behavior. This combination provided a reduction in WER of approximately 11%, which is not as great as that obtained using WPE alone. However, after modifications to the DNN training process were applied, an average reduction in WER of 18.3% was obtained. The improved DNN combined with WPE achieves a reduction in WER of 7.9% compared with WPE alone.


