A network of deep neural networks for distant speech recognition

03/23/2017
by   Mirco Ravanelli, et al.
0

Despite the remarkable progress recently made in distant speech recognition, state-of-the-art technology still suffers from a lack of robustness, especially when adverse acoustic conditions characterized by non-stationary noises and reverberation are met. A prominent limitation of current systems lies in the lack of matching and communication between the various technologies involved in the distant speech recognition process. The speech enhancement and speech recognition modules are, for instance, often trained independently. Moreover, the speech enhancement normally helps the speech recognizer, but the output of the latter is not commonly used, in turn, to improve the speech enhancement. To address both concerns, we propose a novel architecture based on a network of deep neural networks, where all the components are jointly trained and better cooperate with each other thanks to a full communication scheme between them. Experiments, conducted using different datasets, tasks and acoustic conditions, revealed that the proposed framework can overtake other competitive solutions, including recent joint training approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/24/2017

Batch-normalized joint training for DNN-based distant speech recognition

Improving distant speech recognition is a crucial step towards flexible ...
research
10/10/2017

Contaminated speech training methods for robust DNN-HMM distant speech recognition

Despite the significant progress made in the last years, state-of-the-ar...
research
06/13/2018

A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition

Speech recognizers trained on close-talking speech do not generalize to ...
research
07/15/2022

Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments

This paper describes noisy speech recognition for an augmented reality h...
research
11/19/2018

The PyTorch-Kaldi Speech Recognition Toolkit

The availability of open-source software is playing a remarkable role in...
research
12/17/2017

Deep Learning for Distant Speech Recognition

Deep learning is an emerging technology that is considered one of the mo...
research
11/26/2017

Realistic multi-microphone data simulation for distant speech recognition

The availability of realistic simulated corpora is of key importance for...

Please sign up or login with your details

Forgot password? Click here to reset