Recent Progresses in Deep Learning based Acoustic Models (Updated)

04/25/2018
by Dong Yu, et al.

In this paper, we summarize recent progress in deep learning based acoustic models, along with the motivation and insights behind the surveyed techniques. We first discuss acoustic models that can effectively exploit variable-length contextual information, such as recurrent neural networks (RNNs), convolutional neural networks (CNNs), and their various combinations with other models. We then describe acoustic models that are optimized end-to-end, with emphasis on feature representations learned jointly with the rest of the system, the connectionist temporal classification (CTC) criterion, and the attention-based sequence-to-sequence model. We further illustrate robustness issues in speech recognition systems, and discuss acoustic model adaptation, speech enhancement and separation, and robust training strategies. We also cover modeling techniques that lead to more efficient decoding and discuss possible future directions in acoustic model research.


