Deep Recurrent Convolutional Neural Network: Improving Performance For Speech Recognition

11/22/2016
by   Zewang Zhang, et al.

Deep learning has been widely applied to sequence modeling problems. In automatic speech recognition (ASR), performance has improved significantly with larger speech corpora and deeper neural networks. Recurrent neural networks and deep convolutional neural networks in particular have been applied to ASR successfully. To address the resulting problem of training speed, we build a novel deep recurrent convolutional network for acoustic modeling and then apply deep residual learning to it. Our experiments show that it achieves not only faster convergence but also better recognition accuracy than a traditional deep convolutional recurrent network. In the experiments, we compare the convergence speed of our novel deep recurrent convolutional networks with that of traditional deep convolutional recurrent networks. Our networks converge faster while reaching comparable performance. We further show that applying deep residual learning speeds up the convergence of our deep recurrent convolutional networks. Finally, we evaluate all experimental networks by phoneme error rate (PER) using our proposed bidirectional statistical n-gram language model. The evaluation shows that our proposed deep recurrent convolutional network with deep residual learning reaches the best PER of 17.33% with the fastest convergence on the TIMIT database. This performance suggests that the network could be adopted for other sequential problems.
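The abstract reports results as phoneme error rate (PER). As a minimal sketch of how that metric is commonly computed, the code below takes the Levenshtein (edit) distance between reference and hypothesis phoneme sequences and divides by the total reference length. The function names and the toy phoneme sequences are illustrative assumptions, not the authors' evaluation code, and no TIMIT phoneme folding or language model rescoring is applied.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two phoneme sequences (lists of labels)."""
    # prev[j] holds the distance between the first i-1 items of ref
    # and the first j items of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if r == h else 1)
            deletion = prev[j] + 1       # phoneme in ref missing from hyp
            insertion = curr[j - 1] + 1  # extra phoneme in hyp
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


def phoneme_error_rate(references, hypotheses):
    """Total edit errors over total reference phonemes, across utterances."""
    total_errors = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    total_phonemes = sum(len(r) for r in references)
    if total_phonemes == 0:
        raise ValueError("references contain no phonemes")
    return total_errors / total_phonemes


# Toy example (hypothetical labels): one substitution and one deletion.
reference = ["sil", "k", "ae", "t", "sil"]
hypothesis = ["sil", "k", "ah", "t"]
per = phoneme_error_rate([reference], [hypothesis])
print(f"PER = {per:.2%}")
```

Under this definition, a PER of 17.33% corresponds to roughly 17 substitution, insertion, or deletion errors per 100 reference phonemes.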


