Long Short-Term Memory based Convolutional Recurrent Neural Networks for Large Vocabulary Speech Recognition

10/11/2016
by   Xiangang Li, et al.
0

Long short-term memory (LSTM) recurrent neural networks (RNNs) have been shown to give state-of-the-art performance on many speech recognition tasks, as they are able to provide the learned dynamically changing contextual window of all sequence history. On the other hand, the convolutional neural networks (CNNs) have brought significant improvements to deep feed-forward neural networks (FFNNs), as they are able to better reduce spectral variation in the input signal. In this paper, a network architecture called as convolutional recurrent neural network (CRNN) is proposed by combining the CNN and LSTM RNN. In the proposed CRNNs, each speech frame, without adjacent context frames, is organized as a number of local feature patches along the frequency axis, and then a LSTM network is performed on each feature patch along the time axis. We train and compare FFNNs, LSTM RNNs and the proposed LSTM CRNNs at various number of configurations. Experimental results show that the LSTM CRNNs can exceed state-of-the-art speech recognition performance.

READ FULL TEXT
research
02/05/2014

Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition

Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) archit...
research
12/30/2019

Text Steganalysis with Attentional LSTM-CNN

With the rapid development of Natural Language Processing (NLP) technolo...
research
05/03/2022

Biometric Signature Verification Using Recurrent Neural Networks

Architectures based on Recurrent Neural Networks (RNNs) have been succes...
research
03/31/2015

Beyond Short Snippets: Deep Networks for Video Classification

Convolutional neural networks (CNNs) have been extensively applied for i...
research
05/28/2022

Go Beyond Multiple Instance Neural Networks: Deep-learning Models based on Local Pattern Aggregation

Deep convolutional neural networks (CNNs) have brought breakthroughs in ...
research
09/09/2019

Sequential Convolutional Recurrent Neural Networks for Fast Automatic Modulation Classification

A novel and efficient end-to-end learning model for automatic modulation...
research
11/27/2018

Are 2D-LSTM really dead for offline text recognition?

There is a recent trend in handwritten text recognition with deep neural...

Please sign up or login with your details

Forgot password? Click here to reset