Frame Stacking and Retaining for Recurrent Neural Network Acoustic Model

05/17/2017
by   Xu Tian, et al.
0

Frame stacking is broadly applied in end-to-end neural network training like connectionist temporal classification (CTC), and it leads to more accurate models and faster decoding. However, it is not well-suited to conventional neural network based on context-dependent state acoustic model, if the decoder is unchanged. In this paper, we propose a novel frame retaining method which is applied in decoding. The system which combined frame retaining with frame stacking could reduces the time consumption of both training and decoding. Long short-term memory (LSTM) recurrent neural networks (RNNs) using it achieve almost linear training speedup and reduces relative 41% real time factor (RTF). At the same time, recognition performance is no degradation or improves sightly on Shenma voice search dataset in Mandarin.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/24/2015

Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition

We have recently shown that deep Long Short-Term Memory (LSTM) recurrent...
research
08/13/2020

Effect of Architectures and Training Methods on the Performance of Learned Video Frame Prediction

We analyze the performance of feedforward vs. recurrent neural network (...
research
08/01/2017

End-to-End Neural Segmental Models for Speech Recognition

Segmental models are an alternative to frame-based models for sequence p...
research
07/30/2023

RoseNNa: A performant, portable library for neural network inference with application to computational fluid dynamics

The rise of neural network-based machine learning ushered in high-level ...
research
02/04/2020

On the use of recurrent neural networks for predictions of turbulent flows

In this paper, the prediction capabilities of recurrent neural networks ...
research
12/31/2018

Predicting Aircraft Trajectories: A Deep Generative Convolutional Recurrent Neural Networks Approach

Reliable 4D aircraft trajectory prediction, whether in a real-time setti...
research
08/16/2018

Automatic Chord Recognition with Higher-Order Harmonic Language Modelling

Common temporal models for automatic chord recognition model chord chang...

Please sign up or login with your details

Forgot password? Click here to reset