Language Modeling with Highway LSTM

09/19/2017
by   Gakuto Kurata, et al.

Language models (LMs) based on Long Short-Term Memory (LSTM) have shown good gains in many automatic speech recognition tasks. In this paper, we extend an LSTM by adding highway networks inside an LSTM and use the resulting Highway LSTM (HW-LSTM) model for language modeling. The added highway networks increase the depth in the time dimension. Since a typical LSTM has two internal states, a memory cell and a hidden state, we compare various types of HW-LSTM by adding highway networks onto the memory cell and/or the hidden state. Experimental results on English broadcast news and conversational telephone speech recognition show that the proposed HW-LSTM LM improves speech recognition accuracy on top of a strong LSTM LM baseline. We report 5.1% and 9.9% on the Switchboard and CallHome subsets of the Hub5 2000 evaluation, which reaches the best performance numbers reported on these tasks to date.
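The idea of placing highway networks on the memory cell and/or the hidden state can be illustrated with a small sketch. The abstract does not give the paper's equations, so everything below is an assumption: it uses the generic highway layer form y = t * tanh(W_h x + b_h) + (1 - t) * x with a sigmoid transform gate t, and applies it to each state after a standard LSTM update. The names `highway`, `lstm_step`, `hw_lstm_step`, `hw_h`, and `hw_c` are hypothetical and are not taken from the paper.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def affine(W, b, x):
    """Return W @ x + b for a list-of-rows matrix W and vectors b, x."""
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]


def highway(x, W_h, b_h, W_t, b_t):
    """Generic highway layer: y = t * tanh(W_h x + b_h) + (1 - t) * x."""
    h = [math.tanh(v) for v in affine(W_h, b_h, x)]
    t = [sigmoid(v) for v in affine(W_t, b_t, x)]  # transform gate
    return [ti * hi + (1.0 - ti) * xi for ti, hi, xi in zip(t, h, x)]


def lstm_step(x, h_prev, c_prev, p):
    """Standard LSTM step; p maps gate name -> (W, b) acting on [x; h_prev]."""
    z = x + h_prev  # list concatenation of input and previous hidden state
    i = [sigmoid(v) for v in affine(*p["i"], z)]
    f = [sigmoid(v) for v in affine(*p["f"], z)]
    o = [sigmoid(v) for v in affine(*p["o"], z)]
    g = [math.tanh(v) for v in affine(*p["g"], z)]
    c = [fi * ci + ii * gi for fi, ci, ii, gi in zip(f, c_prev, i, g)]
    h = [oi * math.tanh(ci) for oi, ci in zip(o, c)]
    return h, c


def hw_lstm_step(x, h_prev, c_prev, p, hw_h=None, hw_c=None):
    """LSTM step, then optional highway layers on h and/or c.

    Assumption: the highway layers are applied after the LSTM update;
    the paper's actual placement may differ.
    """
    h, c = lstm_step(x, h_prev, c_prev, p)
    if hw_c is not None:
        c = highway(c, *hw_c)
    if hw_h is not None:
        h = highway(h, *hw_h)
    return h, c


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
n_in, n_hid = 3, 4
params = {k: (rand_matrix(n_hid, n_in + n_hid, rng), [0.0] * n_hid) for k in "ifog"}
# Highway parameters: (W_h, b_h, W_t, b_t); a negative gate bias favors carrying x through.
hw = (rand_matrix(n_hid, n_hid, rng), [0.0] * n_hid,
      rand_matrix(n_hid, n_hid, rng), [-1.0] * n_hid)

h, c = [0.0] * n_hid, [0.0] * n_hid
for x in ([1.0, 0.0, 0.0], [0.0, 1.0, 0.0]):
    h, c = hw_lstm_step(x, h, c, params, hw_h=hw, hw_c=hw)
```

Passing only `hw_h`, only `hw_c`, or both corresponds loosely to the variants the abstract compares; passing neither reduces the step to a plain LSTM.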


research
02/05/2014

Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition

Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) archit...
research
11/07/2019

Boosting LSTM Performance Through Dynamic Precision Selection

The use of low numerical precision is a fundamental optimization include...
research
02/12/2021

Transformer Language Models with LSTM-based Cross-utterance Information Representation

The effective incorporation of cross-utterance information has the poten...
research
06/13/2017

Modelling prosodic structure using Artificial Neural Networks

The ability to accurately perceive whether a speaker is asking a questio...
research
06/18/2018

Semi-tied Units for Efficient Gating in LSTM and Highway Networks

Gating is a key technique used for integrating information from multiple...
research
06/05/2018

LSTM Benchmarks for Deep Learning Frameworks

This study provides benchmarks for different implementations of LSTM uni...
research
07/09/2019

Comparing the Performance of the LSTM and HMM Language Models via Structural Similarity

Language models based on deep neural networks and traditional stochastic...
