IndyLSTMs: Independently Recurrent LSTMs

03/19/2019
by Pedro Gonnet, et al.

We introduce Independently Recurrent Long Short-term Memory cells: IndyLSTMs. These differ from regular LSTM cells in that the recurrent weights are modeled not as a full matrix but as a diagonal matrix, i.e. the output and state of each LSTM cell depend only on the inputs and on the cell's own output/state, as opposed to the inputs and the outputs/states of all the cells in the layer. The number of parameters per IndyLSTM layer, and thus the number of FLOPS per evaluation, is linear in the number of nodes in the layer, as opposed to quadratic for regular LSTM layers, resulting in potentially both smaller and faster models. We evaluate their performance experimentally by training several models on the popular IAMOnDB and CASIA online handwriting datasets, as well as on several of our in-house datasets. We show that IndyLSTMs, despite their smaller size, consistently outperform regular LSTMs both in terms of accuracy per parameter and in best accuracy overall. We attribute this improved performance to the IndyLSTMs being less prone to overfitting.
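
The difference between a full and a diagonal recurrent matrix can be sketched in a few lines of plain Python. This is a minimal illustration, not the authors' implementation: the abstract does not spell out the gate equations, so the sketch assumes the common LSTM formulation without peephole connections (forget, input and output gates plus a tanh candidate), and all function and parameter names (`lstm_param_count`, `indylstm_param_count`, `indylstm_step`, `params`) are hypothetical.

```python
import math


def lstm_param_count(num_inputs, num_units):
    """Regular LSTM layer: each of the 4 gates has an input matrix W,
    a full recurrent matrix U (num_units x num_units) and a bias b."""
    per_gate = num_units * num_inputs + num_units * num_units + num_units
    return 4 * per_gate


def indylstm_param_count(num_inputs, num_units):
    """IndyLSTM layer: the recurrent matrix U becomes a diagonal,
    i.e. a vector u of length num_units (assumed formulation)."""
    per_gate = num_units * num_inputs + num_units + num_units
    return 4 * per_gate


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def indylstm_step(x, h_prev, c_prev, params):
    """One time step of an IndyLSTM layer.

    params maps each gate name ('f', 'i', 'o', 'c') to a tuple (W, u, b):
    W is a list of num_units rows of length num_inputs, while u and b are
    lists of length num_units. Returns the new (h, c) as lists.
    """
    def preactivation(gate, k):
        W, u, b = params[gate]
        # Unit k sees the whole input x, but only its OWN previous output.
        return sum(w * xi for w, xi in zip(W[k], x)) + u[k] * h_prev[k] + b[k]

    h, c = [], []
    for k in range(len(h_prev)):
        f = sigmoid(preactivation("f", k))    # forget gate
        i = sigmoid(preactivation("i", k))    # input gate
        o = sigmoid(preactivation("o", k))    # output gate
        g = math.tanh(preactivation("c", k))  # candidate state
        c_k = f * c_prev[k] + i * g
        c.append(c_k)
        h.append(o * math.tanh(c_k))
    return h, c
```

Under these assumptions, a layer with 10 inputs and 100 units has `lstm_param_count(10, 100) == 44400` parameters against `indylstm_param_count(10, 100) == 4800`. The input weights still scale with the input size, so the linear growth claimed above refers to the number of units for a fixed input width.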

research
03/31/2017

Factorization tricks for LSTM networks

We present two simple ways of reducing the number of parameters and acce...
research
05/30/2018

Grow and Prune Compact, Fast, and Accurate LSTMs

Long short-term memory (LSTM) has been widely used for sequential data m...
research
01/31/2018

Nested LSTMs

We propose Nested LSTMs (NLSTM), a novel RNN architecture with multiple ...
research
08/16/2015

Depth-Gated LSTM

In this short note, we present an extension of long short-term memory (L...
research
09/15/2017

Learning Intrinsic Sparse Structures within Long Short-Term Memory

Model compression is significant for the wide adoption of Recurrent Neur...
research
02/22/2021

The FaCells. An Exploratory Study about LSTM Layers on Face Sketches Classifiers

Lines are human mental abstractions. A bunch of lines may form a drawing...
