On Extended Long Short-term Memory and Dependent Bidirectional Recurrent Neural Network

02/27/2018
by   Yuanhang Su, et al.
0

In this work, we investigate the memory capability of recurrent neural networks (RNNs), where this capability is defined as a function that maps an element in a sequence to the current output. We first analyze the system function of a recurrent neural network (RNN) cell, and provide analytical results for three RNNs. They are the simple recurrent neural network (SRN), the long short-term memory (LSTM), and the gated recurrent unit (GRU). Based on the analysis, we propose a new design to extend the memory length of a cell, and call it the extended long short-term memory (ELSTM). Next, we present a dependent bidirectional recurrent neural network (DBRNN) for the sequence-in-sequence-out (SISO) problem, which is more robust to previous erroneous predictions. Extensive experiments are carried out on different language tasks to demonstrate the superiority of our proposed ELSTM and DBRNN solutions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/22/2023

Recurrent Neural Networks and Long Short-Term Memory Networks: Tutorial and Survey

This is a tutorial paper on Recurrent Neural Network (RNN), Long Short-T...
research
09/12/2019

Understanding LSTM – a tutorial into Long Short-Term Memory Recurrent Neural Networks

Long Short-Term Memory Recurrent Neural Networks (LSTM-RNN) are one of t...
research
05/28/2020

Learning Various Length Dependence by Dual Recurrent Neural Networks

Recurrent neural networks (RNNs) are widely used as a memory model for s...
research
12/14/2016

Real-time interactive sequence generation and control with Recurrent Neural Network ensembles

Recurrent Neural Networks (RNN), particularly Long Short Term Memory (LS...
research
08/24/2017

Learning the Enigma with Recurrent Neural Networks

Recurrent neural networks (RNNs) represent the state of the art in trans...
research
10/10/2017

Optimizing Long Short-Term Memory Recurrent Neural Networks Using Ant Colony Optimization to Predict Turbine Engine Vibration

This article expands on research that has been done to develop a recurre...
research
12/06/2019

Knowledge extraction from the learning of sequences in a long short term memory (LSTM) architecture

We introduce a general method to extract knowledge from a recurrent neur...

Please sign up or login with your details

Forgot password? Click here to reset