On the Initialization of Long Short-Term Memory Networks

12/22/2019
by   Mostafa Mehdipour Ghazi, et al.
28

Weight initialization is important for faster convergence and stability of deep neural networks training. In this paper, a robust initialization method is developed to address the training instability in long short-term memory (LSTM) networks. It is based on a normalized random initialization of the network weights that aims at preserving the variance of the network input and output in the same range. The method is applied to standard LSTMs for univariate time series regression and to LSTMs robust to missing values for multivariate disease progression modeling. The results show that in all cases, the proposed initialization method outperforms the state-of-the-art initialization techniques in terms of training convergence and generalization performance of the obtained solution.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/06/2023

Deep Long-Short Term Memory networks: Stability properties and Experimental validation

The aim of this work is to investigate the use of Incrementally Input-to...
research
08/19/2020

Demand Forecasting using Long Short-Term Memory Neural Networks

In this paper we investigate to what extent long short-term memory neura...
research
11/11/2019

Supervised Initialization of LSTM Networks for Fundamental Frequency Detection in Noisy Speech Signals

Fundamental frequency is one of the most important parameters of human s...
research
06/15/2021

Economic Nowcasting with Long Short-Term Memory Artificial Neural Networks (LSTM)

Artificial neural networks (ANNs) have been the catalyst to numerous adv...
research
05/05/2020

Long short-term memory networks and laglasso for bond yield forecasting: Peeping inside the black box

Modern decision-making in fixed income asset management benefits from in...
research
08/16/2018

Robust training of recurrent neural networks to handle missing data for disease progression modeling

Disease progression modeling (DPM) using longitudinal data is a challeng...
research
01/27/2023

A critical look at deep neural network for dynamic system modeling

Neural network models become increasingly popular as dynamic modeling to...

Please sign up or login with your details

Forgot password? Click here to reset