Tikhonov Regularization for Long Short-Term Memory Networks

08/09/2017
by Andrei Turkin, et al.

It is well known that adding noise to the input data often improves network performance. While the dropout technique can cause memory loss when applied to recurrent connections, Tikhonov regularization, which can be regarded as training with additive noise, avoids this issue naturally, though it requires deriving the regularizer for each architecture. For feedforward neural networks this derivation is straightforward, but for networks with recurrent connections and complicated layers it leads to some difficulties. In this paper, a Tikhonov regularizer is derived for Long Short-Term Memory (LSTM) networks. Although the regularizer is made independent of time for simplicity, it accounts for the interactions between the weights of the LSTM unit, which in theory makes it possible to regularize a unit with complicated dependencies using only one parameter that measures the magnitude of the input perturbation. The proposed regularizer has three parameters: one that controls the regularization process, and two others that maintain computational stability while the network is being trained. The theory developed in this paper can be applied to derive analogous regularizers for other recurrent neural networks with Hadamard products and Lipschitz-continuous functions.
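To make the additive-noise view concrete, below is a minimal PyTorch sketch of training an LSTM with Gaussian input perturbations, the stochastic procedure that Tikhonov regularization emulates in expectation. This is not the paper's implementation; the class name, the `noise_std` parameter, and the classification head are illustrative assumptions.

```python
import torch
import torch.nn as nn

class NoisyInputLSTM(nn.Module):
    """LSTM classifier trained with additive Gaussian input noise.

    Training with such perturbations is the stochastic counterpart of
    Tikhonov regularization: the noise scale plays the role of the single
    parameter that measures the input perturbation.
    """

    def __init__(self, input_size, hidden_size, num_classes, noise_std=0.1):
        super().__init__()
        self.lstm = nn.LSTM(input_size, hidden_size, batch_first=True)
        self.head = nn.Linear(hidden_size, num_classes)
        self.noise_std = noise_std  # hypothetical perturbation magnitude

    def forward(self, x):
        # Perturb inputs only during training; evaluation uses clean data.
        if self.training and self.noise_std > 0:
            x = x + self.noise_std * torch.randn_like(x)
        out, _ = self.lstm(x)          # out: (batch, seq_len, hidden_size)
        return self.head(out[:, -1])   # classify from the last time step


# Usage sketch: one training step on random data.
model = NoisyInputLSTM(input_size=8, hidden_size=32, num_classes=4)
x = torch.randn(16, 20, 8)             # (batch, seq_len, input_size)
y = torch.randint(0, 4, (16,))
loss = nn.functional.cross_entropy(model(x), y)
loss.backward()
```

A deterministic alternative, closer in spirit to the regularizer derived in the paper, would replace the sampled noise with an explicit penalty on the network's sensitivity to input perturbations, expressed through the LSTM unit's weights.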


