Training Recurrent Neural Networks by Sequential Least Squares and the Alternating Direction Method of Multipliers

12/31/2021
by   Alberto Bemporad, et al.
0

For training recurrent neural network models of nonlinear dynamical systems from an input/output training dataset based on rather arbitrary convex and twice-differentiable loss functions and regularization terms, we propose the use of sequential least squares for determining the optimal network parameters and hidden states. In addition, to handle non-smooth regularization terms such as L1, L0, and group-Lasso regularizers, as well as to impose possibly non-convex constraints such as integer and mixed-integer constraints, we combine sequential least squares with the alternating direction method of multipliers (ADMM). The performance of the resulting algorithm, that we call NAILS (Nonconvex ADMM Iterations and Least Squares), is tested in a nonlinear system identification benchmark.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/04/2021

Recurrent Neural Network Training with Convex Loss and Regularization Functions by Extended Kalman Filtering

We investigate the use of extended Kalman filtering to train recurrent n...
research
02/10/2021

A Framework of Inertial Alternating Direction Method of Multipliers for Non-Convex Non-Smooth Optimization

In this paper, we propose an algorithmic framework dubbed inertial alter...
research
09/06/2020

An Analysis of Alternating Direction Method of Multipliers for Feed-forward Neural Networks

In this work, we present a hardware compatible neural network training a...
research
07/29/2021

Distributed Identification of Contracting and/or Monotone Network Dynamics

This paper proposes methods for identification of large-scale networked ...
research
08/15/2019

Discretely-constrained deep network for weakly supervised segmentation

An efficient strategy for weakly-supervised segmentation is to impose co...
research
10/04/2021

An AO-ADMM approach to constraining PARAFAC2 on all modes

Analyzing multi-way measurements with variations across one mode of the ...
research
01/13/2022

Recursive Least Squares Policy Control with Echo State Network

The echo state network (ESN) is a special type of recurrent neural netwo...

Please sign up or login with your details

Forgot password? Click here to reset