An Inexact Augmented Lagrangian Algorithm for Training Leaky ReLU Neural Network with Group Sparsity

05/11/2022
by Wei Liu, et al.

The leaky ReLU network with a group sparse regularization term has been widely used in recent years. However, training such a network yields a nonsmooth nonconvex optimization problem, and there is a lack of approaches for computing a stationary point deterministically. In this paper, we first resolve the multi-layer composite term in the original optimization problem by introducing auxiliary variables and additional constraints. We show that the new model has a nonempty and bounded solution set and that its feasible set satisfies the Mangasarian-Fromovitz constraint qualification. Moreover, we show the relationship between the new model and the original problem. We then propose an inexact augmented Lagrangian algorithm for solving the new model and show that the algorithm converges to a KKT point. Numerical experiments demonstrate that our algorithm is more efficient for training sparse leaky ReLU neural networks than some well-known algorithms.
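
The ingredients named in the abstract can be illustrated with a minimal sketch. It is not the authors' formulation. The function names, the slope `alpha`, the choice of weight-matrix columns as groups, and the single-layer constraint `u = leaky_relu(W x + b)` are all assumptions made for illustration, and the augmented Lagrangian term is the standard textbook form rather than the one analyzed in the paper.

```python
import math

def leaky_relu(x, alpha=0.01):
    """Leaky ReLU: x for x >= 0, alpha * x otherwise (alpha is an assumed slope)."""
    return x if x >= 0 else alpha * x

def group_sparse_penalty(W):
    """Sum of Euclidean norms of the columns of W (an l2,1-type penalty).
    Treating each column as one group is an assumption for illustration."""
    n_cols = len(W[0])
    return sum(math.sqrt(sum(row[j] ** 2 for row in W)) for j in range(n_cols))

def constraint_residual(W, b, x, u, alpha=0.01):
    """Residual u - leaky_relu(W x + b) of a hypothetical one-layer
    auxiliary-variable constraint; zero means the constraint holds."""
    res = []
    for i, row in enumerate(W):
        pre = sum(w * xi for w, xi in zip(row, x)) + b[i]
        res.append(u[i] - leaky_relu(pre, alpha))
    return res

def augmented_lagrangian_term(res, lam, rho):
    """Textbook augmented Lagrangian term: <lam, res> + (rho / 2) * ||res||^2."""
    linear = sum(l * r for l, r in zip(lam, res))
    quadratic = 0.5 * rho * sum(r * r for r in res)
    return linear + quadratic
```

Introducing the auxiliary variable `u` moves the nested activation out of the objective and into a constraint, so that a multiplier `lam` and a penalty parameter `rho` can be applied to the residual instead of differentiating through every layer at once.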


research
02/02/2022

Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model Classes and Cone Decompositions

We develop fast algorithms and robust software for convex optimization o...
research
08/25/2021

A New Insight on Augmented Lagrangian Method and Its Extensions

Motivated by the recent work [He-Yuan, Balanced Augmented Lagrangian Met...
research
04/15/2020

Augmented Lagrangian preconditioners for the Oseen-Frank model of cholesteric liquid crystals

We propose a robust and efficient augmented Lagrangian-type precondition...
research
06/05/2023

Does a sparse ReLU network training problem always admit an optimum?

Given a training set, a loss function, and a neural network architecture...
research
01/19/2022

Multiblock ADMM for nonsmooth nonconvex optimization with nonlinear coupling constraints

This paper considers a multiblock nonsmooth nonconvex optimization probl...
research
07/28/2023

Weighted variation spaces and approximation by shallow ReLU networks

We investigate the approximation of functions f on a bounded domain Ω⊂ℝ^...
research
08/08/2023

Minimizing Quotient Regularization Model

Quotient regularization models (QRMs) are a class of powerful regulariza...
