A Convergence Analysis of Nonlinearly Constrained ADMM in Deep Learning

02/06/2019
by   Jinshan Zeng, et al.
0

Efficient training of deep neural networks (DNNs) is a challenge due to the associated highly nonconvex optimization. The alternating direction method of multipliers (ADMM) has attracted rising attention in deep learning for its potential of distributed computing. However, it remains an open problem to establish the convergence of ADMM in DNN training due to the nonlinear constraints involved. In this paper, we provide an answer to this problem by establishing the convergence of some nonlinearly constrained ADMM for DNNs with smooth activations. To be specific, we establish the global convergence to a Karush-Kuhn-Tucker (KKT) point at a O(1/k) rate. To achieve this goal, the key development lies in a new local linear approximation technique which enables us to overcome the hurdle of nonlinear constraints in ADMM for DNNs.

READ FULL TEXT
research
02/15/2018

Systematic Weight Pruning of DNNs using Alternating Direction Method of Multipliers

We present a systematic weight pruning framework of deep neural networks...
research
11/09/2022

Nonlinear Set Membership Filter with State Estimation Constraints via Consensus-ADMM

This paper considers the state estimation problem for nonlinear dynamic ...
research
02/09/2023

Constrained Empirical Risk Minimization: Theory and Practice

Deep Neural Networks (DNNs) are widely used for their ability to effecti...
research
12/22/2021

A Convergent ADMM Framework for Efficient Neural Network Training

As a well-known optimization framework, the Alternating Direction Method...
research
03/01/2018

Block Coordinate Descent for Deep Learning: Unified Convergence Guarantees

Training deep neural networks (DNNs) efficiently is a challenge due to t...
research
05/31/2019

ADMM for Efficient Deep Learning with Global Convergence

Alternating Direction Method of Multipliers (ADMM) has been used success...
research
12/05/2018

ECC: Energy-Constrained Deep Neural Network Compression via a Bilinear Regression Model

Many DNN-enabled vision applications constantly operate under severe ene...

Please sign up or login with your details

Forgot password? Click here to reset