Branch-and-Pruning Optimization Towards Global Optimality in Deep Learning

04/05/2021
by Yuanwei Wu, et al.
Understanding global optimality in deep learning (DL) has recently attracted growing attention. However, conventional DL solvers were not designed to seek such global optimality. In this paper, we propose a novel approximation algorithm, BPGrad, for optimizing deep models globally via branch and pruning. BPGrad is based on the assumption of Lipschitz continuity in DL; as a result, it can adaptively determine the step size for the current gradient given the history of previous updates, such that theoretically no smaller step can achieve the global optimality. We prove that, by repeating such a branch-and-pruning procedure, we can locate the global optimality within finite iterations. Empirically, we also propose an efficient adaptive solver for DL based on BPGrad, and it outperforms conventional DL solvers such as Adagrad, Adadelta, RMSProp, and Adam on object recognition, detection, and segmentation tasks. The code is available at <https://github.com/RyanCV/BPGrad>.
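To make the step-size idea in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes a non-negative objective with a known Lipschitz constant, and it assumes the pruning rule takes the form "no point within radius (f(x) - rho * f_best) / L of x can have a value below rho * f_best", which is one way to read the Lipschitz argument. The names `lipschitz_step`, `bpgrad_sketch`, `rho`, and `lipschitz` are hypothetical and chosen for illustration only.

```python
import math


def lipschitz_step(f_current, f_best, lipschitz, rho):
    """Radius around the current point that the Lipschitz bound rules out.

    Assumption (not taken from the source): the target lower bound is
    rho * f_best with 0 < rho < 1, so any step shorter than this radius
    cannot reach a value below that bound.
    """
    return (f_current - rho * f_best) / lipschitz


def bpgrad_sketch(f, grad, x0, lipschitz, rho=0.5, iters=100):
    """Illustrative descent loop using the Lipschitz-derived step size."""
    x = list(x0)
    x_best, f_best = list(x), f(x)
    for _ in range(iters):
        fx = f(x)
        if fx < f_best:
            x_best, f_best = list(x), fx  # history of updates: best value so far
        g = grad(x)
        norm = math.sqrt(sum(gi * gi for gi in g))
        if norm == 0.0:
            break  # stationary point: no direction to move along
        step = lipschitz_step(fx, f_best, lipschitz, rho)
        # Move along the normalized negative gradient by the pruned radius.
        x = [xi - step * gi / norm for xi, gi in zip(x, g)]
    fx = f(x)
    if fx < f_best:
        x_best, f_best = list(x), fx
    return x_best, f_best


# Toy objective |x|, which is 1-Lipschitz and non-negative.
def toy_f(x):
    return abs(x[0])


def toy_grad(x):
    return [1.0 if x[0] > 0 else -1.0]


x_best, f_best = bpgrad_sketch(toy_f, toy_grad, [4.0], lipschitz=1.0)
```

On this toy problem each step halves the distance to the minimizer, because the step length shrinks in proportion to the current objective value. Real deep models do not come with a known Lipschitz constant, so in practice that value would have to be estimated or tuned.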

research
11/19/2017

BPGrad: Towards Global Optimality in Deep Learning via Branch and Pruning

Understanding the global optimality in deep learning (DL) has been attra...
research
09/29/2022

Is Complexity Required for Neural Network Pruning? A Case Study on Global Magnitude Pruning

Pruning neural networks has become popular in the last decade when it wa...
research
08/16/2016

Dynamic Network Surgery for Efficient DNNs

Deep learning has become a ubiquitous technology to improve machine inte...
research
04/26/2022

Attentive Fine-Grained Structured Sparsity for Image Restoration

Image restoration tasks have witnessed great performance improvement in ...
research
10/21/2019

A Complexity Efficient DMT-Optimal Tree Pruning Based Sphere Decoding

We present a diversity multiplexing tradeoff (DMT) optimal tree pruning ...
research
08/20/2021

DL-Traff: Survey and Benchmark of Deep Learning Models for Urban Traffic Prediction

Nowadays, with the rapid development of IoT (Internet of Things) and CPS...
research
02/20/2023

Toward Asymptotic Optimality: Sequential Unsupervised Regression of Density Ratio for Early Classification

Theoretically-inspired sequential density ratio estimation (SDRE) algori...
