Binary Search and First Order Gradient Based Method for Stochastic Optimization

07/27/2020
by   Vijay Pandey, et al.

In this paper, we present a novel stochastic optimization method that combines the binary search technique with a first-order gradient-based optimization method, called Binary Search Gradient Optimization (BSG) or BiGrad. In this setup, a non-convex surface is treated as a set of convex surfaces. BSG first defines a region and assumes that it is convex. If the region is not convex, the algorithm quickly leaves it and defines a new one; otherwise, it tries to converge to the optimal point of the region. The core purpose of binary search in BSG is to decide, in logarithmic time, whether a region is convex, whereas the first-order gradient-based method is applied primarily to define a new region. In this paper, Adam is used as the first-order gradient-based method; other methods of this class may also be considered. In a deep neural network setup, BSG handles the problem of vanishing and exploding gradients efficiently. We evaluate BSG on the MNIST handwritten digit, IMDB, and CIFAR10 data sets, using logistic regression and deep neural networks. In these experiments, BSG produces more promising results than other first-order gradient-based optimization methods. Furthermore, the proposed algorithm generalizes significantly better on unseen data than the other methods.
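The abstract does not give the algorithm's details, so the following is only a one-dimensional toy sketch of the general idea of bisecting on the gradient sign inside a region assumed to be convex; it is not the BSG procedure from the paper. The function name `bisect_minimize`, its parameters, and the example objective are all hypothetical. If the gradient signs at the endpoints do not bracket a minimum, the sketch returns `None`, which stands in for "leave the region and define a new one" (the step the paper assigns to a first-order method such as Adam).

```python
def bisect_minimize(grad, lo, hi, tol=1e-8, max_iter=200):
    """Bisect on the gradient sign inside [lo, hi] (hypothetical helper).

    Returns an approximate stationary point, or None when the endpoint
    gradients do not bracket a minimum (grad(lo) < 0 < grad(hi) fails).
    """
    g_lo, g_hi = grad(lo), grad(hi)
    if not (g_lo < 0.0 < g_hi):
        # Region rejected: a caller would pick a new region here.
        return None
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        g_mid = grad(mid)
        if g_mid == 0.0 or hi - lo < tol:
            return mid
        if g_mid < 0.0:
            lo = mid  # minimum lies to the right of mid
        else:
            hi = mid  # minimum lies to the left of mid
    return 0.5 * (lo + hi)


def grad_f(x):
    # Gradient of the illustrative objective f(x) = (x - 3)^2.
    return 2.0 * (x - 3.0)


x_star = bisect_minimize(grad_f, 0.0, 10.0)    # region brackets the minimum
rejected = bisect_minimize(grad_f, 4.0, 10.0)  # gradient is positive at both ends
```

Each bisection step halves the interval, which is where the logarithmic cost mentioned in the abstract would come from in this simplified setting.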

research
02/29/2020

Conjugate-gradient-based Adam for stochastic optimization and its application to deep learning

This paper proposes a conjugate-gradient-based Adam algorithm blending A...
research
04/30/2018

optimParallel: an R Package Providing Parallel Versions of the Gradient-Based Optimization Methods of optim()

The R package optimParallel provides a parallel version of the gradient-...
research
07/24/2019

Robust and Communication-Efficient Collaborative Learning

We consider a decentralized learning problem, where a set of computing n...
research
11/18/2019

A Graph Autoencoder Approach to Causal Structure Learning

Causal structure learning has been a challenging task in the past decade...
research
02/17/2012

Extended Mixture of MLP Experts by Hybrid of Conjugate Gradient Method and Modified Cuckoo Search

This paper investigates a new method for improving the learning algorith...
research
12/02/2019

An Attention-Based Speaker Naming Method for Online Adaptation in Non-Fixed Scenarios

A speaker naming task, which finds and identifies the active speaker in ...
research
01/19/2015

Microscopic Advances with Large-Scale Learning: Stochastic Optimization for Cryo-EM

Determining the 3D structures of biological molecules is a key problem f...
