A Telescopic Binary Learning Machine for Training Neural Networks

09/01/2015
by   Mauro Brunato, et al.
0

This paper proposes a new algorithm based on multi-scale stochastic local search with binary representation for training neural networks. In particular, we study the effects of neighborhood evaluation strategies, the effect of the number of bits per weight and that of the maximum weight range used for mapping binary strings to real values. Following this preliminary investigation, we propose a telescopic multi-scale version of local search where the number of bits is increased in an adaptive manner, leading to a faster search and to local minima of better quality. An analysis related to adapting the number of bits in a dynamic way is also presented. The control on the number of bits, which happens in a natural manner in the proposed method, is effective to increase the generalization performance. Benchmark tasks include a highly non-linear artificial problem, a control problem requiring either feed-forward or recurrent architectures for feedback control, and challenging real-world tasks in different application domains. The results demonstrate the effectiveness of the proposed method.

READ FULL TEXT

page 7

page 8

research
11/30/2019

A binary-activation, multi-level weight RNN and training algorithm for processing-in-memory inference with eNVM

We present a new algorithm for training neural networks with binary acti...
research
11/16/2018

Composite Binary Decomposition Networks

Binary neural networks have great resource and computing efficiency, whi...
research
12/14/2013

ECOC-Based Training of Neural Networks for Face Recognition

Error Correcting Output Codes, ECOC, is an output representation method ...
research
11/27/2014

Unweighted Stochastic Local Search can be Effective for Random CSP Benchmarks

We present ULSA, a novel stochastic local search algorithm for random bi...
research
02/27/2020

Improving the Performance of Stochastic Local Search for Maximum Vertex Weight Clique Problem Using Programming by Optimization

The maximum vertex weight clique problem (MVWCP) is an important general...
research
03/27/2023

A Linear Weight Transfer Rule for Local Search

The Divide and Distribute Fixed Weights algorithm (ddfw) is a dynamic lo...
research
10/03/2022

Random Weight Factorization Improves the Training of Continuous Neural Representations

Continuous neural representations have recently emerged as a powerful an...

Please sign up or login with your details

Forgot password? Click here to reset