Robust error bounds for quantised and pruned neural networks

11/30/2020
by   Jiaqi Li, et al.
0

With the rise of smartphones and the internet-of-things, data is increasingly getting generated at the edge on local, personal devices. For privacy, latency and energy saving reasons, this shift is causing machine learning algorithms to move towards a decentralised approach, with the data and algorithms stored and even trained locally on devices. The device hardware becomes the main bottleneck for model performance in this set-up, creating a need for slimmed down, more efficient neural networks. Neural network pruning and quantisation are two methods that have been developed to achieve this, with both approaches demonstrating impressive results in reducing the computational cost without sacrificing too much on model performance. However, our understanding behind these methods remains underdeveloped. To address this issue, a semi-definite program to robustly bound the error caused by pruning and quantising a neural network is introduced in this paper. The method can be applied to generic neural networks, accounts for the many nonlinearities of the problem and holds robustly for all inputs in specified sets. It is hoped that the computed bounds will give certainty to software/control/machine learning engineers implementing these algorithms efficiently on limited hardware.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/18/2021

Reduced-Order Neural Network Synthesis with Robustness Guarantees

In the wake of the explosive growth in smartphones and cyberphysical sys...
research
07/08/2023

Towards Efficient In-memory Computing Hardware for Quantized Neural Networks: State-of-the-art, Open Challenges and Perspectives

The amount of data processed in the cloud, the development of Internet-o...
research
03/25/2021

Prototype-based Personalized Pruning

Nowadays, as edge devices such as smartphones become prevalent, there ar...
research
11/03/2021

Communication-Efficient Separable Neural Network for Distributed Inference on Edge Devices

The inference of Neural Networks is usually restricted by the resources ...
research
06/15/2020

Now that I can see, I can improve: Enabling data-driven finetuning of CNNs on the edge

In today's world, a vast amount of data is being generated by edge devic...
research
11/05/2021

Frugal Machine Learning

Machine learning, already at the core of increasingly many systems and a...

Please sign up or login with your details

Forgot password? Click here to reset