Deploy Large-Scale Deep Neural Networks in Resource Constrained IoT Devices with Local Quantization Region

05/24/2018
by Yi Yang, et al.

Implementing large-scale deep neural networks with high computational complexity on low-cost IoT devices may be constrained by limited computational resources, making it hard for the devices to respond in real time. This mismatch makes state-of-the-art deep learning algorithms, e.g. CNNs (Convolutional Neural Networks), incompatible with the IoT world. We present a low-bit scheme (ranging from 8-bit down to 1-bit) built on our local quantization region algorithm. We use models from the Caffe model zoo as example tasks to evaluate the effect of our low-precision data representation scheme. With local quantization regions available, we find that implementations built on these schemes largely retain model accuracy while also reducing computational complexity. For example, our 8-bit scheme shows no drop in top-1 or top-5 accuracy, with a 2x speedup on the Intel Edison IoT platform. Implementations based on our 4-bit, 2-bit, or 1-bit schemes are also applicable to IoT devices, with the advantage of low computational complexity. For example, the accuracy drop on our task is only 0.7%. Making low-bit schemes usable here opens a new door for further optimization on commodity IoT controllers; for example, extra speed-up could be achieved by replacing multiply-accumulate operations with the proposed table look-up operations. The whole study offers a new approach to relieving the challenge of bringing advanced deep learning algorithms to resource-constrained, low-cost IoT devices.
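The abstract does not spell out the local quantization region algorithm, so the following is only an illustrative sketch under stated assumptions: it assumes that a "local region" is a fixed-size block of values quantized uniformly against its own minimum and maximum, and that the table look-up replaces the product of two low-bit codes with an indexed read from a precomputed table. The function names `quantize_local`, `dequantize_local`, and `lut_dot` are hypothetical and do not come from the paper.

```python
import numpy as np

def quantize_local(x, bits, region_size):
    """Quantize a 1-D array with a separate [min, max] range per region.

    Assumption: len(x) is a multiple of region_size.
    """
    levels = (1 << bits) - 1                     # highest code value
    regions = x.reshape(-1, region_size)         # one row per local region
    lo = regions.min(axis=1, keepdims=True)
    hi = regions.max(axis=1, keepdims=True)
    # Constant regions get scale 1.0 to avoid dividing by zero.
    scale = np.where(hi > lo, (hi - lo) / levels, 1.0)
    codes = np.rint((regions - lo) / scale).astype(np.uint8)
    return codes, lo, scale

def dequantize_local(codes, lo, scale):
    """Map codes back to approximate real values, region by region."""
    return (codes * scale + lo).reshape(-1)

def lut_dot(ca, cw, bits):
    """Dot product of two code vectors using table reads instead of multiplies.

    The table is rebuilt here for clarity; a device would store it once.
    """
    n = 1 << bits
    table = np.outer(np.arange(n), np.arange(n))  # table[i, j] == i * j
    return int(table[ca, cw].sum())

# Usage: two regions with very different value ranges, quantized to 2 bits.
x = np.array([0.0, 0.1, 0.2, 0.3, 10.0, 20.0, 30.0, 40.0])
codes, lo, scale = quantize_local(x, bits=2, region_size=4)
x_hat = dequantize_local(codes, lo, scale)
```

Because each region carries its own offset and scale, small-valued regions are not crushed by the range of large-valued ones, which is one plausible reading of why local regions help retain accuracy at low bit widths. With 2-bit codes the product table has only 16 entries, which is what makes a look-up attractive on a small controller.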


research
07/11/2022

Sparsifying Binary Networks

Binary neural networks (BNNs) have demonstrated their ability to solve c...
research
09/01/2023

Yet another Improvement of Plantard Arithmetic for Faster Kyber on Low-end 32-bit IoT Devices

This paper presents another improved version of Plantard arithmetic that...
research
03/09/2022

Power-of-Two Quantization for Low Bitwidth and Hardware Compliant Neural Networks

Deploying Deep Neural Networks in low-power embedded devices for real ti...
research
04/02/2020

Efficient Online Classification and Tracking on Resource-constrained IoT Devices

Timely processing has been increasingly required on smart IoT devices, w...
research
08/03/2022

A Lightweight Transmission Parameter Selection Scheme Using Reinforcement Learning for LoRaWAN

The number of IoT devices is predicted to reach 125 billion by 2023. The...
research
06/07/2022

Enhancing Strong PUF Security with Non-monotonic Response Quantization

Strong physical unclonable functions (PUFs) provide a low-cost authentic...
research
08/20/2021

Quantization Backdoors to Deep Learning Models

There is currently a burgeoning demand for deploying deep learning (DL) ...
