NeuPart: Using Analytical Models to Drive Energy-Efficient Partitioning of CNN Computations on Cloud-Connected Mobile Clients

05/09/2019
by   Susmita Dey Manasi, et al.
0

Data processing on convolutional neural networks (CNNs) places a heavy burden on energy-constrained mobile platforms. This work optimizes energy on a mobile client by partitioning CNN computations between in situ processing on the client and offloaded computations in the cloud. A new analytical CNN energy model is formulated, capturing all major components of the in situ computation, for ASIC-based deep learning accelerators. The model is benchmarked against measured silicon data. The analytical framework is used to determine the energy optimal partition point between the client and the cloud at runtime. On standard CNN topologies, partitioned computation is demonstrated to provide significant energy savings on the client over fully cloud-based or fully in situ computation. For example, at 60 Mbps bit rate and 0.5 W transmission power, the optimal partition for AlexNet [SqueezeNet] saves up to 47.4 energy over fully cloud-based computation, and 31.3 in situ computation.

READ FULL TEXT

page 1

page 13

research
10/15/2017

NeuralPower: Predict and Deploy Energy-Efficient Convolutional Neural Networks

"How much energy is consumed for an inference made by a convolutional ne...
research
11/26/2020

Energy Drain of the Object Detection Processing Pipeline for Mobile Devices: Analysis and Implications

Applying deep learning to object detection provides the capability to ac...
research
11/29/2018

Composable secure multi-client delegated quantum computation

The engineering challenges involved in building large scale quantum comp...
research
06/14/2016

A Systematic Approach to Blocking Convolutional Neural Networks

Convolutional Neural Networks (CNNs) are the state of the art solution f...
research
01/25/2018

JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services

Deep neural networks are among the most influential architectures of dee...
research
03/10/2018

Newton: Gravitating Towards the Physical Limits of Crossbar Acceleration

Many recent works have designed accelerators for Convolutional Neural Ne...
research
09/29/2021

Partitioning Cloud-based Microservices (via Deep Learning)

Cloud-based software has many advantages. When services are divided into...

Please sign up or login with your details

Forgot password? Click here to reset