Supervised Compression for Resource-constrained Edge Computing Systems

08/21/2021
by   Yoshitomo Matsubara, et al.
0

There has been much interest in deploying deep learning algorithms on low-powered devices, including smartphones, drones, and medical sensors. However, full-scale deep neural networks are often too resource-intensive in terms of energy and storage. As a result, the bulk part of the machine learning operation is therefore often carried out on an edge server, where the data is compressed and transmitted. However, compressing data (such as images) leads to transmitting information irrelevant to the supervised task. Another popular approach is to split the deep network between the device and the server while compressing intermediate features. To date, however, such split computing strategies have barely outperformed the aforementioned naive data compression baselines due to their inefficient approaches to feature compression. This paper adopts ideas from knowledge distillation and neural image compression to compress intermediate feature representations more efficiently. Our supervised compression approach uses a teacher model and a student model with a stochastic bottleneck and learnable prior for entropy coding. We compare our approach to various neural image and feature compression baselines in three vision tasks and found that it achieves better supervised rate-distortion performance while also maintaining smaller end-to-end latency. We furthermore show that the learned feature representations can be tuned to serve multiple downstream tasks.

READ FULL TEXT

page 4

page 8

research
03/16/2022

SC2: Supervised Compression for Split Computing

Split computing distributes the execution of a neural network (e.g., for...
research
04/15/2022

Feature Compression for Rate Constrained Object Detection on the Edge

Recent advances in computer vision has led to a growth of interest in de...
research
10/31/2019

BottleNet++: An End-to-End Approach for Feature Compression in Device-Edge Co-Inference Systems

The emergence of various intelligent mobile applications demands the dep...
research
09/17/2018

Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing

The recent advances of hardware technology have made the intelligent ana...
research
08/24/2022

A Low-Complexity Approach to Rate-Distortion Optimized Variable Bit-Rate Compression for Split DNN Computing

Split computing has emerged as a recent paradigm for implementation of D...
research
06/12/2022

Preprocessing Enhanced Image Compression for Machine Vision

Recently, more and more images are compressed and sent to the back-end d...
research
07/05/2022

Image Coding for Machines with Omnipotent Feature Learning

Image Coding for Machines (ICM) aims to compress images for AI tasks ana...

Please sign up or login with your details

Forgot password? Click here to reset