Real-Time Edge Classification: Optimal Offloading under Token Bucket Constraints

10/26/2020
by   Ayan Chakrabarti, et al.
0

To deploy machine learning-based algorithms for real-time applications with strict latency constraints, we consider an edge-computing setting where a subset of inputs are offloaded to the edge for processing by an accurate but resource-intensive model, and the rest are processed only by a less-accurate model on the device itself. Both models have computational costs that match available compute resources, and process inputs with low-latency. But offloading incurs network delays, and to manage these delays to meet application deadlines, we use a token bucket to constrain the average rate and burst length of transmissions from the device. We introduce a Markov Decision Process-based framework to make offload decisions under these constraints, based on the local model's confidence and the token bucket state, with the goal of minimizing a specified error measure for the application. Beyond isolated decisions for individual devices, we also propose approaches to allow multiple devices connected to the same access switch to share their bursting allocation. We evaluate and analyze the policies derived using our framework on the standard ImageNet image classification benchmark.

READ FULL TEXT

page 2

page 19

research
01/27/2022

Resource Provisioning in Edge Computing for Latency Sensitive Applications

Low-Latency IoT applications such as autonomous vehicles, augmented/virt...
research
07/31/2022

Adaptive Edge Offloading for Image Classification Under Rate Limit

This paper considers a setting where embedded devices are used to acquir...
research
12/07/2020

Cost-effective Machine Learning Inference Offload for Edge Computing

Computing at the edge is increasingly important since a massive amount o...
research
02/09/2023

Differentially Private Deep Q-Learning for Pattern Privacy Preservation in MEC Offloading

Mobile edge computing (MEC) is a promising paradigm to meet the quality ...
research
04/14/2023

Spectrum-aware Multi-hop Task Routing in Vehicle-assisted Collaborative Edge Computing

Multi-access edge computing (MEC) is a promising technology to enhance t...
research
07/30/2021

An Edge-Based Resource Allocation Optimization for the Internet of Medical Things (IoMT)

As the number of Internet of Medical Things (IoMT) increases, the need f...
research
09/05/2022

To Compute or not to Compute? Adaptive Smart Sensing in Resource-Constrained Edge Computing

We consider a network of smart sensors for edge computing application th...

Please sign up or login with your details

Forgot password? Click here to reset