AI on the Edge: Rethinking AI-based IoT Applications Using Specialized Edge Architectures

03/27/2020
by   Qianlin Liang, et al.
0

Edge computing has emerged as a popular paradigm for supporting mobile and IoT applications with low latency or high bandwidth needs. The attractiveness of edge computing has been further enhanced due to the recent availability of special-purpose hardware to accelerate specific compute tasks, such as deep learning inference, on edge nodes. In this paper, we experimentally compare the benefits and limitations of using specialized edge systems, built using edge accelerators, to more traditional forms of edge and cloud computing. Our experimental study using edge-based AI workloads shows that today's edge accelerators can provide comparable, and in many cases better, performance, when normalized for power or cost, than traditional edge and cloud servers. They also provide latency and bandwidth benefits for split processing, across and within tiers, when using model compression or model splitting, but require dynamic methods to determine the optimal split across tiers. We find that edge accelerators can support varying degrees of concurrency for multi-tenant inference applications, but lack isolation mechanisms necessary for edge cloud multi-tenant hosting.

READ FULL TEXT

page 2

page 5

research
09/16/2023

Exploration of TPUs for AI Applications

Tensor Processing Units (TPUs) are specialized hardware accelerators for...
research
07/26/2021

AI Multi-Tenancy on Edge: Concurrent Deep Learning Model Executions and Dynamic Model Placements on Edge Devices

Many real-world applications are widely adopting the edge computing para...
research
07/21/2020

AI Tax: The Hidden Cost of AI Data Center Applications

Artificial intelligence and machine learning are experiencing widespread...
research
07/04/2022

Sustainable AI Processing at the Edge

Edge computing is a popular target for accelerating machine learning alg...
research
08/05/2022

Hardless: A Generalized Serverless Compute Architecture for Hardware Processing Accelerators

The increasing use of hardware processing accelerators tailored for spec...
research
05/16/2023

Accelerating Communications in Federated Applications with Transparent Object Proxies

Advances in networks, accelerators, and cloud services encourage program...
research
07/17/2020

Klessydra-T: Designing Vector Coprocessors for Multi-Threaded Edge-Computing Cores

Convolutional computation kernels are fundamental to today's edge comput...

Please sign up or login with your details

Forgot password? Click here to reset