Auto-Split: A General Framework of Collaborative Edge-Cloud AI

08/30/2021
by   Amin Banitalebi-Dehkordi, et al.
0

In many industry scale applications, large and resource consuming machine learning models reside in powerful cloud servers. At the same time, large amounts of input data are collected at the edge of cloud. The inference results are also communicated to users or passed to downstream tasks at the edge. The edge often consists of a large number of low-power devices. It is a big challenge to design industry products to support sophisticated deep model deployment and conduct model inference in an efficient manner so that the model accuracy remains high and the end-to-end latency is kept low. This paper describes the techniques and engineering practice behind Auto-Split, an edge-cloud collaborative prototype of Huawei Cloud. This patented technology is already validated on selected applications, is on its way for broader systematic edge-cloud application integration, and is being made available for public use as an automated pipeline service for end-to-end cloud-edge collaborative intelligence deployment. To the best of our knowledge, there is no existing industry product that provides the capability of Deep Neural Network (DNN) splitting.

READ FULL TEXT
research
07/03/2020

CacheNet: A Model Caching Framework for Deep Learning Inference on the Edge

The success of deep neural networks (DNN) in machine perception applicat...
research
03/08/2023

KubeEdge-Sedna v0.3: Towards Next-Generation Automatically Customized AI Engineering Scheme

The scale of the global edge AI market continues to grow. The current te...
research
07/16/2022

A Survey on Collaborative DNN Inference for Edge Intelligence

With the vigorous development of artificial intelligence (AI), the intel...
research
11/30/2022

An Efficient Split Fine-tuning Framework for Edge and Cloud Collaborative Learning

To enable the pre-trained models to be fine-tuned with local data on edg...
research
03/24/2022

ACE: Towards Application-Centric Edge-Cloud Collaborative Intelligence

Intelligent applications based on machine learning are impacting many pa...
research
09/08/2021

From Cloud to Edge: A First Look at Public Edge Platforms

Public edge platforms have drawn increasing attention from both academia...
research
01/15/2019

AI Pipeline - bringing AI to you. End-to-end integration of data, algorithms and deployment tools

Next generation of embedded Information and Communication Technology (IC...

Please sign up or login with your details

Forgot password? Click here to reset