SEIFER: Scalable Edge Inference for Deep Neural Networks

10/21/2022
by   Arjun Parthasarathy, et al.

Edge inference is becoming ever more prevalent, with applications ranging from retail to wearable technology. Clusters of networked, resource-constrained edge devices are becoming common, yet there is no production-ready orchestration system for deploying deep learning models over such edge networks that adopts the robustness and scalability of the cloud. We present SEIFER, a framework that uses a standalone Kubernetes cluster to partition a given DNN and place these partitions in a distributed manner across an edge network, with the goal of maximizing inference throughput. The system is node fault-tolerant and automatically updates deployments when the model's version changes. We provide a preliminary evaluation of a partitioning and placement algorithm that works within this framework, and show that we can improve the inference pipeline throughput by 200% by utilizing sufficient numbers of resource-constrained nodes. We have implemented SEIFER in open-source software that is publicly available to the research community.
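
To make the partition-for-throughput idea concrete, here is a minimal Python sketch. It is not SEIFER's algorithm: it assumes a simplified model in which the DNN is a linear chain of layers, each layer has a single known cost, and the throughput of the pipeline is limited by its slowest stage. The function name `partition_chain` and the cost values are hypothetical, chosen only for illustration.

```python
from functools import lru_cache


def partition_chain(layer_costs, num_nodes):
    """Split a chain of layers into at most num_nodes contiguous stages.

    Returns (bottleneck, cuts): the cost of the slowest stage and the layer
    indices at which a new stage starts. Assumption: pipeline throughput is
    roughly 1 / bottleneck, so minimizing the bottleneck maximizes throughput.
    """
    n = len(layer_costs)
    # prefix[i] holds the total cost of layers 0..i-1, for O(1) range sums.
    prefix = [0.0]
    for cost in layer_costs:
        prefix.append(prefix[-1] + cost)

    @lru_cache(maxsize=None)
    def best(start, stages_left):
        # Baseline: put every remaining layer into a single stage.
        result = (prefix[n] - prefix[start], ())
        if stages_left == 1:
            return result
        # Otherwise try each position for the first cut and recurse.
        for cut in range(start + 1, n):
            head = prefix[cut] - prefix[start]
            tail_bottleneck, tail_cuts = best(cut, stages_left - 1)
            candidate = (max(head, tail_bottleneck), (cut,) + tail_cuts)
            if candidate[0] < result[0]:
                result = candidate
        return result

    bottleneck, cuts = best(0, num_nodes)
    return bottleneck, list(cuts)


if __name__ == "__main__":
    costs = [4, 2, 3, 1, 5, 5]  # hypothetical per-layer costs
    for nodes in (1, 2, 3):
        bottleneck, cuts = partition_chain(costs, nodes)
        print(nodes, "node(s): bottleneck", bottleneck, "cuts", cuts)
```

Under this simplified model, adding nodes helps only while it lowers the cost of the slowest stage. A real deployment would also need to account for per-node memory limits and the cost of transferring activations between nodes, neither of which is modeled here.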

