Adaptive ResNet Architecture for Distributed Inference in Resource-Constrained IoT Systems

07/21/2023
by Fazeela Mazhar Khan, et al.

As deep neural networks grow larger and more complex, most edge devices can no longer meet their processing requirements. Distributed inference addresses this by partitioning the neural network across a cluster of nodes. However, distribution can introduce additional energy consumption and creates dependencies among devices that suffer from unstable transmission rates. Unstable transmission rates harm the real-time performance of IoT devices, causing high latency, high energy usage, and potential failures. Hence, dynamic systems need a resilient DNN with an adaptive architecture that can downsize according to the available resources. This paper presents an empirical study that identifies the connections in ResNet that can be dropped without significantly degrading the model's performance, so that distribution remains possible when resources are scarce. Based on the results, a multi-objective optimization problem is formulated to minimize latency and maximize accuracy subject to the available resources. Our experiments demonstrate that an adaptive ResNet architecture can reduce shared data, energy consumption, and latency throughout the distribution while maintaining high accuracy.
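The trade-off described above can be illustrated with a minimal sketch. The abstract does not give the paper's actual formulation, so this example substitutes a simple weighted-sum scalarization: among candidate ResNet variants that fit a bandwidth budget, it picks the one with the best balance of accuracy and normalized latency. The `Config` type, the `pick_config` function, the `weight` parameter, and every number in `CANDIDATES` are hypothetical placeholders, not results from the paper.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Config:
    """One hypothetical ResNet variant (all fields are illustrative)."""
    name: str
    accuracy: float    # fraction in [0, 1]
    latency_ms: float  # end-to-end inference latency
    shared_kb: float   # data exchanged between nodes per inference


def pick_config(configs: List[Config], bandwidth_kb: float,
                weight: float = 0.5) -> Optional[Config]:
    """Weighted-sum scalarization: reward accuracy, penalize latency.

    Only variants whose shared data fits the bandwidth budget are
    considered. Returns None when no variant is feasible.
    """
    feasible = [c for c in configs if c.shared_kb <= bandwidth_kb]
    if not feasible:
        return None
    # Normalize latency so both objectives lie on a comparable scale.
    max_latency = max(c.latency_ms for c in feasible)

    def score(c: Config) -> float:
        return weight * c.accuracy - (1.0 - weight) * c.latency_ms / max_latency

    return max(feasible, key=score)


# Made-up values for illustration only; not measurements from the paper.
CANDIDATES = [
    Config("full", 0.93, 120.0, 512.0),
    Config("drop_some_skips", 0.91, 80.0, 256.0),
    Config("drop_most_skips", 0.84, 50.0, 96.0),
]

if __name__ == "__main__":
    choice = pick_config(CANDIDATES, bandwidth_kb=300.0, weight=1.0)
    print(choice.name if choice else "no feasible configuration")
```

Setting `weight` to 1.0 selects purely on accuracy and 0.0 purely on latency; a real system would re-run the selection whenever the measured transmission rate changes.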

