RESPECT: Reinforcement Learning based Edge Scheduling on Pipelined Coral Edge TPUs

04/10/2023
by   Jiaqi Yin, et al.
0

Deep neural networks (DNNs) have substantial computational and memory requirements, and the compilation of its computational graphs has a great impact on the performance of resource-constrained (e.g., computation, I/O, and memory-bound) edge computing systems. While efficient execution of their computational graph requires an effective scheduling algorithm, generating the optimal scheduling solution is a challenging NP-hard problem. Furthermore, the complexity of scheduling DNN computational graphs will further increase on pipelined multi-core systems considering memory communication cost, as well as the increasing size of DNNs. Using the synthetic graph for the training dataset, this work presents a reinforcement learning (RL) based scheduling framework RESPECT, which learns the behaviors of optimal optimization algorithms and generates near-optimal scheduling results with short solving runtime overhead. Our framework has demonstrated up to ∼2.5× real-world on-chip inference runtime speedups over the commercial compiler with ten popular ImageNet models deployed on the physical Coral Edge TPUs system. Moreover, compared to the exact optimization methods, the proposed RL scheduling improves the scheduling optimization runtime by up to 683× speedups compared to the commercial compiler and matches the exact optimal solutions with up to 930× speedups. Finally, we perform a comprehensive generalizability test, which demonstrates RESPECT successfully imitates optimal solving behaviors from small synthetic graphs to large real-world DNNs computational graphs.

READ FULL TEXT

page 3

page 4

page 5

research
08/19/2023

Accelerating Exact Combinatorial Optimization via RL-based Initialization – A Case Study in Scheduling

Scheduling on dataflow graphs (also known as computation graphs) is an N...
research
11/28/2021

Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU

With the fast development of deep neural networks (DNNs), many real-worl...
research
01/17/2023

Robust Scheduling with GFlowNets

Finding the best way to schedule operations in a computation graph is a ...
research
01/31/2023

Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning

Many real-time applications (e.g., Augmented/Virtual Reality, cognitive ...
research
08/13/2019

Reinforcement Learning based Interconnection Routing for Adaptive Traffic Optimization

Applying Machine Learning (ML) techniques to design and optimize compute...
research
09/02/2021

Agon: A Scalable Competitive Scheduler for Large Heterogeneous Systems

This work proposes a competitive scheduling approach, designed to scale ...
research
06/11/2019

Automatic Model Parallelism for Deep Neural Networks with Compiler and Hardware Support

The deep neural networks (DNNs) have been enormously successful in tasks...

Please sign up or login with your details

Forgot password? Click here to reset