AGO: Boosting Mobile AI Inference Performance by Removing Constraints on Graph Optimization

12/02/2022
by Zhiying Xu, et al.

Traditional deep learning compilers rely on heuristics for subgraph generation, which impose extra constraints on graph optimization, e.g., each subgraph can contain at most one complex operator. In this paper, we propose AGO, a framework for graph optimization with arbitrary structures to boost the inference performance of deep models by removing such constraints. To create new optimization opportunities for complicated subgraphs, we propose intensive operator fusion, which can effectively stitch multiple complex operators together for better performance. Further, we design a graph partitioning scheme that allows an arbitrary structure for each subgraph while guaranteeing the acyclic property among all generated subgraphs. Additionally, to enable efficient performance tuning on complicated subgraphs, we devise a novel divide-and-conquer tuning mechanism to orchestrate different system components. Through extensive experiments on various neural networks and mobile devices, we show that our system can improve the inference performance by up to 3.3x when compared with state-of-the-art deep learning compilers.
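The acyclic property mentioned in the abstract can be illustrated with a small check. The sketch below is not AGO's partitioning scheme, which the abstract does not detail; it only shows, under assumed inputs, how one might verify that a given assignment of operators to subgraphs leaves the graph of subgraphs free of cycles. The function name `quotient_is_acyclic`, the operator names, and the two example partitions are all hypothetical.

```python
from collections import defaultdict, deque


def quotient_is_acyclic(edges, part):
    """Return True if the graph of subgraphs induced by `part` has no cycle.

    edges: iterable of (src, dst) operator pairs in the original DAG.
    part:  dict mapping each operator name to a subgraph id.
    """
    succ = defaultdict(set)
    indeg = {p: 0 for p in set(part.values())}
    for u, v in edges:
        pu, pv = part[u], part[v]
        # Edges inside one subgraph do not constrain the order of subgraphs.
        if pu != pv and pv not in succ[pu]:
            succ[pu].add(pv)
            indeg[pv] += 1

    # Kahn's algorithm: repeatedly remove subgraphs with no pending inputs.
    ready = deque(p for p, d in indeg.items() if d == 0)
    visited = 0
    while ready:
        p = ready.popleft()
        visited += 1
        for q in succ[p]:
            indeg[q] -= 1
            if indeg[q] == 0:
                ready.append(q)
    # Every subgraph is visited only if no cycle blocks the ordering.
    return visited == len(indeg)


# Hypothetical operator graph: conv1 -> relu -> conv2, plus a skip edge conv1 -> conv2.
edges = [("conv1", "relu"), ("relu", "conv2"), ("conv1", "conv2")]

# Grouping both convolutions while leaving relu outside makes the two
# subgraphs depend on each other.
bad = {"conv1": 0, "relu": 1, "conv2": 0}

# Grouping conv1 with relu keeps a valid execution order between subgraphs.
good = {"conv1": 0, "relu": 0, "conv2": 1}

print(quotient_is_acyclic(edges, bad))
print(quotient_is_acyclic(edges, good))
```

The `bad` partition shows why allowing several complex operators in one subgraph needs care: the grouped convolutions both feed and consume the `relu` subgraph, so neither subgraph could run first.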

research
04/09/2022

Efficient Representation Learning of Subgraphs by Subgraph-To-Node Translation

A subgraph is a data structure that can represent various real-world pro...
research
10/22/2022

ALT: Boosting Deep Learning Performance by Breaking the Wall between Graph and Operator Level Optimizations

Deep learning models rely on highly optimized tensor libraries for effic...
research
01/05/2022

Planted Dense Subgraphs in Dense Random Graphs Can Be Recovered using Graph-based Machine Learning

Multiple methods of finding the vertices belonging to a planted dense su...
research
01/01/2022

FamilySeer: Towards Optimized Tensor Codes by Exploiting Computation Subgraph Similarity

Deploying various deep learning (DL) models efficiently has boosted the ...
research
06/29/2018

Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation

Generating scene graph to describe all the relations inside an image gai...
research
10/12/2020

Graph Information Bottleneck for Subgraph Recognition

Given the input graph and its label/property, several key problems of gr...
research
12/18/2021

Improving Subgraph Recognition with Variational Graph Information Bottleneck

Subgraph recognition aims at discovering a compressed substructure of a ...