TinyIREE: An ML Execution Environment for Embedded Systems from Compilation to Deployment

05/28/2022
by   Hsin-I Cindy Liu, et al.
0

Machine learning model deployment for training and execution has been an important topic for industry and academic research in the last decade. Much of the attention has been focused on developing specific toolchains to support acceleration hardware. In this paper, we present IREE, a unified compiler and runtime stack with the explicit goal to scale down machine learning programs to the smallest footprints for mobile and edge devices, while maintaining the ability to scale up to larger deployment targets. IREE adopts a compiler-based approach and optimizes for heterogeneous hardware accelerators through the use of the MLIR compiler infrastructure which provides the means to quickly design and implement multi-level compiler intermediate representations (IR). More specifically, this paper is focused on TinyIREE, which is a set of deployment options in IREE that accommodate the limited memory and computation resources in embedded systems and bare-metal platforms, while also demonstrating IREE's intuitive workflow that generates workloads for different ISA extensions and ABIs through LLVM.

READ FULL TEXT
research
11/02/2022

Edge Impulse: An MLOps Platform for Tiny Machine Learning

Edge Impulse is a cloud-based machine learning operations (MLOps) platfo...
research
03/09/2021

DISC: A Dynamic Shape Compiler for Machine Learning Workloads

Many recent machine learning models show dynamic shape characteristics. ...
research
02/27/2021

HIR: An MLIR-based Intermediate Representation for Hardware Accelerator Description

The emergence of machine learning, image and audio processing on edge de...
research
09/26/2018

Relay: A New IR for Machine Learning Frameworks

Machine learning powers diverse services in industry including search, t...
research
05/18/2022

Confidential Machine Learning within Graphcore IPUs

We present IPU Trusted Extensions (ITX), a set of experimental hardware ...
research
01/19/2018

A Collective Knowledge workflow for collaborative research into multi-objective autotuning and machine learning techniques

Developing efficient software and hardware has never been harder whether...
research
04/15/2022

CEDR – A Compiler-integrated, Extensible DSSoC Runtime

In this work, we present CEDR, a Compiler-integrated, Extensible Domain ...

Please sign up or login with your details

Forgot password? Click here to reset