ALERT: Accurate Learning for Energy and Timeliness

10/31/2019
by   Chengcheng Wan, et al.
0

An increasing number of software applications incorporate runtime Deep Neural Networks (DNNs) to process sensor data and return inference results to humans. Effective deployment of DNNs in these interactive scenarios requires meeting latency and accuracy constraints while minimizing energy, a problem exacerbated by common system dynamics. Prior approaches handle dynamics through either (1) system-oblivious DNN adaptation, which adjusts DNN latency/accuracy tradeoffs, or (2) application-oblivious system adaptation, which adjusts resources to change latency/energy tradeoffs. In contrast, this paper improves on the state-of-the-art by coordinating application- and system-level adaptation. ALERT, our runtime scheduler, uses a probabilistic model to detect environmental volatility and then simultaneously select both a DNN and a system resource configuration to meet latency, accuracy, and energy constraints. We evaluate ALERT on CPU and GPU platforms for image and speech tasks in dynamic environments. ALERT's holistic approach achieves more than 13 reduction, and 27 the application or system level. Furthermore, ALERT incurs only 3 consumption and 2 perfect application and system knowledge.

READ FULL TEXT
research
10/31/2019

ALERT: Accurate Anytime Learning for Energy and Timeliness

An increasing number of software applications incorporate runtime Deep N...
research
01/23/2018

Flexible Deep Neural Network Processing

The recent success of Deep Neural Networks (DNNs) has drastically improv...
research
03/15/2012

Automatic Tuning of Interactive Perception Applications

Interactive applications incorporating high-data rate sensing and comput...
research
05/08/2021

Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms

Mobile and embedded platforms are increasingly required to efficiently e...
research
05/08/2021

Incremental Training and Group Convolution Pruning for Runtime DNN Performance Scaling on Heterogeneous Embedded Platforms

Inference for Deep Neural Networks is increasingly being executed locall...
research
12/05/2018

ECC: Energy-Constrained Deep Neural Network Compression via a Bilinear Regression Model

Many DNN-enabled vision applications constantly operate under severe ene...
research
02/03/2023

DynaMIX: Resource Optimization for DNN-Based Real-Time Applications on a Multi-Tasking System

As deep neural networks (DNNs) prove their importance and feasibility, m...

Please sign up or login with your details

Forgot password? Click here to reset