SCOPE: Safe Exploration for Dynamic Computer Systems Optimization

04/22/2022
by Hyunji Kim, et al.

Modern computer systems need to execute under strict safety constraints (e.g., a power limit), but doing so often conflicts with their ability to deliver high performance (i.e., minimal latency). Prior work uses machine learning to automatically tune hardware resources so that system execution meets safety constraints optimally. Such solutions monitor past system executions to learn the system's behavior under different hardware resource allocations, then dynamically tune resources to optimize application execution. However, system behavior can change significantly between different applications and even between different inputs of the same application. Hence, models learned from data collected a priori are often suboptimal and violate safety constraints when used with new applications and inputs. To address this limitation, we introduce the concept of an execution space, which is the cross product of hardware resources, input features, and applications. To dynamically and safely allocate hardware resources from the execution space, we present SCOPE, a resource manager that leverages a novel safe exploration framework. We evaluate SCOPE's ability to deliver improved latency while minimizing power constraint violations by dynamically configuring hardware while running a variety of Apache Spark applications. Compared to prior approaches that minimize power constraint violations, SCOPE consumes comparable power while improving latency by up to 9.5X. Compared to prior approaches that minimize latency, SCOPE achieves similar latency but reduces power constraint violation rates by up to 45.88X, achieving almost zero safety constraint violations across all applications.
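To make the execution space concrete, here is a minimal Python sketch that enumerates it as a cross product and keeps only the points under a power limit. Everything in it is an assumption for illustration: the hardware knobs, input sizes, application names, the `toy_power_watts` model, and the 70 W limit do not come from the paper. It is also not SCOPE's safe exploration framework. SCOPE explores at run time because power behavior is not known in advance, whereas this sketch assumes a known power function.

```python
from itertools import product

# Hypothetical values, chosen only for illustration; none come from the paper.
HARDWARE_CONFIGS = [
    {"cores": cores, "freq_ghz": freq}
    for cores in (4, 8)
    for freq in (1.2, 2.4)
]
INPUT_FEATURES = [{"input_gb": size} for size in (1, 10)]
APPLICATIONS = ["kmeans", "pagerank"]  # placeholder application names


def execution_space(hardware, inputs, apps):
    """Enumerate every (hardware, input, application) point in the cross product."""
    return list(product(hardware, inputs, apps))


def toy_power_watts(hw):
    """Made-up power model (assumption): power grows with cores times frequency."""
    return 10.0 + 5.0 * hw["cores"] * hw["freq_ghz"]


def safe_points(space, power_limit_watts):
    """Keep only the points whose modeled power stays within the limit."""
    return [
        point for point in space
        if toy_power_watts(point[0]) <= power_limit_watts
    ]


space = execution_space(HARDWARE_CONFIGS, INPUT_FEATURES, APPLICATIONS)
safe = safe_points(space, power_limit_watts=70.0)
print(f"{len(space)} points in the execution space, {len(safe)} within the limit")
```

The size of the space is the product of the three dimensions (here 4 x 2 x 2), which is why adding applications and input features to the hardware dimension makes exhaustive profiling ahead of time impractical.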


