Modeling Performance and Energy trade-offs in Online Data-Intensive Applications

08/18/2021
by   Ajay Badita, et al.
0

We consider energy minimization for data-intensive applications run on large number of servers, for given performance guarantees. We consider a system, where each incoming application is sent to a set of servers, and is considered to be completed if a subset of them finish serving it. We consider a simple case when each server core has two speed levels, where the higher speed can be achieved by higher power for each core independently. The core selects one of the two speeds probabilistically for each incoming application request. We model arrival of application requests by a Poisson process, and random service time at the server with independent exponential random variables. Our model and analysis generalizes to today's state-of-the-art in CPU energy management where each core can independently select a speed level from a set of supported speeds and corresponding voltages. The performance metrics under consideration are the mean number of applications in the system and the average energy expenditure. We first provide a tight approximation to study this previously intractable problem and derive closed form approximate expressions for the performance metrics when service times are exponentially distributed. Next, we study the trade-off between the approximate mean number of applications and energy expenditure in terms of the switching probability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2021

On the Age of Information of a Queuing System with Heterogeneous Servers

An optimal control problem with heterogeneous servers to minimize the av...
research
12/13/2019

Queueing Analysis of GPU-Based Inference Servers with Dynamic Batching: A Closed-Form Characterization

GPU-accelerated computing is a key technology to realize high-speed infe...
research
06/14/2021

Age of Information for Multiple-Source Multiple-Server Networks

Having timely and fresh knowledge about the current state of information...
research
12/20/2019

A QoS-aware workload routing and server speed scaling policy for energy-efficient data centers: a robust queueing theoretic approach

Maintaining energy efficiency in large data centers depends on the abili...
research
02/10/2021

Adaptive Processor Frequency Adjustment for Mobile Edge Computing with Intermittent Energy Supply

With astonishing speed, bandwidth, and scale, Mobile Edge Computing (MEC...
research
12/08/2022

Age of Information with On-Off Service

This paper considers a communication system where a source sends time-se...
research
12/02/2021

A Foreground-Background queueing model with speed or capacity modulation

The models studied in the steady state involve two queues which are serv...

Please sign up or login with your details

Forgot password? Click here to reset