Pliant: Leveraging Approximation to Improve Datacenter Resource Efficiency

04/12/2018
by   Neeraj Kulkarni, et al.
0

Cloud multi-tenancy is typically constrained to a single interactive service colocated with one or more batch, low-priority services, whose performance can be sacrificed when deemed necessary. Approximate computing applications offer the opportunity to enable tighter colocation among multiple applications whose performance is important. We present Pliant, a lightweight cloud runtime that leverages the ability of approximate computing applications to tolerate some loss in their output quality to boost the utilization of shared servers. During periods of high resource contention, Pliant employs incremental and interference-aware approximation to reduce contention in shared resources, and prevent QoS violations for co-scheduled interactive, latency-critical services. We evaluate Pliant across different interactive and approximate computing applications, and show that it preserves QoS for all co-scheduled workloads, while incurring a 2.1% loss in output quality, on average.

READ FULL TEXT

page 4

page 8

page 9

page 10

research
05/27/2021

Sinan: Data-Driven, QoS-Aware Cluster Management for Microservices

Cloud applications are increasingly shifting from large monolithic servi...
research
08/31/2020

A Self-adaptive Approach for Managing Applications and Harnessing Renewable Energy for Sustainable Cloud Computing

Rapid adoption of Cloud computing for hosting services and its success i...
research
04/24/2018

Seer: Leveraging Big Data to Navigate the Increasing Complexity of Cloud Debugging

Performance unpredictability in cloud services leads to poor user experi...
research
08/29/2021

Leveraging Transprecision Computing for Machine Vision Applications at the Edge

Machine vision tasks present challenges for resource constrained edge de...
research
12/02/2019

MORPHOSYS: Efficient Colocation of QoS-Constrained Workloads in the Cloud

In hosting environments such as IaaS clouds, desirable application perfo...
research
11/26/2019

Intelligent Resource Scheduling for Co-located Latency-critical Services: A Multi-Model Collaborative Learning Approach

Latency-critical services have been widely deployed in cloud environment...
research
11/17/2020

AXES: Approximation Manager for Emerging Memory Architectures

Memory approximation techniques are commonly limited in scope, targeting...

Please sign up or login with your details

Forgot password? Click here to reset