Analyzing Query Performance and Attributing Blame for Contentions in a Cluster Computing Framework

08/28/2017
by   Prajakta Kalmegh, et al.
0

Analyzing contention for resources in a cluster computing environment accurately is critical in order to understand the performance interferences faced by a query due to concurrent query executions, and to better manage the workload in the cluster. Today no tools exist to help an admin perform a deep analysis of resource contentions taking into account the complex interactions among different queries, their stages, and tasks in a shared cluster. In this paper, we present ProtoXplore - a Proto or first system to eXplore the interactions between concurrent queries in a shared cluster. We construct a multi-level directed acyclic graph called ProtoGraph to formally capture different types of explanations that link the performance of concurrent queries. In particular, (a) we designate the components of a query's lost (wait) time as Immediate Explanations towards its observed performance, (b) represent the rate of contention per machine as Deep Explanations, and (c) assign responsibility to concurrent queries through Blame Explanations. We develop new metrics to accurately quantify the impact and distribute the blame among concurrent queries. We perform an extensive experimental evaluation using ProtoXplore to analyze the query interactions of TPCDS queries on Apache Spark using microbenchmarks illustrating the effectiveness of our approach, and illustrate how the output from ProtoXplore can be used by alternate scheduling and task placement strategies to help improve the performance of affected queries in recurring executions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/25/2016

Quegel: A General-Purpose Query-Centric Framework for Querying Big Graphs

Pioneered by Google's Pregel, many distributed systems have been develop...
research
12/16/2021

Predictive Price-Performance Optimization for Serverless Query Processing

We present an efficient, parametric modeling framework for predictive re...
research
10/08/2020

Improving Attention Mechanism with Query-Value Interaction

Attention mechanism has played critical roles in various state-of-the-ar...
research
05/18/2020

A Comparative Exploration of ML Techniques for Tuning Query Degree of Parallelism

There is a large body of recent work applying machine learning (ML) tech...
research
06/04/2022

Distributed processing of continuous range queries over moving objects

Monitoring range queries over moving objects is essential to extensive l...
research
09/23/2022

Concurrent Graph Queries on the Lucata Pathfinder

High-performance analysis of unstructured data like graphs now is critic...
research
08/20/2020

Towards Inferring Queries from Simple and Partial Provenance Examples

The field of query-by-example aims at inferring queries from output exam...

Please sign up or login with your details

Forgot password? Click here to reset