Jarvis: Large-scale Server Monitoring with Adaptive Near-data Processing

02/12/2022
by   Atul Sandur, et al.
0

Rapid detection and mitigation of issues that impact performance and reliability is paramount for large-scale online services. For real-time detection of such issues, datacenter operators use a stream processor and analyze streams of monitoring data collected from servers (referred to as data source nodes) and their hosted services. The timely processing of incoming streams requires the network to transfer massive amounts of data, and significant compute resources to process it. These factors often create bottlenecks for stream analytics. To help overcome these bottlenecks, current monitoring systems employ near-data processing by either computing an optimal query partition based on a cost model or using model-agnostic heuristics. Optimal partitioning is computationally expensive, while model-agnostic heuristics are iterative and search over a large solution space. We combine these approaches by using model-agnostic heuristics to improve the partitioning solution from a model-based heuristic. Moreover, current systems use operator-level partitioning: if a data source does not have sufficient resources to execute an operator on all records, the operator is executed only on the stream processor. Instead, we perform data-level partitioning, i.e., we allow an operator to be executed both on a stream processor and data sources. We implement our algorithm in a system called Jarvis, which enables quick adaptation to dynamic resource conditions. Our evaluation on a diverse set of monitoring workloads suggests that Jarvis converges to a stable query partition within seconds of a change in node resource conditions. Compared to current partitioning strategies, Jarvis handles up to 75 improving throughput in resource-constrained scenarios by 1.2-4.4x.

READ FULL TEXT
research
04/21/2010

Multi-Criteria Evaluation of Partitioning Schemes for Real-Time Systems

In this paper we study the partitioning approach for multiprocessor real...
research
02/28/2022

Stream Containers for Resource-oriented RDF Stream Processing

We introduce Stream Containers inspired by the Linked Data Platform as a...
research
08/07/2021

Building Analytics Pipelines for Querying Big Streams and Data Histories with H-STREAM

This paper introduces H-STREAM, a big stream/data processing pipelines e...
research
10/29/2021

SDP: Scalable Real-time Dynamic Graph Partitioner

Time-evolving large graph has received attention due to their participat...
research
04/04/2023

Diba: A Re-configurable Stream Processor

Stream processing acceleration is driven by the continuously increasing ...
research
05/31/2021

System-aware dynamic partitioning for batch and streaming workloads

When processing data streams with highly skewed and nonstationary key di...
research
03/01/2023

On the Semantic Overlap of Operators in Stream Processing Engines

Stream processing is extensively used in the IoT-to-Cloud spectrum to di...

Please sign up or login with your details

Forgot password? Click here to reset